Exllama V2 released! Available in Ooba! Big speed upgrades!

noneabove1182@sh.itjust.works · edit-2 1 year ago

Exllama V2 released! Available in Ooba! Big speed upgrades!

kelvie@lemmy.ca · 1 year ago

Why’d you create your own dockerfile repo vs just improving/changing the one in the main ooba repo?

noneabove1182@sh.itjust.works · 1 year ago

Good question, at the time I made it there wasn’t a good option, and the one in the main repo is very comprehensive and overwhelming, I wanted to make one that was straight forward and easier to digest to see what’s actually happening

Exllama V2 released! Available in Ooba! Big speed upgrades!

Exllama V2 released! Available in Ooba! Big speed upgrades!

GitHub - turboderp/exllamav2: A fast inference library for running LLMs locally on modern consumer-class GPUs

Exllama v1

Exllama v2