"The cost of running LLMs is just too damn high"

SuspiciousCarrot78@aussie.zone · edit-2 2 days ago

"The cost of running LLMs is just too damn high"

SuspiciousCarrot78@aussie.zone · 1 day ago

I hear you; I’m not wildly enamored with reddit either…but that convo is a good springboard.

I see almost everyone chasing bigger GPUs, more parameters, more more more. I figure when 9 people say “go right”, there should be at least someone that can make the plausible case for “actually, here’s why go left works”.

Eg: I think there should be some discussion about watts per token vs tokens per second.

I’m still re-writing the FAQ for my project - when it’s done (and if there’s interest) I will post it here.