cross-posted from: https://programming.dev/post/51407459
Check what you can use and at what rate, in tokens per second, it would run… It has examples of many models and quantization levels. Huge resource!
What has this to do with degoogling?
Gemini is one of the most used LLMs. This shows alternatives.



