Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
So far, running LLMs has required a large amount of computing resources, mainly GPUs. Running locally, a simple prompt with a typical LLM takes on an average Mac ...