Avanika Narayan, Dan Biderman, and Sabri Eyuboglu from Christopher Ré’s Stanford Hazy Research lab, along with Avner May, Scott Linderman, James Zou, have developed a way to shift a substantial portion of LLM workloads to consumer devices by having small on-device models (such as Llama 3.2 with Ollama) collaborate with larger models in the cloud (such as GPT-4o).
Category
Tags

Comments are closed

Ollama.Cloud - Free AI Services & Tools |  Image Creator |  Background Remover |  Apps Developer |  Ollama Manager
>