Brian Lovin
/
Hacker News
MLC-LLM: GPT/Llama on consumer-class GPUs and phones