Gemma locally on iOS, Android, web browsers, and GPUs with a single framework
old.reddit.com2b model running at 20tok/sec on iphone, nice potential for future applications
2b model running at 20tok/sec on iphone, nice potential for future applications