ANEMLL

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

05:17

2026-06-04

github.com

large-language-models

Show HN: iPhone ANE holds LLM tok/s while MLX and LiteRT thermal-throttle

A new open-source benchmark, "apple-silicon-llm-bench," reveals that Google's LiteRT-LM runtime outperforms MLX-Swift on the iPhone 17 Pro for Gemma 4 E2B inference, achieving 55.4 tok/s with 4.5x les…

// co-occurs with top 7 entities

Apple 1 Google 1 MLX 1 LiteRT 1 llama.cpp 1 CoreML 1 Gemma 1

// topics top 5 topics

large language models 1 machine learning 1 artificial intelligence 1 ai infrastructure 1 ai chips 1