cd/entity/MLX· home entities MLX
grep -l @mlx /news/*.json | wc -l → 9

@MLX

mentions 9 type Organization feed RSS
09:55
2026-06-05
letsdatascience.com
large-language-models

Google LiteRT-LM Accelerates Gemma 4 Local Inference

Google added native support for Gemma 4 Multi-Token Prediction (MTP) to LiteRT-LM, its on-device LLM runtime built on LiteRT (formerly TensorFlow Lite). Google reports the integration yields MTP decod…

04:09
2026-06-05
magenta.withgoogle.com
generative-ai

Magenta RealTime 2: Open and Local Live Music Models

Google released Magenta RealTime 2, an open-weights AI model and real-time inference engine that enables live, low-latency music generation on a MacBook. The model responds instantly to MIDI, audio, a…

16:19
2026-05-29
liquid.ai
large-language-models

Liquid AI reveals 8B-A1B MoE trained on 38T

Liquid AI released LFM2.5-8B-A1B, an edge model designed for fast tool calling on consumer hardware, with a 128K context window and pretraining scaled to 38 trillion tokens. The model, available on Hu…

20:54
2026-05-18
dev.to
artificial-intelligence

Codex Chronicle was paying for every frame.

OpenAI's cloud-based Chronicle service, which incurred per-frame costs for analyzing screen captures, with a local Gemma 4 E4B 4-bit MLX model running on a $599 Mac mini. This new setup processes vide…

22:48
2026-04-21
gist.github.com
large-language-models

Running Claude Code with a local LLM

The article provides instructions for running Claude Code using a local large language model (LLM) instead of Anthropic's cloud-based models. It recommends downloading specific quantized Qwen3.6 model…

// co-occurs with top 8 entities