18:03
2026-06-16
lusob.github.io
machine-learning
Embeddings is all you need
A new in-browser voice-to-action system uses a tiny embedding model (MiniLM-L6-v2) to classify intents via cosine similarity, achieving sub-50ms latency without any server or large language model. The…
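To make the cosine-similarity step concrete, here is a minimal sketch of how intent matching over embeddings might look. It is not the linked project's code: the `Intent` type, the `classifyIntent` helper, the 0.5 threshold, and the 3-dimensional vectors are all hypothetical stand-ins (real MiniLM-L6-v2 embeddings have 384 dimensions and would come from the model, which is not loaded here).

```typescript
// Hypothetical sketch: pick the intent whose embedding is closest to the query.
// All vectors are made-up 3-d stand-ins for real 384-d sentence embeddings.

type Intent = { name: string; embedding: number[] };
type Match = { name: string | null; score: number };

// Cosine similarity: dot(a, b) / (|a| * |b|); returns 0 for a zero vector.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the best-scoring intent, or name = null when nothing clears
// the threshold (the threshold value is an assumption, not from the source).
function classifyIntent(query: number[], intents: Intent[], threshold = 0.5): Match {
  let best: Match = { name: null, score: -Infinity };
  for (const intent of intents) {
    const score = cosineSimilarity(query, intent.embedding);
    if (score > best.score) best = { name: intent.name, score };
  }
  return best.score >= threshold ? best : { name: null, score: best.score };
}

// Hypothetical intent table; in practice each embedding would be computed
// once from example phrases and cached.
const intents: Intent[] = [
  { name: "lights_on", embedding: [0.9, 0.1, 0.0] },
  { name: "play_music", embedding: [0.0, 0.2, 0.9] },
];

const result = classifyIntent([0.8, 0.2, 0.1], intents);
console.log(result.name, result.score.toFixed(3));
```

Because the intent embeddings can be precomputed, each query costs one embedding pass plus a handful of dot products, which is consistent with the low latency the summary reports.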