Ollama

Mentions: 448 | Type: Organization | Page 2 of 23

// recent coverage 448 mentions

11:16
2026-07-01
byteiota.com
ai-tools

OpenCode at 160K Stars: The Model-Agnostic Coding Agent

OpenCode, a model-agnostic coding agent, crossed 160,000 GitHub stars and 7.5 million monthly active developers in June 2026 by refusing to lock users into a single AI model. Unlike competitors Claude…

09:39
2026-07-01
ianwootten.co.uk
ai-tools

Ditching Claude for OpenCode and OpenRouter

Developer Ian Wootten replaced his Claude subscription with open-source models accessed via OpenRouter and the OpenCode harness, citing cost, flexibility, and a desire to avoid vendor lock-in. He buil…

07:12
2026-07-01
dev.to
developer-tools

Never Write a README.md from Scratch Again — Meet readmeai

A developer built readmeai, a Python CLI that scans project structure and config files, then uses an LLM to generate a complete README.md with badges, install instructions, and usage examples. The too…

06:50
2026-07-01
github.com
large-language-models

Ragit – chat with any folder of documents using a local LLM

Ragit, a new command-line tool, enables users to chat with any folder of documents using a local LLM via Ollama. It implements Retrieval-Augmented Generation (RAG) by indexing files, creating embeddin…

00:22
2026-07-01
ollama.com
large-language-models

Faster Gemma 4 on MLX with multi-token prediction

Gemma 4 in Ollama 0.31 generates tokens nearly 90% faster on Apple Silicon using multi-token prediction (MTP), which employs a small draft model to propose multiple tokens that the main model verifies…

15:03
2026-06-30
dev.to
large-language-models

RAG for Code: Why Chunking by Function Beats Chunking by Lines

A developer built a retrieval-augmented generation (RAG) system for code and found that chunking by function boundaries dramatically outperformed line-based chunking. By using a parser to extract comp…

13:44
2026-06-30
aimultiple.com
ai-products

DGX Spark vs. Mac Studio and Halo

NVIDIA's DGX Spark, a $4,699 desktop AI supercomputer with 128GB unified memory, launched in 2025, offering one petaflop of FP4 performance. Benchmarks show it excels at prompt processing but lags in …

13:06
2026-06-30
blog.devgenius.io
machine-learning

Vector and Embeddings : 101

A developer explains that vector embeddings convert text meaning into geometric coordinates, enabling similarity search via cosine distance, but warns that this lossy compression can silently cause re…

12:01
2026-06-30
dev.to
artificial-intelligence

Building Your Own AI News Digest: A Developer’s Tutorial

A developer built a personalized AI news digest using Python, NewsAPI, and an LLM. The system fetches articles, filters them by relevance, and generates one-sentence summaries for a scannable daily di…

08:23
2026-06-30
xda-developers.com
large-language-models

Good article about local LLM on MacBook Air

Ollama's new MLX engine enables local LLM inference on MacBook Air at twice the speed, making powerful AI accessible on consumer hardware without cloud dependency.…

05:42
2026-06-30
dev.to
artificial-intelligence

Stop Building OpenAI Wrappers: How to Build Defensible AI Apps

A developer argues that 90% of AI startups launched last year were thin wrappers over LLM APIs and failed when providers released native features. To build defensible AI apps, the developer recommends…

00:43
2026-06-30
dev.to
developer-tools

I Built Byte Because OpenWebUI Kept Breaking

A developer built Byte, a native macOS app that runs free local AI models via Ollama or supports API keys for Claude, ChatGPT, and Gemini, after finding OpenWebUI unstable and resource-heavy. Byte aim…
