cd/entity/Ollama· home entities Ollama
grep -l @ollama /news/*.json | wc -l → 448

Ollama

mentions 448 type Organization page 1/23 feed RSS

// recent coverage 448 mentions

10:32
2026-07-04
dev.to
large-language-models

Solving the GPU Pinning Saga and Gemma's Meta-Commentary

Glad Labs fixed a GPU pinning issue where LiteLLM 1.89.2's global api_base override prevented per-model routing, causing vision tasks to cold-load onto the wrong GPU. The team also hardened content gu…

09:55
2026-07-04
dev.to
large-language-models

Scaling LLMs: Why Deterministic Hashing Isn't Enough

A developer built a Go library for semantic LLM caching that combines deterministic hashing with vector similarity search to reduce costs from repeated but differently worded queries. The library supp…

09:00
2026-07-04
letsdatascience.com
ai-tools

PewDiePie Releases Open-Source Odysseus AI Workspace

Felix Kjellberg (PewDiePie) released Odysseus, an open-source, self-hosted AI workspace bundling chat, agents, research, and local model workflows under an AGPL-3.0 license. The project, launched in M…

06:59
2026-07-04
maloyan.xyz
large-language-models

Running Qwen 3.6 Locally on a Mac Mini M4 with 16GB RAM

Qwen open-sourced the 35-billion parameter Mixture of Experts model Qwen 3.6-35B-A3B, which activates only 3 billion parameters per token and runs on a $599 Mac Mini M4 with 16GB RAM at 17 tok/s with …

06:59
2026-07-04
loomcycle.dev
ai-infrastructure

Budgets, costs, and encrypted credentials (v1.9.0 to v1.11.1)

Loomcycle released versions 1.9.0 to 1.11.1 with four major arcs: a security hardening pass closing 17 findings, a new CredentialDef system for encrypted per-tenant secrets using AES-256-GCM, cost att…

06:59
2026-07-04
dev.to
developer-tools

OrinIDE v1.0.8 is here and it's a whole vibe upgrade 🚀

OrinIDE v1.0.8, an AI-powered code editor that runs entirely in the browser without cloud accounts or subscriptions, now supports offline AI models via Ollama and introduces a 4-agent workflow for pla…

18:20
2026-07-02
jacob.gold
ai-safety

Coding Models Are Code

A security researcher warns that coding models should be treated as executable code, as they can generate malicious tool calls that exfiltrate environment variables or introduce subtle vulnerabilities…

06:07
2026-07-02
firethering.com
ai-tools

OmniRoute: Connect All AI Models & Providers Through One API

OmniRoute, an open-source AI gateway, has released version 3.8.42, enabling developers to connect to over 236 AI providers through a single API endpoint. The tool offers automatic failover, token comp…

18:28
2026-07-01
blog.alexewerlof.com
large-language-models

Sampling args in llama-server

Llama.cpp users can significantly improve inference speed and output quality by tuning sampling parameters such as temperature, TopP, MinP, TopK, repeat penalty, DRY, XTC, Dynatemp, Adaptive-P, and Mi…

page 1 / 23 next →
// co-occurs with top 8 entities