
GGUF

mentions: 32 · type: Organization

// recent coverage 32 mentions

10:13
2026-07-01
github.com
artificial-intelligence

Transcribe.cpp – ggml speech-to-text inference engine

Transcribe.cpp, a C/C++ speech-to-text inference library, has been released supporting 16 model families and 60+ variants via GGUF models on the ggml runtime. It offers Metal, Vulkan, and CUDA backend…

20:06
2026-06-25
dev.to
large-language-models

Your Local LLM Is Not as Private as You Think

Cyera Research disclosed a critical vulnerability in Ollama, a popular tool for running large language models locally. Tracked as CVE-2026-7482 with a CVSS score of 9.1, the flaw allows attackers to l…

17:36
2026-06-25
devclubhouse.com
large-language-models

Quantize and Run Llama 3.2 on Apple Silicon with llama.cpp

Mariana Souza published a tutorial on quantizing and running Meta's Llama 3.2 3B model on Apple Silicon using llama.cpp with Metal GPU acceleration, achieving local inference with Q4_K_M quantization.…

09:46
2026-06-24
github.com
artificial-intelligence

GELab-Zero: Android automation framework for multimodal LLMs

GELab-Zero, an open-source Android automation framework for multimodal LLMs, has been released, featuring a 4B GUI agent model and plug-and-play engineering infrastructure with no cloud dependencies. …

10:14
2026-06-20
dev.to
ai-safety

Cool AI Projects That Failed: The File Integrity Gap

A developer identified a recurring failure mode in local AI projects: teams assume model artifacts like .gguf and .safetensors files are self-documenting and safe to consume without inspection. To add…

23:15
2026-06-18
unsloth.ai
artificial-intelligence

Unsloth: Easily run and train models locally

Unsloth launched Unsloth Studio, a desktop application for Mac and Windows that runs AI models offline, supporting GGUF and Safetensors formats with tool-calling, web search, and an OpenAI-compatible …

06:55
2026-06-17
github.com
large-language-models

Native Inference Engine for macOS 14 or newer

Embershard, a macOS chat app with its own LLM inference engine, has been released in beta v0.1.1 for Apple Silicon devices running macOS 14 or newer. The app bypasses llama.cpp for inference, instead …

23:50
2026-06-13
llama-cpp.com
large-language-models

Llama.cpp – Run LLM Inference in C/C++

Llama.cpp is an open-source C/C++ library that enables running large language model inference locally on consumer hardware, supporting multiple platforms and GPU backends. It automatically optimizes e…

14:58
2026-06-06
vettedconsumer.com
large-language-models

GGUF vs. GPTQ vs. AWQ: The Plain-English Guide to LLM Quantization

GGUF, GPTQ, and AWQ are the three dominant formats for running quantized large language models locally, each optimized for different hardware and use cases. GGUF, the format used by llama.cpp and its …

03:22
2026-06-06
dev.to
large-language-models

Run Gemma-4 12B on WSL2 with llama.cpp

A developer has published a guide for running Google's Gemma-4 12B instruction-tuned model on Windows Subsystem for Linux 2 (WSL2) using the llama.cpp framework. The process involves installing build …
