grep -l @hugging face /news/*.json | wc -l → 21

@Hugging Face

Hugging Face is an AI community platform and company providing a hub for open-source machine learning models, datasets, and demo spaces. It hosts over 500,000 models and is widely used by the AI research community.

18:48
2026-05-23
dev.to
large-language-models

GGUF & Modelfile: The Power User's Guide to Local LLMs

The article explains how power users can download GGUF (GPT-Generated Unified Format) model files directly from Hugging Face, quantize them (using Q4_K_M as the optimal balance of size and quality), a…

17:44
2026-05-23
dev.to
artificial-intelligence

Gemma 4 on Android: Tricks for Faster On-Device Inference

The article provides technical guidance for optimizing on-device inference of the Gemma 4 E2B model on Android using the LiteRT-LM library, highlighting that…

17:23
2026-05-23
dev.to
artificial-intelligence

The Subconscious Powered by Edge AI

Remora is a privacy-first "Subconscious Social Network" app that uses Google's Gemma 4 AI model to analyze dreams entirely on a user's smartphone, ensuring no raw dream data is uploaded to the cloud. …

03:39
2026-05-23
dev.to
large-language-models

BeeLlama v0.2.0: 164 tok/s on a 27B model, one RTX 3090

BeeLlama v0.2.0 demonstrates that speculative decoding can achieve a 4.4x to 4.93x throughput multiplier on a single RTX 3090, running 27B and 31B parameter models at 37-36 tokens per second baseline …

07:14
2026-05-20
dev.to
large-language-models

I Thought Fine-Tuning LLMs Needed Expensive GPUs. I Was Wrong.

The author successfully fine-tuned a 1.1 billion parameter TinyLlama model using QLoRA on consumer hardware, training only 0.2% of the model's parameters via low-rank adapter matrices. The project inv…

00:00
2026-05-08
together.ai
ai-agents

Deploy and inference any model from HuggingFace

Netflix released void-model on Hugging Face, and a developer used the Goose CLI agent with Together's dedicated containers skill to deploy the model for inference on release day with a single prompt. …

00:00
2026-04-29
huggingface.co
artificial-intelligence

DeepInfra on Hugging Face Inference Providers 🔥

DeepInfra has been added as a supported Inference Provider on the Hugging Face Hub, offering serverless AI inference with over 100 models and cost-effective per-token pricing. The integration initiall…

09:14
2025-03-13
gist.github.com
artificial-intelligence

white-box LLM jailbreak using weight orthogonization

The gist contains a Python script for a "white-box LLM jailbreak" technique that uses weight orthogonalization. The script loads harmful and harmless instruction datasets, extracts hidden sta…

08:30
2023-10-28
gist.github.com
machine-learning

Hugging Face's Text Embeddings Inference Library

The gist describes using the Hugging Face Text Embeddings Inference library via a local server on port 8080. It demonstrates sending HTTP POST requests to the `/embed` e…

// co-occurs with top 8 entities