cd/sources/hugging-face-blog· home› sources› Hugging Face Blog

cat /sources/hugging-face-blog.feed | wc -l → 49

Hugging Face Blog

articles 49 domain huggingface.co → page 1/3 feed RSS

15:56

2026-06-12

huggingface.co

large-language-models

olmo-eval: An evaluation workbench for the model development loop

The Allen Institute for AI released olmo-eval, an open-source evaluation workbench designed to support the iterative process of developing large language models. The tool builds on the institute's ear…

00:00

2026-06-11

huggingface.co

machine-learning

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

PyTorch's `nn.Linear` module transposes its weight tensor before performing matrix multiplication and addition, as revealed by profiler traces showing an `aten::t` operation that only modifies tensor …

10:46

2026-06-09

huggingface.co

ai-agents

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces

A coding agent built a 3D Paris gallery by chaining two Hugging Face Spaces — one for image generation and one for 3D reconstruction — without any human intervention in asset creation. The agent used …

00:00

2026-06-09

huggingface.co

mlops

Migrating Your GitHub CI to Hugging Face Jobs

Hugging Face launched a new GitHub Actions integration that allows developers to run CI jobs on Hugging Face's infrastructure instead of GitHub's default runners. The setup uses a dispatcher Space tha…

00:00

2026-06-08

huggingface.co

ai-agents

The Open Source Community is backing OpenEnv for Agentic RL

The open source community is backing OpenEnv, a tool for creating agentic execution environments, as it transitions to a committee-led governance structure with support from Meta-PyTorch, Nvidia, Hugg…

01:12

2026-06-07

huggingface.co

computer-vision

Deepfake Detector Robustness Testing

A new benchmark dataset, the Social Media Robustness Benchmark, evaluates how deepfake detectors perform on images re-encoded by social media platforms like Instagram, Facebook, TikTok, and X. The dat…

19:02

2026-06-06

huggingface.co

artificial-intelligence

Five labs, five minds: building a multi-model finance drama on small models

Five research labs built a multi-model finance simulation game where each of four woodland creature agents runs on a different lab's small language model, with a human player acting as a shadow financ…

15:36

2026-06-06

huggingface.co

artificial-intelligence

Job Searcher

A team of developers has built an AI-powered job search tool that uses a fine-tuned Qwen3-8B language model to generate LinkedIn queries, scrape job postings, and score each role against a candidate's…

11:42

2026-06-06

huggingface.co

ai-agents

Persona Atlas: Mapping How Famous Minds Think

Persona Atlas, a tool developed during the "build-small" hackathon, transforms public figures into measurable behavioral portraits by having a small-model agent research them online and answer open-en…

22:18

2026-06-05

huggingface.co

ai-agents

Thousand Token Wood: shipping a multi-agent economy on a 3B model

A team of developers built Thousand Token Wood, a multi-agent economic simulation where five AI-powered woodland creatures trade goods using a 3-billion-parameter Qwen2.5-3B model. The simulation, cre…

18:57

2026-06-04

huggingface.co

ai-safety

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Nvidia released Nemotron 3.5 Content Safety, a single 4-billion-parameter model that unifies multimodal input, multilingual coverage across 140 languages, custom enterprise policy enforcement, and aud…

12:59

2026-06-04

huggingface.co

artificial-intelligence

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

NVIDIA released Nemotron 3.5 ASR, a 600M-parameter streaming multilingual speech-to-text model that transcribes 40 language-locales from a single checkpoint with built-in punctuation and capitalizatio…

12:24

2026-06-04

huggingface.co

artificial-intelligence

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

ServiceNow released EVA-Bench Data 2.0, expanding its enterprise voice agent benchmark from one domain to three—Airline Customer Service Management, Enterprise IT Service Management, and Healthcare HR…

11:24

2026-06-04

huggingface.co

artificial-intelligence

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

NVIDIA researchers developed a task-seeded synthetic Q&A generation workflow for Nemotron-family pretraining that uses public task training splits as capability seeds to generate new task-aligned exam…

05:18

2026-06-04

huggingface.co

artificial-intelligence

A population-scale synthetic dataset for El Salvador

A new population-scale synthetic dataset for El Salvador has been released, containing over 8,000 multilingual, region-specific personas with detailed demographic, professional, and cultural attribute…

00:00

2026-06-04

huggingface.co

ai-agents

Designing the hf CLI as an agent-optimized way to work with the Hub

Hugging Face redesigned its `hf` command-line interface to optimize it for AI coding agents, which now account for significant traffic on the Hub. The new CLI auto-detects when an agent is driving it …

12:55

2026-06-03

huggingface.co

large-language-models

Direct Preference Optimization Beyond Chatbots

Dharma-AI released DharmaOCR, a structured OCR model, and published a paper demonstrating that Direct Preference Optimization (DPO) reduced text degeneration rates by an average of 59.4% across all te…

00:00

2026-06-03

huggingface.co

robotics

Adding MCP Tools to Reachy Mini

Pollen Robotics released remote tool support for the Reachy Mini robot, allowing users to add third-party capabilities like web search and weather lookups with a single command. The new system enables…

14:13

2026-06-02

huggingface.co

ai-agents

Holo3.1: Fast & Local Computer Use Agents

Holo3.1, a new family of computer-use agents, is now available with improved robustness across web, desktop, and mobile environments. The release introduces quantized checkpoints for local inference, …

15:45

2026-06-01

huggingface.co

large-language-models

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

JetBrains released Mellum2, a 12-billion-parameter Mixture-of-Experts model trained on natural language and code that activates only 2.5 billion parameters per token for efficient inference. The open-…

page 1 / 3 next →