cd/entity/Qwen· home entities Qwen
grep -l @qwen /news/*.json | wc -l → 240

Qwen

mentions 240 type Organization page 5/12 feed RSS

// recent coverage 240 mentions

04:03
2026-06-24
dev.to
artificial-intelligence

How I Stopped Overpaying For AI Models (And You Can Too)

A developer compared API pricing versus self-hosting costs for open-source AI models, finding that for small projects with 1 million tokens per day, using an API is 32 times cheaper than self-hosting.…

02:54
2026-06-24
lesswrong.com
natural-language-processing

Can You Hide From a Natural Language Autoencoder?

Researchers stress-tested Natural Language Autoencoders (NLAs) by optimizing activation vectors to flip AV explanations while preserving model behavior, achieving an 81.4% flip rate with 99.6% label p…

02:21
2026-06-24
arxiv.org
large-language-models

Qwen-AgentWorld: Language World Models for General Agents

Alibaba's Qwen team released Qwen-AgentWorld-35B-A3B and Qwen-AgentWorld-397B-A17B, the first language world models capable of simulating agentic environments across seven domains via long chain-of-th…

06:54
2026-06-22
dev.to
large-language-models

Mastering Ollama AI endpoints: How to use each one correctly

Ollama provides a REST API with 14 endpoints for running large language models locally. The API includes endpoints for text generation, chat, embeddings, model management, and OpenAI compatibility. De…

00:00
2026-06-22
fergusfinn.com
large-language-models

Adaptive speculative decoding: picking draft lengths at runtime

Researchers have developed adaptive speculative decoding, a method that dynamically selects draft lengths at runtime to optimize token generation efficiency in large language models. The approach addr…

00:00
2026-06-22
huggingface.co
large-language-models

We got local models to triage the OpenClaw repo for FREE!

Hugging Face engineer Onur developed a real-time notification system for the OpenClaw repository using local open-weight models like Gemma and Qwen, running on an NVIDIA GB10 with 128 GB of unified me…

11:08
2026-06-21
blog.jackdavis.net
ai-safety

Agent Privacy

Jack Davis released an open-source repository, agent-privacy, that implements four privacy actions—allow, redact, handoff, and block—for CLI-style agent harnesses handling sensitive data. The system t…

20:58
2026-06-20
vettedconsumer.com
large-language-models

Qwen3-30B-A3B: The Open Model Most People Should Actually Run

Alibaba's Qwen team released Qwen3-30B-A3B, a Mixture-of-Experts model with 30.5 billion total parameters but only 3.3 billion active per token, enabling it to run on a single 24 GB graphics card at s…

17:25
2026-06-20
chess-bench.com
artificial-intelligence

Was lucky to have tested fable 5 on chess-bench

A new chess benchmark, chess-bench, ranks AI models by performance, with Gemini 3.5 Flash leading at 61.3%, followed by Grok 4.1 Fast at 58.7% and Gemini 3.1 Pro Preview at 55.3%. The test included Cl…

13:05
2026-06-20
agide.dev
developer-tools

Ag.ide Index, rank, and refactor your repo's worst code

AG.IDE, a new tool that indexes, ranks, and refactors code repositories locally, aims to combat technical debt accumulation accelerated by AI coding agents. The tool runs entirely on hardware without …

00:17
2026-06-20
modal.com
large-language-models

Speculation Is All You Need

Modal Labs released state-of-the-art DFlash speculators for Qwen 3.5 and Qwen 3.6 models on Hugging Face, achieving 5-20% additional speedups and enabling Qwen 3.5 122B-A10B to run at over 1000 tok/s …

20:25
2026-06-19
lmsys.org
large-language-models

The next generation of speculative decoding: DFlash and Spec V2

Modal and Z Lab released DFlash, a speculative decoding model for Qwen 3.5 397B-A17B, achieving over 4.3x throughput versus baseline and 1.5x versus MTP on HumanEval at concurrency 1. The model uses a…

20:23
2026-06-19
dev.to
large-language-models

How to Access 50+ Chinese AI Models Through One API

AIWave has launched an API that provides access to over 50 Chinese AI models through a single OpenAI-compatible endpoint. The service supports models from DeepSeek, Zhipu, Qwen, and others, enabling d…

← prev page 5 / 12 next →
// co-occurs with top 8 entities