cd/entity/Qwen· home› entities› Qwen

grep -l @qwen /news/*.json | wc -l → 240

Qwen

mentions 240 type Organization page 5/12 feed RSS

// recent coverage 240 mentions

07:40

2026-06-24

dev.to

large-language-models

One API Key for GPT, Claude, Gemini, and Qwen: A Practical Guide to OpenAI-Compatible Model Routing

TokenBay, an AI model API gateway, enables developers to route requests to multiple LLMs like GPT, Claude, Gemini, and Qwen using a single OpenAI-compatible interface. The approach simplifies switchin…

04:03

2026-06-24

dev.to

artificial-intelligence

How I Stopped Overpaying For AI Models (And You Can Too)

A developer compared API pricing versus self-hosting costs for open-source AI models, finding that for small projects with 1 million tokens per day, using an API is 32 times cheaper than self-hosting.…

02:54

2026-06-24

lesswrong.com

natural-language-processing

Can You Hide From a Natural Language Autoencoder?

Researchers stress-tested Natural Language Autoencoders (NLAs) by optimizing activation vectors to flip AV explanations while preserving model behavior, achieving an 81.4% flip rate with 99.6% label p…

02:21

2026-06-24

arxiv.org

large-language-models

Qwen-AgentWorld: Language World Models for General Agents

Alibaba's Qwen team released Qwen-AgentWorld-35B-A3B and Qwen-AgentWorld-397B-A17B, the first language world models capable of simulating agentic environments across seven domains via long chain-of-th…

17:00

2026-06-22

kdnuggets.com

artificial-intelligence

ChatLLM by Abacus AI Review: A Multi-Model AI Workspace Built for Daily Work

Abacus AI's ChatLLM platform offers a multi-model AI workspace that integrates leading models like GPT, Claude, Gemini, Grok, DeepSeek, and Qwen under a single subscription, along with AI agents, codi…

06:54

2026-06-22

dev.to

large-language-models

Mastering Ollama AI endpoints: How to use each one correctly

Ollama provides a REST API with 14 endpoints for running large language models locally. The API includes endpoints for text generation, chat, embeddings, model management, and OpenAI compatibility. De…

03:04

2026-06-22

dev.to

artificial-intelligence

Your Cloud AI Has No Failover. Here's the Architecture That Does.

An engineer argues that local AI models have become production-capable for enterprise use, citing hardware advances like Apple's M5 chip and NVIDIA's DGX Spark, and open-weight models that now rival c…

00:00

2026-06-22

fergusfinn.com

large-language-models

Adaptive speculative decoding: picking draft lengths at runtime

Researchers have developed adaptive speculative decoding, a method that dynamically selects draft lengths at runtime to optimize token generation efficiency in large language models. The approach addr…

00:00

2026-06-22

huggingface.co

large-language-models

We got local models to triage the OpenClaw repo for FREE!

Hugging Face engineer Onur developed a real-time notification system for the OpenClaw repository using local open-weight models like Gemma and Qwen, running on an NVIDIA GB10 with 128 GB of unified me…

22:55

2026-06-21

teachmecoolstuff.com

large-language-models

Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

A developer fine-tuned a tiny 0.6B-parameter Qwen 3 model to categorize household questions into metadata categories like pool, car, and HVAC. The baseline model achieved only 10% accuracy via prompti…

22:21

2026-06-21

dev.to

large-language-models

Why I Migrated From GPT-4o to DeepSeek — A Backend Engineer's Notes

A backend engineer migrated from GPT-4o to DeepSeek after comparing Chinese and US AI models on real production workloads, finding DeepSeek V4 Flash delivers competitive performance at 40-60x lower co…

11:08

2026-06-21

blog.jackdavis.net

ai-safety

Agent Privacy

Jack Davis released an open-source repository, agent-privacy, that implements four privacy actions—allow, redact, handoff, and block—for CLI-style agent harnesses handling sensitive data. The system t…

01:40

2026-06-21

github.com

large-language-models

Show HN: Cc-fleet – run other LLMs as Claude Code workers, your sub drives

A new open-source tool called cc-fleet enables Claude Code to use third-party large language models as workers, allowing users to run models from providers like DeepSeek, GLM, Kimi, and Qwen within Cl…

20:58

2026-06-20

vettedconsumer.com

large-language-models

Qwen3-30B-A3B: The Open Model Most People Should Actually Run

Alibaba's Qwen team released Qwen3-30B-A3B, a Mixture-of-Experts model with 30.5 billion total parameters but only 3.3 billion active per token, enabling it to run on a single 24 GB graphics card at s…

17:25

2026-06-20

chess-bench.com

artificial-intelligence

Was lucky to have tested fable 5 on chess-bench

A new chess benchmark, chess-bench, ranks AI models by performance, with Gemini 3.5 Flash leading at 61.3%, followed by Grok 4.1 Fast at 58.7% and Gemini 3.1 Pro Preview at 55.3%. The test included Cl…

13:05

2026-06-20

agide.dev

developer-tools

Ag.ide Index, rank, and refactor your repo's worst code

AG.IDE, a new tool that indexes, ranks, and refactors code repositories locally, aims to combat technical debt accumulation accelerated by AI coding agents. The tool runs entirely on hardware without …

00:17

2026-06-20

modal.com

large-language-models

Speculation Is All You Need

Modal Labs released state-of-the-art DFlash speculators for Qwen 3.5 and Qwen 3.6 models on Hugging Face, achieving 5-20% additional speedups and enabling Qwen 3.5 122B-A10B to run at over 1000 tok/s …

20:25

2026-06-19

lmsys.org

large-language-models

The next generation of speculative decoding: DFlash and Spec V2

Modal and Z Lab released DFlash, a speculative decoding model for Qwen 3.5 397B-A17B, achieving over 4.3x throughput versus baseline and 1.5x versus MTP on HumanEval at concurrency 1. The model uses a…

20:23

2026-06-19

dev.to

large-language-models

How to Access 50+ Chinese AI Models Through One API

AIWave has launched an API that provides access to over 50 Chinese AI models through a single OpenAI-compatible endpoint. The service supports models from DeepSeek, Zhipu, Qwen, and others, enabling d…

18:13

2026-06-19

dev.to

developer-tools

I Wired OpenRouter Free Models Into My OpenClaw Fallback Chain. Here's What Actually Works.

A developer fixed a broken fallback chain in their OpenClaw agent that was causing request timeouts during peak hours. The new chain includes seven entries: two local Ollama models, three OpenRouter f…

← prev page 5 / 12 next →

// co-occurs with top 8 entities

DeepSeek 92 Claude 41 OpenAI 39 Anthropic 37 Alibaba 34 Ollama 33 Gemma 33 GLM 28