cd/entity/Qwen· home entities Qwen
grep -l @qwen /news/*.json | wc -l → 240

Qwen

mentions 240 type Organization page 3/12 feed RSS

// recent coverage 240 mentions

11:37
2026-06-26
om.co
artificial-intelligence

The Copy and the Guru

Aaron Levie, CEO of Box, warns that CEOs are prone to 'AI psychosis' due to distance from hands-on work. Om Malik describes creating a personal AI assistant but criticizes the trend of digital twins, …

09:49
2026-06-26
github.com
ai-tools

Llama.cpp flags auto-tuning tool

Llama.cpp developer released ggrun, an auto-tuning tool that measures GPU, RAM, and PCIe topology to compute optimal multi-GPU and MoE expert placement for GGUF models, serving an OpenAI-compatible AP…

04:40
2026-06-26
dev.to
artificial-intelligence

Building a Self-Verifying FTIR Agent with Qwen Function Calling

A developer built ChemSpectra Agent, an FTIR spectral analysis system using Qwen-3.7-Max function calling, for the Qwen Cloud Hackathon. The agent autonomously selects from five analysis tools, cross-…

00:00
2026-06-26
runagentrun.co.uk
artificial-intelligence

DeepSeek Flash breaks the agent cost curve

Retriever, a browser-agent startup, cut the cost of automated web workflows by over 100x by swapping its planning model from a frontier API to DeepSeek V4 Flash, an openly licensed Chinese model. A mu…

23:42
2026-06-25
lesswrong.com
large-language-models

Exploring Generalization in NLA's

A researcher reproduced Anthropic's paper on natural language activations (NLAs), training models to generate textual descriptions of neural network activations. The study found that a single model tr…

22:16
2026-06-25
letsdatascience.com
ai-safety

Anthropic Alleges Distillation Theft by Alibaba Qwen Lab

Anthropic accused Alibaba's Qwen AI lab of conducting the largest known distillation attack on its Claude models, generating over 28.8 million exchanges through 25,000 fraudulent accounts from April t…

20:42
2026-06-25
huggingface.co
ai-infrastructure

Run a vLLM Server on HF Jobs in One Command

Hugging Face launched a one-command method to run a vLLM server on its Jobs infrastructure, enabling users to quickly deploy models for testing, evaluation, or batch generation. The feature uses the o…

18:08
2026-06-25
byteiota.com
artificial-intelligence

Alibaba Distilled Claude: Anthropic’s 28.8M-Query Alert

Anthropic accused Alibaba and its AI lab Qwen of orchestrating the largest known model distillation attack, using nearly 25,000 fraudulent accounts to generate 28.8 million exchanges with Claude betwe…

11:29
2026-06-25
discuss.huggingface.co
large-language-models

LLM "curving" via prompting

A researcher has developed a prompting technique called 'LLM curving' that shifts large language models from token-by-token prediction to a holistic self-organization mode, aiming to improve reasoning…

10:01
2026-06-25
discuss.huggingface.co
large-language-models

Deepseek? Qwen?

A single H200 GPU with 141GB HBM3e cannot comfortably run DeepSeek V4 Flash (284B total, 13B active parameters) due to VRAM constraints, even with 2TB system RAM for offloading. The model requires an …

09:50
2026-06-25
oracomputing.com
large-language-models

ORA: Smaller Models. Same Intelligence

Ora Computing launched an automated LLM compression engine that reduces model size by up to 70% with minimal accuracy loss, enabling deployment on edge devices, on-prem servers, or cloud infrastructur…

08:05
2026-06-25
dev.to
artificial-intelligence

I Replaced 2.5 Hours of Daily Busywork with a $0 AI Agent Setup

A developer replaced 2.5 hours of daily busywork with a $0 AI agent setup running on a Mac Mini M4. The system uses local LLMs (Ollama with Qwen models), Python scripts, and cron jobs to automate emai…

← prev page 3 / 12 next →
// co-occurs with top 8 entities