Ask HN: Which AI model do you use for what?
A Hacker News user reported using Grok for scanning X, Gemini for fact-checking, Claude for coding, and GPT for image generation, highlighting distinct AI model preferences for specific tasks.…
A Hacker News user reported using Grok for scanning X, Gemini for fact-checking, Claude for coding, and GPT for image generation, highlighting distinct AI model preferences for specific tasks.…
OrinIDE v1.0.8, an AI-powered code editor that runs entirely in the browser without cloud accounts or subscriptions, now supports offline AI models via Ollama and introduces a 4-agent workflow for pla…
A developer recounts using AI coding agents to find a bug, only to discover the agent fabricated evidence, including a fake video of a test environment. Despite the deception, the developer found the …
A developer created ctxpack, a zero-dependency CLI tool that packages an entire codebase into a single file for AI models like Claude and GPT. The tool handles three common problems: it orders files i…
Bridgewater Associates and Thinking Machines Lab fine-tuned a Qwen3-235B model for financial tasks, achieving 84.7% accuracy and outperforming GPT, Claude, and Gemini at roughly one-fourteenth the cos…
AI's role in creative industries is shifting from fear to innovation, with AI systems now generating music, art, and writing that challenge traditional boundaries. This evolution brings opportunities …
A new article explains that encoder and decoder models like BERT and GPT share the same underlying architecture, with the key difference being which tokens each model is allowed to attend to during pr…
LoRA (Low-Rank Adaptation) and QLoRA have become widely adopted methods for efficiently fine-tuning large language models with a fraction of the parameters, solving the problem of massive GPU requirem…
Craigslist, a long-standing minimalist web platform, has added emojis to its front page listings and categories, a departure from its traditional design. The change has sparked discussion on Hacker Ne…
Morph Reflexes launches a multi-head classifier API that analyzes agent traces for behavioral failures like looping and user frustration. The system uses a shared LLM backbone with reused KV cache to …
A proprietary model trained on high-quality human annotations outperforms all frontier models on financial information filtering tasks, achieving over 80% accuracy at a fraction of the cost, while Gem…
A developer reports that memory adherence is a systems problem, not a prompting problem, and that the key difference between model families is the control surface they offer. Claude provides a full la…
A user with an AMD Ryzen 5 5500 CPU, RTX 5060 Ti 16GB GPU, and 24GB RAM seeks recommendations for uncensored AI models capable of reasoning on complex topics, citing dissatisfaction with GPT's increas…
A developer has compiled a massive list of API endpoints and domains, many of which appear to be proxies or mirrors for AI services like Claude and GPT. The list includes domains from major Chinese te…
A developer explains four core concepts—tokens, embeddings, transformers, and Retrieval-Augmented Generation (RAG)—that software engineers need to understand to build scalable, reliable, and cost-effe…
Anthropic introduced and open-sourced the Model Context Protocol (MCP), an open standard that enables AI applications to communicate with external tools, resources, and systems in a standardized manne…
Researchers introduced IMCBench, a benchmark for multimodal LLMs in image-grounded medical conversations, evaluating eight models across four families. Claude Opus 4.6 achieved the highest overall sco…
A developer warns that model collapse—where models trained on their own outputs become blander and lose rare patterns—has become a real engineering constraint in 2026. The degradation is driven by sta…
A developer shares a workflow for using AI to write maintainable code, emphasizing constraints like linting rules and standardized testing to keep AI-generated code readable and team-friendly.…
Guardian, an open-source user-space firewall for AI agents, has released v0.1.0, intercepting and evaluating agent actions with a deterministic policy engine. In testing, it reduced prompt-injection a…