The AI era is pulling FP64 hardware away from scientific HPC


Source: fortran-lang.discourse.group · Published: 2026-06-17T16:05Z

The AI boom is pulling GPU vendors away from double-precision (FP64) hardware essential for scientific HPC, as NVIDIA, AMD, and Intel prioritize low-precision AI cores. New chips like NVIDIA's B200 and AMD's MI355X show flat or reduced FP64 performance, while Intel canceled its dedicated HPC GPU. Only AMD's MI430X offers a strong FP64 line, but the broader market trend threatens the survival of first-class FP64 hardware for science.

Hi all.

I have one, maybe two, questions for you. They came out of a webinar series on High Performance Computing (HPC) I took part in (the Italy–Germany HPC webinars organised on the Italian side through CNR). I raised this concern there, and my impression was that the other speakers did not share it to the same degree. The room leaned more optimistic than I am. That is exactly why I want to put it to a wider audience: I may be wrong, and I would like to hear where others land.

The concern is precision. Most scientific HPC needs double precision (FP64). In computational fluid dynamics, which is my field, we resolve physical scales spanning many orders of magnitude, and to do that correctly (with very high-order accurate methods), we need 64-bit floating point.
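To make the scale-separation point concrete, here is a minimal sketch in Python. It is not taken from any CFD code: the values `mean` and `fluctuation` are hypothetical, and FP32 is emulated by rounding through the standard `struct` module. It adds a small fluctuation to a unit-sized mean value and then tries to recover it.

```python
import struct
import sys


def to_f32(x: float) -> float:
    """Round a Python float (FP64) to the nearest FP32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def add_f32(a: float, b: float) -> float:
    """Emulate a single FP32 addition: round the inputs and the result to FP32."""
    return to_f32(to_f32(a) + to_f32(b))


# Machine epsilon: the gap between 1.0 and the next representable number.
EPS_F64 = sys.float_info.epsilon  # 2**-52
EPS_F32 = 2.0 ** -23

# Hypothetical scale separation: a unit-sized mean plus a fluctuation
# nine orders of magnitude smaller.
mean = 1.0
fluctuation = 1.0e-9

recovered_f64 = (mean + fluctuation) - mean
recovered_f32 = add_f32(mean, fluctuation) - mean

print(f"FP32 epsilon: {EPS_F32:.3e}, FP64 epsilon: {EPS_F64:.3e}")
print(f"fluctuation recovered in FP64: {recovered_f64:.6e}")
print(f"fluctuation recovered in FP32: {recovered_f32:.6e}")
```

In FP32 the fluctuation falls below the spacing between representable numbers near 1.0 and is lost entirely, while FP64 keeps it to about six or seven significant digits.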

AI computing does not need this. Training and inference work well at 8-bit (now even at 4-bit). So, the two workloads require different hardware: AI needs many low-precision cores, while science requires strong FP64 capabilities.
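For a sense of how coarse 4-bit arithmetic is, the sketch below enumerates every value a 4-bit float can hold. It assumes the E2M1 layout (1 sign bit, 2 exponent bits with bias 1, 1 mantissa bit, no infinities or NaNs) that is commonly used for FP4. The helper name `decode_e2m1` is mine, and real AI formats also apply a per-block scale factor that is left out here.

```python
def decode_e2m1(bits: int) -> float:
    """Decode a 4-bit pattern as E2M1: 1 sign, 2 exponent (bias 1), 1 mantissa bit."""
    sign = -1.0 if (bits >> 3) & 1 else 1.0
    exponent = (bits >> 1) & 0b11
    mantissa = bits & 1
    if exponent == 0:
        # Subnormal range: no implicit leading 1, so only 0 and 0.5 are possible.
        magnitude = 0.5 * mantissa
    else:
        # Normal range: implicit leading 1 plus a half-step from the mantissa bit.
        magnitude = (1.0 + 0.5 * mantissa) * 2.0 ** (exponent - 1)
    return sign * magnitude


# All 16 bit patterns; +0 and -0 collapse to a single entry in the set.
fp4_values = sorted({decode_e2m1(b) for b in range(16)})

print(f"{len(fp4_values)} distinct FP4 values: {fp4_values}")
print("FP64 significand bits: 53, FP4 (E2M1) significand bits: 2")
```

The whole format covers 15 distinct values between -6 and 6. That is enough for scaled neural-network weights, but nowhere near enough to resolve a flow field.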

The problem is that the vendors follow the AI market because that is where the money is. Comparing on vector FP64 (peak, dense), the recent trend is to hold it flat or lower it, and spend the transistors on low-precision math instead:

NVIDIA H100: 34 TFLOP/s vector FP64, or 67 with the FP64 tensor-core path. The newer B200 does about 40 vector FP64. Blackwell dropped the dedicated FP64 tensor-core path that Hopper had, and gained around 20 PFLOP/s of FP4 for AI. The Rubin roadmap reportedly cuts FP64 further.

AMD MI300X: 81.7 TFLOP/s FP64. The newer MI355X does 78.6, below its own predecessor, with the gains all in FP8/FP4 for AI inference.

Intel has stepped back from a dedicated HPC GPU. Its current HPC silicon, the Max-series (Ponte Vecchio) in Aurora, has no standalone successor. Intel cancelled Falcon Shores as a product in early 2025 and folded its HPC and AI lines into one chip, Jaguar Shores, due around 2026/2027. Intel describes it as serving both AI and HPC, but says it will compete on total cost of ownership rather than peak FLOPS, and has published no FP64 figure.

Consumer silicon makes the direction plainest. NVIDIA’s N1X, the new Blackwell laptop chip, publishes only AI-precision figures (NVFP4, around 1000 TOPS) and quotes no FP64 at all. Double precision is simply not a design goal there.
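The generational ratios implied by these figures can be worked out directly. The short script below uses only the peak numbers quoted above, with "about 40" and "around 20 PFLOP/s" taken at face value. The variable names are mine, and this is back-of-the-envelope arithmetic on peak figures, not a benchmark.

```python
# Peak dense vector FP64 in TFLOP/s, as quoted in the post.
fp64_vector = {"H100": 34.0, "B200": 40.0, "MI300X": 81.7, "MI355X": 78.6}

# H100 with the FP64 tensor-core path, in TFLOP/s.
h100_fp64_tensor = 67.0

# B200 FP4 throughput: around 20 PFLOP/s, expressed in TFLOP/s.
b200_fp4 = 20_000.0

# New generation relative to the old one (1.0 means no change).
b200_vs_h100_vector = fp64_vector["B200"] / fp64_vector["H100"]
b200_vs_h100_tensor = fp64_vector["B200"] / h100_fp64_tensor
mi355x_vs_mi300x = fp64_vector["MI355X"] / fp64_vector["MI300X"]

# How much more FP4 than FP64 throughput the B200 offers.
b200_fp4_to_fp64 = b200_fp4 / fp64_vector["B200"]

print(f"B200 vs H100, vector FP64:        {b200_vs_h100_vector:.2f}x")
print(f"B200 vs H100 best-case FP64 path: {b200_vs_h100_tensor:.2f}x")
print(f"MI355X vs MI300X, FP64:           {mi355x_vs_mi300x:.2f}x")
print(f"B200 FP4 : FP64 throughput ratio: {b200_fp4_to_fp64:.0f} : 1")
```

On these numbers the B200 gains about 18% in vector FP64 over the H100 but reaches only about 60% of the H100's FP64 tensor-core peak. The MI355X sits about 4% below the MI300X, and the B200 delivers roughly 500 times more FP4 than FP64 throughput.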

So across all three vendors the direction looks the same. The new chips are built for AI, and double precision gets quietly de-prioritized along the way.

There is one strong counter-current. AMD’s MI430X, coming this year, is a deliberate HPC part. AMD claims more than 200 TFLOP/s of FP64, and independent estimates back out around 211 from the Alice Recoque exascale contract, which would be the highest of any GPU so far, while it still carries FP4/FP8 for AI. It will power Alice Recoque, the next European exascale machine, alongside the US Discovery and Germany’s Herder. So a dedicated FP64 line still exists, for now.

But it is one product line, from one vendor, against a whole market moving the other way. That is what I cannot resolve: whether a first-class FP64 hardware line survives, or shrinks to a small premium niche while everything else is optimized for AI.

Two questions for you:

Do you share this concern, or do you think I am overstating it?
If you share it, do you already see a way out?

I would be glad to hear how others in the Fortran and HPC community are thinking about this.

Stefano
