cd /news/machine-learning/emyx-fast-and-efficient-all-atom-pro… · home topics machine-learning article
[ARTICLE · art-33561] src=arxiv.org ↗ pub= topic=machine-learning verified=true sentiment=↑ positive

Emyx: Fast and efficient all-atom protein generation

Researchers introduced Emyx, a 140M-parameter conditional flow matching model for all-atom protein generation, which outperforms larger models like RFdiffusion3 on enzyme design benchmarks while requiring only 682 GPU-hours of training. The model achieves higher success rates in global fold recovery and catalytic geometry accuracy, and offers greater structural novelty and diversity.

read1 min views1 publishedJun 19, 2026

arXiv:2606.19377v1 Announce Type: new Abstract: Computational enzyme design requires generating proteins that scaffold catalytic residues and ligands, a task that demands both geometric accuracy and structural diversity from the underlying generative model. Current all-atom generators inherit expensive architectures from structure prediction, leading to high training costs and limited sample diversity. We argue that much of this complexity is unnecessary for generators, which condition on sparse geometric constraints rather than rich co-evolutionary signals. Emyx is a 140M-parameter conditional flow matching model that concentrates capacity within standard transformer blocks, replacing heavy embedding stacks with lightweight conditional representations and sparse connectivity. We additionally derive an exact reparametrisation of the flow matching interpolant into the EDM noise-level framework, bridging flow matching training efficiency with state-of-the-art sampling methods designed for diffusion models without retraining. Despite being the smallest model, Emyx outperforms both Prote'ina-Complexa and RFdiffusion3 against the AME enzyme design benchmark across success rate under strict evaluation requiring both global fold recovery and catalytic geometry accuracy, structural novelty, scaffold diversity, and geometric validity, while training in just $682$ GPU-hours, roughly $4\times$ less than RFdiffusion3.

── more in #machine-learning 4 stories · sorted by recency
── more on @emyx 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/emyx-fast-and-effici…] indexed:0 read:1min 2026-06-19 ·