OLMo

mentions: 4 · type: Organization

// recent coverage 4 mentions

23:01

2026-07-21

dev.to

artificial-intelligence

VIDRAFT Releases Aether-7B-5Attn: A Fully Open-Source MoE LLM with Five Heterogeneous Attention Mechanisms

Korean AI startup VIDRAFT has released Aether-7B-5Attn, a 6.59B-parameter Mixture-of-Experts foundation model on Hugging Face under Apache-2.0. The model features five heterogeneous attention mechanis…

19:13

2026-06-21

discuss.huggingface.co

large-language-models

OLMo-core + Engram graft: small-scale debug comparison

A debug comparison between a base OLMo3 600M model and an Engram memory variant showed the grafted model achieved lower training and evaluation cross-entropy loss and faster gradient norm stabilizatio…

04:00

2026-06-18

actu.epfl.ch

large-language-models

EPFL launches the first open medical LLMs

EPFL researchers released MeditronFO, the first fully open framework for building medical large language models, making every stage of development publicly available to ensure transparency and account…

21:33

2026-06-12

dev.to

large-language-models

LLM KV Cache Optimization, Open Model Evaluation, & Agent Engineering Skills for Local Deployment

LMCache introduces a novel KV cache optimization layer to accelerate LLM inference, enabling faster local deployment on consumer hardware. AllenAI releases olmo-eval, a workbench for evaluating open l…

// co-occurs with top 8 entities

Apertus (2), Engram (1), DeepSeek (1), Dion (1), Microsoft (1), Weights & Biases (1), EPFL (1), MeditronFO (1)

// topics top 6 topics

large language models (4), ai research (3), ai products (2), artificial intelligence (2), ai infrastructure (1), ai ethics (1)