Gemma-2-9B

mentions 3 type Organization feed RSS

// recent coverage 3 mentions

04:00

2026-06-25

arxiv.org

large-language-models

Evidence for feature-specific error correction in LLMs

Researchers propose an empirical test for error correction in large language models, finding that residual-stream activations are robust to small perturbations and that feature-specific directions are…

04:00

2026-05-29

arxiv.org

large-language-models

Feature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Models

Researchers at an undisclosed institution analyzed LoRA fine-tuning in Gemma-2-9B using sparse autoencoders, finding that adapter-specific feature dictionaries show weak geometric alignment with pretr…

04:00

2026-05-25

arxiv.org

large-language-models

Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection

Researchers have developed a principled method for multilingual language steering in large language models using sparse autoencoders (SAEs), addressing the unreliability of existing English-only SAE a…

// co-occurs with top 8 entities

LoRA 1 Sparse Autoencoders 1 Gemma 1 LLaMA-3.1-8B 1 CrossSumm 1 SpBLEU 1 ROUGE-L 1 COMET 1