[ARTICLE · art-27771] src=discuss.huggingface.co ↗ pub= topic=large-language-models verified=true sentiment=↑ positive

[Dataset] Efficient LLM papers (quantization, LoRA, MoE, FlashAttention) from arXiv + Semantic Scholar — 1,734 records, quality-scored, JSONL

A new dataset, fineset-io/efficient-llm-papers, compiles 1,734 quality-scored records of papers from arXiv and Semantic Scholar on efficient LLM techniques such as quantization, LoRA, MoE, and FlashAttention, distributed in JSONL format. The dataset aims to serve as a reference for state-of-the-art efficiency methods and as a clean corpus for fine-tuning models to reason about these techniques.

1 min read · published Jun 15, 2026

Most of us aren’t training frontier models; we’re trying to fit a good one onto the hardware we actually have. The research that makes that possible (quantization, LoRA/PEFT, mixture-of-experts, FlashAttention, KV-cache tricks, Mamba/SSMs) is scattered across hundreds of arXiv papers, and it’s some of the fastest-moving work in ML right now.

So I assembled it into one dataset: fineset-io/efficient-llm-papers. I find it useful as a “what’s the current state of the art for making this cheaper” reference, and as a clean corpus if you’re fine-tuning a model to reason about efficiency techniques.
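
As a rough illustration of working with quality-scored JSONL records like these, here is a minimal Python sketch that parses one JSON object per line and keeps only records above a score threshold. The field names (`title`, `topic`, `quality_score`), the sample rows, and the 0.5 threshold are all assumptions made for illustration, not the dataset's documented schema; check the dataset card for the real field names.

```python
import io
import json


def load_jsonl(stream):
    """Parse one JSON object per non-empty line of a text stream."""
    return [json.loads(line) for line in stream if line.strip()]


def filter_by_quality(records, min_score, score_key="quality_score"):
    """Keep records scoring at least min_score.

    Records that lack the score field are dropped rather than guessed at.
    `score_key` is a hypothetical field name.
    """
    return [r for r in records if r.get(score_key, float("-inf")) >= min_score]


# Made-up sample rows standing in for the real file (hypothetical schema).
SAMPLE = io.StringIO(
    '{"title": "Paper A", "topic": "quantization", "quality_score": 0.91}\n'
    '{"title": "Paper B", "topic": "lora", "quality_score": 0.42}\n'
    "\n"
    '{"title": "Paper C", "topic": "moe"}\n'
)

records = load_jsonl(SAMPLE)
kept = filter_by_quality(records, min_score=0.5)
print([r["title"] for r in kept])
```

With a real download, the same two functions should work on an open file handle in place of the in-memory sample, once `score_key` is set to whatever the dataset actually calls its score field.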

Happy to take suggestions on gaps or answer questions about how the pipeline works.
