
Djdumpling (auto-discovered)

articles: 2 · domain: djdumpling.github.io · feed: RSS
00:00
2026-12-31
djdumpling.github.io
machine-learning

paper reading catalog

DeepSeek researchers introduced manifold-constrained hyper-connections to restore the identity mapping property in transformer architectures, addressing training instability and scalability issues cau…

06:39
2026-05-26
djdumpling.github.io
large-language-models

Frontier Model Training Methodologies

Seven open-weight frontier models, including Hugging Face's SmolLM3, DeepSeek-R1, and OpenAI's gpt-oss-120b, were analyzed to distill common training methodologies for multi-billion parameter models, …