23:30
2026-05-01
gist.github.com
large-language-models
Transplant MTP block from one GGUF file into another
A developer has released a Python script that transplants extra tensors—such as Multi-Token Prediction (MTP) layers—from one GGUF file into another, enabling the creation of mixed-quantization models.…