SikamikanikoBG

mentions: 1, type: Organization

// recent coverage (1 mention)

2026-06-03 03:35, dev.to, large-language-models

Fitting WhisperX large-v3 + a 24B LLM on one 3090: a reproducible context-capping recipe

A developer ran WhisperX large-v3 (7.7 GB) and a 24B-parameter LLM (Devstral Small 2) at the same time on a single 24 GB RTX 3090 by reducing the LLM's context window from 40,960 to 8,19…

// co-occurs with top 6 entities

WhisperX (1), Ollama (1), devstral-small-2 (1), Claude Code (1), RTX 3090 (1), NVIDIA (1)