arXiv:2605.27654v1 Announce Type: new Abstract: Generative translation systems are cultural technologies because they decide how socially meaningful cues are rendered within culturally specific grammatical systems. We study one concrete notion of successful cultural translation: when an English source explicitly encodes gender, an English-to-Hindi translation should preserve the recoverability of that cue unless the source itself is ambiguous. We evaluate this criterion on a 37,345-instance benchmark spanning twelve categories and show that five systems frequently erase gender through ergative and honorific constructions. We then introduce two mechanism-aware inference-time interventions. The first, the Source-Aware Reranker (SAR), prefers candidates that avoid gender-neutralizing syntax. The second, the Phenomenon-Aware Reranker (PAR), preserves gender through targeted lexical marking even when ergative syntax remains. Across GPT-4o-mini and Sarvam, PAR improves target-subset accuracy from 11.07% to 54.47% and from 15.99% to 49.66%, respectively. Human evaluation shows that PAR increases gender preservation from 10.3% to 81.3%, but reduces mean fluency from 4.36 to 3.37. These findings place the two interventions on a preservation and fluency frontier rather than supporting a single dominant solution, and show how culturally situated generation can require explicit tradeoffs among fidelity, fluency, and stylistic naturalness.
Cultural Fidelity in English-to-Hindi Translation: A Preservation-Fluency Frontier for Gender Recoverability
Researchers evaluating English-to-Hindi translation systems found that five major generative models frequently erase gender cues through ergative and honorific constructions, failing to preserve gender recoverability from English source text. The team introduced two inference-time interventions—Source-Aware Reranker (SAR) and Phenomenon-Aware Reranker (PAR)—with PAR improving gender preservation from 10.3% to 81.3% in human evaluation, though at the cost of reducing mean fluency from 4.36 to 3.37. The findings establish a preservation-fluency frontier for culturally situated translation, demonstrating that fidelity to gender cues requires explicit tradeoffs with fluency and stylistic naturalness.
Run your AI side-project on zahid.host
EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.