FLAN-T5

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:00

2026-06-26

arxiv.org

large-language-models

Know2Guess: A Contamination-Aware Multi-Zone Benchmark for Knowledge-Boundary Evaluation in Large Language Models

Researchers introduced Know2Guess, a contamination-aware multi-zone benchmark with 1,200 items across five domains to evaluate large language models' ability to distinguish answerable knowledge from a…

// co-occurs with top 5 entities

Know2Guess 1 Qwen2.5-Instruct 1 Llama-3-Instruct 1 Qwen2.5-3B-Instruct 1 arXiv 1