Pelican, or pelican't? A hint at Claude evals

A user discovered that Claude's deep research feature occasionally returns irrelevant or broken HTML in task titles, including unrelated topics like Hebrew calligraphy and SVG generation, suggesting a backend rendering race condition affecting 5–10% of requests.

I’m a big fan and frequent user of Claude’s deep research feature, so it caught my eye when the task title from my research subagent was a bit longer than usual. I’d asked about WWI causes and got the research title “Causes of WWI research disabled warnings about …” with some unexpected trailing text.

Confused, I inspected the title with Chrome DevTools and found that the h2 header ran far past the truncated text shown in the UI. The full title discussed re-enabling system update warnings for Windows and macOS, clearly unrelated to my prompt.

Was this just a one-off error? I kicked off a few more research tasks and, after a few tries, encountered another instance of irrelevant data in the header, this time regarding “Bananas: A Comprehensive History”, still using the same WWI prompt. So, not only weird, but inconsistent. Perhaps some kind of backend rendering race condition?

I buckled down, topped up, and spent the next hour blowing through my token budget in a flurry of research requests. I’d get a long or irrelevant header about 5–10% of the time. Most header suffixes were simply innocuous broken HTML fragments