"Excuse me, may I say something..." CoLabScience, A Proactive AI Assistant for Biomedical Discovery and LLM-Expert Collaborations

Researchers introduced CoLabScience, a proactive AI assistant built on PULI (Positive-Unlabeled Learning-to-Intervene), a framework trained with a reinforcement learning objective to decide when and how to intervene in biomedical research discussions. The authors also introduce BSDD, a new benchmark of simulated research dialogues, and report that PULI significantly outperforms existing baselines in intervention precision and collaborative task utility, pointing to the potential of proactive LLMs in scientific discovery.

Published Jun 22, 2026 · 2 min read

[Yang Wu](/people/yang-wu-3505/),
[Jinhong Yu](/people/jinhong-yu/unverified/),
[Jingwei Xiong](/people/jingwei-xiong/),
[Zhimin Tao](/people/zhimin-tao/),
[Xiaozhong Liu](/people/xiaozhong-liu/)
Abstract

The integration of Large Language Models (LLMs) into scientific workflows presents exciting opportunities to accelerate biomedical discovery. However, the reactive nature of LLMs, which respond only when prompted, limits their effectiveness in collaborative settings that demand foresight and autonomous engagement. In this study, we introduce CoLabScience, a proactive LLM assistant designed to enhance biomedical collaboration between AI systems and human experts through timely, context-aware interventions. At the core of our method is PULI (Positive-Unlabeled Learning-to-Intervene), a novel framework trained with a reinforcement learning objective to determine when and how to intervene in streaming scientific discussions, by leveraging the team’s project proposal and long- and short-term conversational memory. To support this work, we introduce BSDD (Biomedical Streaming Dialogue Dataset), a new benchmark of simulated research discussion dialogues with intervention points derived from PubMed articles. Experimental results show that PULI significantly outperforms existing baselines in both intervention precision and collaborative task utility, highlighting the potential of proactive LLMs as intelligent scientific assistants.

- Anthology ID: 2026.acl-long.1671
- Volume: [Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)](/volumes/2026.acl-long/)
- Month: July
- Year: 2026
- Address: San Diego, California, United States
- Editors: [Maria Liakata](/people/maria-liakata/), [Viviane P. Moreira](/people/viviane-p-moreira/unverified/), [Jiajun Zhang](/people/jiajun-zhang/unverified/), [David Jurgens](/people/david-jurgens/)
- Venue: [ACL](/venues/acl/)
- Publisher: Association for Computational Linguistics
- Pages: 36109–36129
- URL: [https://aclanthology.org/2026.acl-long.1671/](https://aclanthology.org/2026.acl-long.1671/)