21:43
2026-06-27
dev.to
artificial-intelligence
Can an AI Agent Pass the Test We Give 4-Year-Olds?
Shridhar Shah, a senior software engineer at Outshift by Cisco, built two AI agents to test theory of mind using the Sally-Anne false-belief test. One agent, which only tracks reality, fails the test …