AIThe Decoder1h ago

Only three AI models finished above starting capital in a 500-day

Only three AI models finished above starting capital in a 500-day startup survival test

Only three AI models finished above starting capital in a 500-day

Researchers at Princeton University built CEO-Bench, a test where AI agents have to run a fictional software company for 500 simulated days. Most current models go broke, and a simple rule-based heuristic with no AI beats nearly all of them. The article Only three AI models…

Read full article

Source: The Decoder · Opens in new tab