TechSlashdot1h ago

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best

This week OpenAI announced a 750-task test to to measure "whether AI systems can support realistic life science research tasks, not just answer biology questions." But while OpenAI's top-performing GPT-Rosalind model led the rankings, Slashdot reader BrianFagioli notes that "it…

Read full article

Source: Slashdot · Opens in new tab