How should we test AI for human-level intelligence? OpenAI’s o3 electrifies quest
OpenAI’s o3 tops new AI league table for answering scientific questions
DeepMind and OpenAI models solve maths problems at level of top students
Generating credible referenced medical research: A comparative study of openAI's GPT-4 and Google's gemini
‘AI models are capable of novel research’: OpenAI’s chief scientist on what to expect
OpenAI’s ‘deep research’ tool: is it useful for scientists?