Vivek Subbiah, Chief of Early-Phase Drug Development at the Sarah Cannon Research Institute, shared a post on LinkedIn:
“While SpaceX goes IPO and the world’s glued to FIFA World Cup 2026™ – Canada, Mexico and the United States an interesting paper popped up for your Friday read at Nature Portfolio.
Frontier general LLMs (GPT-5.2, Gemini 3.1 Pro, Claude Opus 4.6) outperformed specialized clinical AI tools (OpenEvidence, UpToDate Expert AI) on medical knowledge, clinician alignment + 1,800 blinded physician annotations on real clinical queries
This result was unexpected – ‘Specialized’ ≠ ‘Better’.”
Title: General-purpose large language models outperform specialized clinical AI tools on medical benchmarks
Authors: Krithik Vishwanath, Anton Alyakin, Mrigayu Ghosh, Ali Hage, Sean Neifert, Cordelia Orillac, Nataniel Mandelberg, Hammad Khan, Jin Vivian Lee, Jie Yao, William Small, Aakaash Varma, D. Brock Hewitt, Yindalon Aphinyanaphongs, Daniel Alber, Eric Oermann
Read the Full Article.

Other articles featuring Vivek Subbiah on OncoDaily.