Return to Article Details Benchmarking Large Language Models on Diagnostic Inference Tasks in Medical Texts Download Download PDF