Despite near-perfect exam scores, large language models falter when real people rely on them for medical advice, exposing a critical gap between AI knowledge and safe patient decision-making. Study: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results