Conclusion: In complex primary care cases, GPT-4 performs worse than human doctors taking the family medicine specialist examination. Future GPT-based chatbots may perform better, but comprehensive evaluations are needed before implementing chatbots for medical decision support in primary care.
Utvärdering av GPT i jämförelse med läkare i svensk sjukvård.
Vidare till källan: bmjopen.bmj.com