Introduction In emergency medicine, accurate triage is vital for patient outcomes and resource management. The Canadian Triage and Acuity Scale (CTAS) has been essential in training healthcare providers to make prompt and precise triage decisions.1 With the rise of artificial intelligence (AI), chatbots are being considered for their potential to support or even replace human decision-making in various medical situations. This study aims to assess the agreement between AI chatbot triage decisions and those made by experienced emergency physicians using CTAS. Methods This study involved a comparative analysis between an AI chatbot and two expert emergency physicians, each with over ten years of experience. We used a dataset of 60 emergency case scenarios, which have been utilized for over 8-10 years to train medical personnel at the start of their careers.1 The AI chatbot received training materials on CTAS and triage before being tasked with assigning appropriate triage levels for each scenario. Meanwhile, the expert physicians independently triaged the same cases. Scenarios where the two experts disagreed on the triage level were excluded, leaving 35 case scenarios for the final analysis. To evaluate the agreement between the AI chatbot and the expert physicians, we used the Cohen’s Kappa coefficient. This included determining the Cohen’s Kappa coefficient value, the p-value, and the 95% confidence interval (CI) to assess the statistical significance and reliability of the agreement. Results The Cohen’s Kappa coefficient value between the AI chatbot and the expert physicians was 0.721, indicating a substintial level of agreement. The p-value was <0.001, showing a statistically significant correlation. The 95% confidence interval ranged from 0.539 to 0.903, further supporting the reliability of the observed agreement. Discussion The substintial agreement suggests that the AI chatbot can make triage decisions that closely match those of experienced emergency physicians. This finding is important as it demonstrates the potential of AI chatbots to assist in emergency medicine, particularly in settings with limited resources or during peak times when quick decision-making is crucial.2 However, it is noteworthy that the study excluded scenarios where the experts disagreed, possibly indicating more complex or ambiguous cases. Further research should investigate the chatbot's performance in such scenarios and its ability to manage a broader range of cases.2 Conclusion This study shows that AI chatbots have the potential to achieve a high level of agreement with expert physicians in triaging emergency cases using CTAS. Integrating AI in emergency departments could improve triage efficiency and support medical staff, ultimately enhancing patient care. Future studies should examine the chatbot's performance in more complex cases and its integration into real-world clinical settings. References 1. Beveridge R, Clarke B, Janes L, et al. Implementation guidelines for the Canadian Emergency Department Triage & Acuity Scale (CTAS). Can J Emerg Med. 1999;1(3 Suppl). 2. Dong SL, Bullard MJ, Meurer DP, et al. Emergency triage: comparing a novel computer triage program with standard triage. Acad Emerg Med. 2005;12(6):502-507.
Keywords: Triage, AI, Health Informatics, Emergency Department