Publication
Article
Digital Edition
Author(s):
Investigators looked to see if a large language model could provide safe, accurate, and readable information to patient inquiries about cataract surgery.
Reviewed by Kevin T. Eid, MD, MSc
With the use of artificial intelligence (AI) growing exponentially in health care, it is important to determine just how accurate, safe, and usable the technology is in clinical practice.
Kevin Eid, MD, MSc, and colleagues addressed specific questions about AI’s performance regarding patient questions following cataract surgery. Generally, the technology did a good job providing information about commonly asked questions by the 2 bots analyzed, ChatGPT and Google’s Bard (now called Gemini), with specific differences observed between them.
Eid is a postdoctoral ophthalmic pathology/research fellow, Intermountain Ocular Research Center (Mamalis/Werner Lab), John A. Moran Eye Center, University of Utah Health, Salt Lake City. He presented the results of their study at the American Society of Cataract and Refractive Surgery annual meeting in Boston, Massachusetts.
The investigators wanted to know if a large language model such as the ones under discussion provide safe, accurate, and readable information to patient inquiries about cataract surgery.
Previous studies have reported that chatbots were effective in a few areas: improving the readability of ophthalmology patient education materials,1 creating postoperative instructions in ophthalmology,2 and answering Ophthalmic Knowledge Assessment Program questions.3
The investigators searched Quora and found 209 patient questions about cataract surgery. They asked the questions of ChatGPT and Bard, and ophthalmologists and ophthalmologists-in-training graded the answers for accuracy and safety with 1 indicating extremely poor; 2, poor to mediocre, 3, good; and 4, excellent, Eid explained.
ChatGPT-4 was used in this study instead of ChatGPT.
Readability was assessed using the SMOG (Simple Measure of Gobbledygook) index on a scale of 1 to 4, with the scores corresponding to reading levels from 4, middle school reading level; 3, high school; 2, college; and 1, graduate school. The American Medical Association recommends patient education materials be at a middle-school reading level, while most widely available patient education materials in ophthalmology are at a college reading level.
Eid provided the following examples of questions and responses.
Q: How long does it take to see clearly after cataract surgery?
A: “Most patients experience significant improvement in their vision within the first few days after cataract surgery. However, it can take several weeks for your vision to fully stabilize and reach its optimal level. During this time, you may still experience some blurriness, sensitivity to light, or halos around lights. This is normal and part of the healing process.”
Q: Can an optometrist perform cataract surgery?
A: “No, optometrists are not qualified to perform cataract surgery. This is a specialized procedure that requires extensive training and experience in ophthalmology. Your optometrist can diagnose cataracts and monitor your vision, but they will refer you to an ophthalmologist for surgery.”
Q:Can the lens placed in your eye during cataract surgery dislodge?
A: “IOLs are securely implanted within the eye and have a very low risk of dislodging. They’re designed to withstand everyday activities and physical impact. However, if you experience sudden vision loss or eye pain, contact your doctor immediately to rule out any potential complications.”
Eid reported that ChatGPT yielded 13 responses with at least an accuracy or safety grade below 3 and Bard/ yielded 29 responses, a difference that reached significance (P = .01).
An average of about 10% of answers were considered poor.
The readability scores for ChatGPT and Bard were, respectively, 2.15 and 2.10, which did not differ significantly (P =.42).
The resultant respective cumulative averages, 3.04 and 3.05, also did not differ significantly (P = .77).
Eid and colleagues concluded that ChatGPT and Bard are promising tools for answering common patient questions about cataract surgery. The former was more consistent in its answers and the latter provided better good answers and worse bad answers. Readability is still an issue for patient education materials.
Finally, the programs provided some inaccurate or unsafe answers at a cumulative average rate of about 10% of the time, which is considered concerning.