
Ophthalmology Times: June 15, 2021
Volume 46
Issue 10

Natural language processing pairs with big data curation

Knowledge of the tools used in data interpretation helps clinicians trust the accuracy of findings.

Special to Ophthalmology Times®

Artificial intelligence (AI) is permeating society, directing everything from the “products you may like” portion of an e-commerce site to a GPS suggesting a faster route to your destination.

One of the fastest-growing areas for AI is medicine, and ophthalmology is helping to lead the way in its evolution.

For example, deep-learning AI programs that interpret fundus photographs of patients with diabetes may be used to improve screening for diabetic retinopathy.1

In some cases, AI relies on natural language processing (NLP) to gather and interpret language-based data.

In ophthalmology, NLP can process electronic health record (EHR) information from the American Academy of Ophthalmology (AAO) Intelligent Research in Sight (IRIS) Registry, which houses data from 367 million patient encounters involving more than 65 million unique patients.

Verana Health, the data curation and analytics partner of the academy, organizes data from the registry to prepare it for interpretation via NLP.

Natural language processing
Although the phrase “natural language processing” may be new to some readers, many individuals frequently (and perhaps unknowingly) interact with it.

A common encounter with NLP occurs when interfacing with document-scanning technology that converts text into digital data.

Optical character recognition, an early NLP method, identifies letters, words, and phrases from static documents and converts them to data points.

Imagine a scenario in which you are tasked with entering your passport information into a portal. You have 2 options to enter data such as your given name, surname, and nation of origin.

You can manually enter your information in each field, or you can use your phone to snap a picture of your passport’s relevant pages and allow NLP to populate the fields.

The latter option, which extracts language from a photograph and places it in the appropriate areas of the portal, is quicker.
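
To make the idea concrete, the following is a minimal Python sketch of the scan-and-populate step, assuming the open-source pytesseract and Pillow libraries are available; the file name, field labels, and regular expressions are hypothetical placeholders, not any vendor’s actual implementation.

import re

import pytesseract
from PIL import Image

def extract_passport_fields(image_path: str) -> dict:
    """Run OCR on a photographed document, then pull out labeled fields."""
    raw_text = pytesseract.image_to_string(Image.open(image_path))

    # Hypothetical label patterns; real passports vary by country.
    patterns = {
        "surname": r"Surname[:\s]+([A-Z]+)",
        "given_name": r"Given [Nn]ames?[:\s]+([A-Z]+)",
        "nationality": r"Nationality[:\s]+([A-Z ]+)",
    }
    fields = {}
    for field, pattern in patterns.items():
        match = re.search(pattern, raw_text)
        fields[field] = match.group(1).strip() if match else None
    return fields

print(extract_passport_fields("passport_photo.jpg"))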

Machine learning applied to NLP has broader uses, making it well suited to analyzing the text-heavy data in the IRIS Registry.

If NLP is used to interpret the hundreds of millions of EHR data points in the registry, it may yield real-world information on treatment outcomes, disease prevalence, and treatment patterns.

CLINICAL FINDINGS IN OPHTHALMOLOGY
Two examples of how AI could be used to examine IRIS Registry data illustrate the potential of drawing insights from large databases via NLP analysis.

Grading severity
NLP could be used to search IRIS Registry data for a series of words or phrases in patient records. Searching for prespecified phrases or words may confirm the accuracy of coding data.
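
As a simple illustration, the sketch below shows what a rule-based phrase search over free-text chart notes might look like in Python; the note text and phrase list are hypothetical placeholders, not actual IRIS Registry content.

SEVERITY_PHRASES = [
    "severe stage glaucoma",
    "advanced glaucomatous damage",
    "visual field loss in both hemifields",
]

def phrases_found(note_text: str) -> list[str]:
    """Return the prespecified phrases that appear in a chart note."""
    lowered = note_text.lower()
    return [phrase for phrase in SEVERITY_PHRASES if phrase in lowered]

note = "Exam shows advanced glaucomatous damage OD; continue latanoprost."
print(phrases_found(note))  # ['advanced glaucomatous damage']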

For the sake of illustration, consider glaucoma.

Clinicians use various qualitative (eg, types of procedures undergone, medication history, whether cataract surgery has occurred) and quantitative (eg, cup-to-disc ratios, IOP measurements, visual acuity, visual field data) data points to classify a patient’s glaucoma severity.

No single data point leads to a diagnosis of mild, moderate, or severe glaucoma, and patients with similar quantitative profiles may be classified differently based on qualitative data points.
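
The multi-factor nature of staging can be sketched in code. The toy rules below are hypothetical placeholders meant only to show how qualitative and quantitative inputs combine; they are not the AAO or American Glaucoma Society criteria.

from dataclasses import dataclass

@dataclass
class GlaucomaRecord:
    cup_to_disc_ratio: float            # quantitative
    iop_mmhg: float                     # quantitative
    field_loss_both_hemifields: bool    # quantitative (visual field data)
    on_multiple_medications: bool       # qualitative (medication history)

def stage(record: GlaucomaRecord) -> str:
    """No single data point decides the stage; several are weighed together."""
    # Hypothetical thresholds, for illustration only.
    if record.field_loss_both_hemifields and record.cup_to_disc_ratio > 0.8:
        return "severe"
    if record.cup_to_disc_ratio > 0.6 or record.on_multiple_medications:
        return "moderate"
    return "mild"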

ICD-10 provides codes for various degrees of glaucoma severity. In a perfect world, clinicians would accurately code each case during every visit.

However, due to many factors, the stage of glaucoma may not be updated in the EHR to reflect the current clinical state of the patient.

By using NLP, investigators can confirm that the coded diagnosis reflects the qualitative and quantitative measurements of a patient encounter.

Suppose investigators needed to determine the number of patients categorized as having severe glaucoma.

After defining severe glaucoma with a combination of qualitative and quantitative parameters, relying on definitions from the AAO and the American Glaucoma Society designed to reduce subjectivity in stage diagnoses, investigators could perform a customized NLP search of IRIS Registry records to confirm that the number of severe cases coded in a given time frame matches the number of severe cases defined by the details of the encounters.
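
In code, that reconciliation might reduce to a count comparison like the sketch below, which assumes encounters have already been flattened into a pandas DataFrame with the hypothetical columns shown; real IRIS Registry schemas would differ.

import pandas as pd

# Hypothetical encounter table: one row per encounter in the time frame.
encounters = pd.DataFrame({
    "coded_severe": [True, False, True],            # ICD-10 stage code says severe
    "meets_severe_definition": [True, True, True],  # derived via NLP + measurements
})

coded = int(encounters["coded_severe"].sum())
derived = int(encounters["meets_severe_definition"].sum())
print(f"Coded severe: {coded}; definition-based severe: {derived}")

# Mismatched rows are the encounters to flag for review.
mismatches = encounters[
    encounters["coded_severe"] != encounters["meets_severe_definition"]
]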

NLP-based analysis helps ensure the accuracy of the real-world data housed in the IRIS Registry. Automating these quality checks saves time and improves the quality of the overall body of data at investigators’ disposal.

Reconciling coding data with real-world prevalence
Efforts to understand the prevalence of particular diseases in ophthalmology may be limited by how encounters are coded.

For example, a patient presenting to a cataract surgeon for preoperative evaluation may also have early age-related macular degeneration (AMD) in addition to their cataract.

It is possible that this encounter will be coded only as a cataract for reimbursement purposes, with the ICD-10 code for AMD never entered.

This patient’s AMD would then go undetected by investigators who leverage coding data to estimate disease prevalence.

However, an NLP-based analysis of IRIS Registry data could detect the presence of underreported or unreported disease in patient charts, thereby generating a more robust picture of real-world disease rates.
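
A sketch of that kind of check appears below: it flags charts whose free text mentions AMD but whose billed codes do not include one from the macular degeneration family. The note text and code list are hypothetical placeholders; only the H35.3 ICD-10 family is real.

AMD_TERMS = ("macular degeneration", "drusen", "geographic atrophy")
AMD_CODE_PREFIX = "H35.3"  # ICD-10 family for degeneration of the macula

def amd_undercoded(note_text: str, billed_codes: list[str]) -> bool:
    """True when a note mentions AMD but no AMD code was billed."""
    mentions_amd = any(term in note_text.lower() for term in AMD_TERMS)
    coded_amd = any(code.startswith(AMD_CODE_PREFIX) for code in billed_codes)
    return mentions_amd and not coded_amd

note = "Cataract preop evaluation; early AMD with scattered drusen OU."
print(amd_undercoded(note, ["H25.11"]))  # True: AMD noted, no H35.3 code billed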

Trusting the algorithm
The more clinicians know about how NLP-based analyses make determinations, and the more transparent the models are, the more willing they may be to accept the results of AI reports.

All NLP algorithms require a degree of explainability, which allows investigators to understand how heavily an algorithm weighs particular pieces of data.

If NLP determines that, for example, a certain percentage of patients of a certain age have AMD, then investigators can examine the algorithm’s methods to ensure that a legitimate medical reason exists for this conclusion.
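
With a transparent model, that examination can be as simple as reading the model’s learned weights. The sketch below uses scikit-learn’s logistic regression on toy data; the feature names and values are hypothetical.

import numpy as np
from sklearn.linear_model import LogisticRegression

features = ["age", "drusen_mentioned", "current_smoker"]
X = np.array([[78, 1, 0], [55, 0, 1], [82, 1, 1], [40, 0, 0]])
y = np.array([1, 0, 1, 0])  # 1 = AMD documented in the chart

model = LogisticRegression().fit(X, y)

# The coefficients show how heavily each input is weighed, so investigators
# can check that the model leans on medically plausible signals.
for name, coef in zip(features, model.coef_[0]):
    print(f"{name}: {coef:+.3f}")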

Instances may arise in which AI detects disease that is imperceptible to human evaluation or is linked to heretofore unknown anatomic manifestations.

Machine learning–based algorithms, in which AI platforms learn to detect patterns from massive data sets, have been shown to accurately estimate the age, gender, smoking status, and systolic blood pressure of patients based on fundus photographs alone.2

How or why those algorithms make their determinations is not yet understood, but their results nonetheless show the potential of AI to change the landscape of medicine.

WHAT’S NEXT?
NLP may be one of the most important tools for extracting meaningful insights from real-world data in the IRIS Registry.

The better we understand how IRIS Registry data are curated and analyzed, the more we can embrace the results of AI data analyses.

--

Theodore Leng, MD, MS
e:vision.md@gmail.com

Leng is the director of research at the Byers Eye Institute at Stanford University in California and a medical adviser to Verana Health.



--

References
1. Lu L, Ren P, Lu Q, et al. Analyzing fundus images to detect diabetic retinopathy (DR) using deep learning system in the Yangtze River delta region of China. Ann Transl Med. 2021;9(3):226. doi:10.21037/atm-20-3275

2. Poplin R, Varadarajan AV, Blumer K, et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng. 2018;2(3):158-164. doi:10.1038/s41551-018-0195-0
