Source author record

Cecilia S. Lee

Cecilia S. Lee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning Other Quantitative Biology

Catalog footprint

What is connected

2works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

Open Data Sharing in Clinical Research and Participants Privacy: Challenges and Opportunities in the Era of Artificial Intelligence

Sharing clinical research data is key for increasing the pace of medical discoveries that improve human health. However, concern about study participants' privacy, confidentiality, and safety is a major factor that deters researchers from openly sharing clinical data, even after deidentification. This concern is further heightened by the evolution of artificial intelligence (AI) approaches that pose an ever-increasing threat to the reidentification of study participants. Here, we discuss the challenges AI approaches create that blur the lines between identifiable and non-identifiable data. We present a concept of pseudo-reidentification, and discuss how these challenges provide opportunities for rethinking open data sharing practices in clinical research. We highlight the novel open data sharing approach we have established as part of the Artificial Intelligence Ready and Exploratory Atlas for Diabetes Insights project, one of the four Data Generation Projects funded by the National Institutes of Health Common Fund's Bridge2AI Program.

preprint2016arXiv

Deep learning is effective for the classification of OCT images of normal versus Age-related Macular Degeneration

Objective: The advent of Electronic Medical Records (EMR) with large electronic imaging databases along with advances in deep neural networks with machine learning has provided a unique opportunity to achieve milestones in automated image analysis. Optical coherence tomography (OCT) is the most commonly obtained imaging modality in ophthalmology and represents a dense and rich dataset when combined with labels derived from the EMR. We sought to determine if deep learning could be utilized to distinguish normal OCT images from images from patients with Age-related Macular Degeneration (AMD). Methods: Automated extraction of an OCT imaging database was performed and linked to clinical endpoints from the EMR. OCT macula scans were obtained by Heidelberg Spectralis, and each OCT scan was linked to EMR clinical endpoints extracted from EPIC. The central 11 images were selected from each OCT scan of two cohorts of patients: normal and AMD. Cross-validation was performed using a random subset of patients. Area under receiver operator curves (auROC) were constructed at an independent image level, macular OCT level, and patient level. Results: Of an extraction of 2.6 million OCT images linked to clinical datapoints from the EMR, 52,690 normal and 48,312 AMD macular OCT images were selected. A deep neural network was trained to categorize images as either normal or AMD. At the image level, we achieved an auROC of 92.78% with an accuracy of 87.63%. At the macula level, we achieved an auROC of 93.83% with an accuracy of 88.98%. At a patient level, we achieved an auROC of 97.45% with an accuracy of 93.45%. Peak sensitivity and specificity with optimal cutoffs were 92.64% and 93.69% respectively. Conclusions: Deep learning techniques are effective for classifying OCT images. These findings have important implications in utilizing OCT in automated screening and computer aided diagnosis tools.