Development of self-phenotyping tools to empower patients and improve diagnostics

Kent Shefchek,Sonja Ziniel,Julie A McMurry,Catherine A Brownstein,John S Brownstein,Erin Rooney Riggs,Matthew Might,Damian Smedley,Amy Clugston,Alan H Beggs,Heather Paterson,Peter N Robinson,Nicole A Vasilevsky,Ingrid A Holm,Melissa A Haendel

DOI: https://doi.org/10.1101/2024.06.13.24308791

2024-06-14

Abstract:Deep phenotyping is important for improving diagnostics and rare diseases research and is especially effective when standardized using Human Phenotype Ontology (HPO). Patients are an under-utilized source of information, so to facilitate self-phenotyping we previously 'translated' HPO into plain language ('layperson HPO'). Another self-phenotyping survey, GenomeConnect, asks patient-friendly questions that map to HPO. However, self-reported data has not been assessed. Since not all HPO terms are translated to layperson HPO or in the GenomeConnect survey, we created theoretical maximum-accuracy phenotype profiles for each disease for each instrument, representing the theoretical maximum performance. Both instruments performed well in analyses of semantic similarity (area under the curve 0.991 and 0.954, respectively). To explore the real-world implications, we randomized participants with diagnosed genetic diseases to complete the GenomeConnect, Phenotypr, or both instruments. For each diagnosed disease, we compared the derived disease profile to the patient-completed profile for each instrument. Profiles resulting from participant responses to the GenomeConnect survey were more accurate than to the Phenotypr instrument. The Phenotypr instrument had a tighter distribution of scores for respondents who did both instruments and was therefore more precise. We evaluated the ability of each known Mendelian disease HPO phenotype profile to retrieve the corresponding disease. We conducted interviews and generally participants preferred the GenomeConnect multiple choice format over the autocomplete Phenotypr format. Our results demonstrate that individuals can provide rich HPO phenotype data. These results suggest that self-phenotyping source of information could be used to support diagnostics or supplement profiles created by clinicians.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to enhance patient engagement and improve the diagnosis of rare diseases by developing self - phenotyping tools. Specifically, the research focuses on two self - phenotyping methods: the GenomeConnect questionnaire and Layperson HPO (i.e., the plain - language version of the Human Phenotype Ontology). These two methods aim to enable patients to describe their symptoms in a structured manner, thereby supplementing the phenotypic data created by clinicians, supporting the diagnostic process or being used for research. The main objectives of the study include: 1. **Evaluating the effectiveness of self - phenotyping tools**: By comparing the phenotypic data generated by these tools with the standard clinical phenotypic data (i.e., Gold Standard), evaluate the accuracy and practicality of these self - phenotyping tools in practical applications. 2. **Exploring the actual performance of patients**: By randomly assigning participants to complete the GenomeConnect questionnaire, the Phenotypr tool, or both, compare the performance of different tools in actual use. 3. **Verifying the accuracy of self - phenotyping data**: By calculating semantic similarity (using the PhenoDigm algorithm) to assess whether the structured phenotypic data generated by patients accurately reflects their real phenotypic characteristics. 4. **Improving diagnostic efficiency and patient engagement**: The ultimate goal is to integrate self - phenotyping tools into the patient assessment process to promote clinical diagnosis and research, while giving patients more autonomy so that they can actively participate in their own medical care. The research methods include the generation of theoretically maximum accuracy (TMax) phenotypic data, the generation of simulated "real - life" (Sim) phenotypic data, and the analysis of the phenotypic data actually generated by patients. Through these methods, the research team hopes to prove that the structured phenotypic data provided by patients can be used as valuable supplementary information to support diagnosis or research.

Development of self-phenotyping tools to empower patients and improve diagnostics

A Robust Phenotype-Driven Likelihood Ratio Analysis Approach Assisting Interpretable Clinical Diagnosis of Rare Diseases.

Towardcross-Platformelectronic Health Record-Drivenphenotyping Using Clinical Quality Language

Usability of a Phenotype Builder Prototype and Lessons Learned for the Design of Phenotyping Tools

Design and Validation of a FHIR-based EHR-driven Phenotyping Toolbox

Enabling phenotypic big data with PheNorm.

Improving Automated Deep Phenotyping Through Large Language Models Using Retrieval Augmented Generation

Talking about diseases; developing a model of patient and public-prioritised disease phenotypes

Precision medicine integrating whole-genome sequencing, comprehensive metabolomics, and advanced imaging

Phen2Disease: a phenotype-driven model for disease and gene prioritization by bidirectional maximum matching semantic similarities.

Review and evaluation of electronic health records-driven phenotype algorithm authoring tools for clinical and translational research.

Deep phenotyping for precision medicine

PhenoPro: a Novel Toolkit for Assisting in the Diagnosis of Mendelian Disease

Novel phenotype–disease matching tool for rare genetic diseases

PheNominal: an EHR-integrated web application for structured deep phenotyping at the point of care

Electronic Health Record Phenotyping with Internally Assessable Performance (PhIAP) using Anchor-Positive and Unlabeled Patients

PhenoScore quantifies phenotypic variation for rare genetic diseases by combining facial analysis with other clinical features using a machine-learning framework

TherapyPal: Towards a Privacy-Preserving Companion Diagnostic Tool Based on Digital Symptomatic Phenotyping

High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP)

Standardised and Reproducible Phenotyping Using Distributed Analytics and Tools in the Data Analysis and Real World Interrogation Network (DARWIN EU)

The point‐of‐care use of a facial phenotyping tool in the genetics clinic: Enhancing diagnosis and education with machine learning