Development of self-phenotyping tools to empower patients and improve diagnostics

Kent Shefchek,Sonja Ziniel,Julie A McMurry,Catherine A Brownstein,John S Brownstein,Erin Rooney Riggs,Matthew Might,Damian Smedley,Amy Clugston,Alan H Beggs,Heather Paterson,Peter N Robinson,Nicole A Vasilevsky,Ingrid A Holm,Melissa A Haendel
DOI: https://doi.org/10.1101/2024.06.13.24308791
2024-06-14
Abstract:Deep phenotyping is important for improving diagnostics and rare diseases research and is especially effective when standardized using Human Phenotype Ontology (HPO). Patients are an under-utilized source of information, so to facilitate self-phenotyping we previously 'translated' HPO into plain language ('layperson HPO'). Another self-phenotyping survey, GenomeConnect, asks patient-friendly questions that map to HPO. However, self-reported data has not been assessed. Since not all HPO terms are translated to layperson HPO or in the GenomeConnect survey, we created theoretical maximum-accuracy phenotype profiles for each disease for each instrument, representing the theoretical maximum performance. Both instruments performed well in analyses of semantic similarity (area under the curve 0.991 and 0.954, respectively). To explore the real-world implications, we randomized participants with diagnosed genetic diseases to complete the GenomeConnect, Phenotypr, or both instruments. For each diagnosed disease, we compared the derived disease profile to the patient-completed profile for each instrument. Profiles resulting from participant responses to the GenomeConnect survey were more accurate than to the Phenotypr instrument. The Phenotypr instrument had a tighter distribution of scores for respondents who did both instruments and was therefore more precise. We evaluated the ability of each known Mendelian disease HPO phenotype profile to retrieve the corresponding disease. We conducted interviews and generally participants preferred the GenomeConnect multiple choice format over the autocomplete Phenotypr format. Our results demonstrate that individuals can provide rich HPO phenotype data. These results suggest that self-phenotyping source of information could be used to support diagnostics or supplement profiles created by clinicians.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to enhance patient engagement and improve the diagnosis of rare diseases by developing self - phenotyping tools. Specifically, the research focuses on two self - phenotyping methods: the GenomeConnect questionnaire and Layperson HPO (i.e., the plain - language version of the Human Phenotype Ontology). These two methods aim to enable patients to describe their symptoms in a structured manner, thereby supplementing the phenotypic data created by clinicians, supporting the diagnostic process or being used for research. The main objectives of the study include: 1. **Evaluating the effectiveness of self - phenotyping tools**: By comparing the phenotypic data generated by these tools with the standard clinical phenotypic data (i.e., Gold Standard), evaluate the accuracy and practicality of these self - phenotyping tools in practical applications. 2. **Exploring the actual performance of patients**: By randomly assigning participants to complete the GenomeConnect questionnaire, the Phenotypr tool, or both, compare the performance of different tools in actual use. 3. **Verifying the accuracy of self - phenotyping data**: By calculating semantic similarity (using the PhenoDigm algorithm) to assess whether the structured phenotypic data generated by patients accurately reflects their real phenotypic characteristics. 4. **Improving diagnostic efficiency and patient engagement**: The ultimate goal is to integrate self - phenotyping tools into the patient assessment process to promote clinical diagnosis and research, while giving patients more autonomy so that they can actively participate in their own medical care. The research methods include the generation of theoretically maximum accuracy (TMax) phenotypic data, the generation of simulated "real - life" (Sim) phenotypic data, and the analysis of the phenotypic data actually generated by patients. Through these methods, the research team hopes to prove that the structured phenotypic data provided by patients can be used as valuable supplementary information to support diagnosis or research.