Abstract
Purpose:
Primary mitochondrial diseases (PMD) arise from variants in the mitochondrial or nuclear genomes. Phenotype-based recognition of specific PMD genotypes remains difficult, prolonging the diagnostic odyssey. We expanded the MitoPhen database to characterize phenotypic variation across PMD more systematically.
Methods:
Individual-level data on mitochondrial DNA disorders, nuclear-encoded mitochondrial diseases, and single large-scale mitochondrial DNA deletions were manually curated with Human Phenotype Ontology (HPO) terms to produce MitoPhen v2. Principal-component analysis summarized system-level abnormalities; HPO-level enrichment and mean phenotype-similarity scores were then used to distinguish common PMD genotypes.
Results:
MitoPhen v2 adds 3940 individuals to the original release, now encompassing 1597 publications, 10,626 individuals, and 117 genotypes. Among 7586 affected cases, 72,861 HPO terms were recorded. Principal-component analysis revealed 6 phenotype dimensions capturing most system-level variance. At the HPO level, we observed genotype-specific enrichments and identified 111 gene-phenotype links absent from the current HPO database. Using MT-TL1, single large-scale mitochondrial DNA deletions, and POLG as exemplars, phenotype-similarity scores reliably separated individuals with these genotypes from those without.
Conclusion:
MitoPhen v2 enabled systematic, genotype-aware analysis of heterogeneous PMD phenotypes and highlighted the diagnostic value of structured, individual-level data. Phenotype-similarity metrics from such data sets can refine variant interpretation in large rare-disease cohorts and provide a transferable framework for other phenotypically complex genetic disorders.
Primary mitochondrial diseases (PMD) arise from variants in the mitochondrial or nuclear genomes. Phenotype-based recognition of specific PMD genotypes remains difficult, prolonging the diagnostic odyssey. We expanded the MitoPhen database to characterize phenotypic variation across PMD more systematically.
Methods:
Individual-level data on mitochondrial DNA disorders, nuclear-encoded mitochondrial diseases, and single large-scale mitochondrial DNA deletions were manually curated with Human Phenotype Ontology (HPO) terms to produce MitoPhen v2. Principal-component analysis summarized system-level abnormalities; HPO-level enrichment and mean phenotype-similarity scores were then used to distinguish common PMD genotypes.
Results:
MitoPhen v2 adds 3940 individuals to the original release, now encompassing 1597 publications, 10,626 individuals, and 117 genotypes. Among 7586 affected cases, 72,861 HPO terms were recorded. Principal-component analysis revealed 6 phenotype dimensions capturing most system-level variance. At the HPO level, we observed genotype-specific enrichments and identified 111 gene-phenotype links absent from the current HPO database. Using MT-TL1, single large-scale mitochondrial DNA deletions, and POLG as exemplars, phenotype-similarity scores reliably separated individuals with these genotypes from those without.
Conclusion:
MitoPhen v2 enabled systematic, genotype-aware analysis of heterogeneous PMD phenotypes and highlighted the diagnostic value of structured, individual-level data. Phenotype-similarity metrics from such data sets can refine variant interpretation in large rare-disease cohorts and provide a transferable framework for other phenotypically complex genetic disorders.
| Original language | English |
|---|---|
| Article number | 101620 |
| Number of pages | 17 |
| Journal | Genetics in Medicine |
| Volume | 28 |
| Issue number | 1 |
| Early online date | 24 Oct 2025 |
| DOIs | |
| Publication status | E-pub ahead of print - 24 Oct 2025 |
Bibliographical note
Publisher Copyright:© 2025 The Authors. Published by Elsevier Inc. on behalf of American College of Medical Genetics and Genomics.