Aggregation of population-based genetic variation over protein domain homologues and its potential use in genetic diagnostics
Publication year
2017Source
Human Mutation, 38, 11, (2017), pp. 1454-1463ISSN
Publication type
Article / Letter to editor
Display more detailsDisplay less details
Organization
CMBI
Human Genetics
Bioinformatics
Journal title
Human Mutation
Volume
vol. 38
Issue
iss. 11
Page start
p. 1454
Page end
p. 1463
Subject
Bioinformatics; Radboudumc 19: Nanomedicine RIMLS: Radboud Institute for Molecular Life Sciences; Radboudumc 6: Metabolic Disorders RIMLS: Radboud Institute for Molecular Life Sciences; Radboudumc 7: Neurodevelopmental disorders DCMN: Donders Center for Medical Neuroscience; CMBI - Radboud University Medical Center; Human Genetics - Radboud University Medical CenterAbstract
Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population-based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease-causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 "meta-domains." These meta-domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta-domains high-frequency population variation re-occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta-domains can improve the interpretation of genetic variation.
This item appears in the following Collection(s)
- Academic publications [244228]
- Electronic publications [131195]
- Faculty of Medical Sciences [92893]
- Faculty of Science [37109]
- Open Access publications [105220]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.