ClinVar and Gene Homology

In order to recieve or download the high-resolution image of graphs, please click the graphs you interested in.




Distribution of the homologs of the genes in ClinVar across the species:

Firstly, we compiled Chimp, Macaca, Mouse, Rat, and Xenopus orthologous proteins from MGI Homology List. When we noticed that many Chimp and Macaca genes do not have human counterparts, we performed BLASTp analysis for the remaining genes. DIOPT (DRSC Integrative Ortholog Prediction Tool) was used to find human orthologs in Drosophila while human orthologs for C. elegans were generated with an in-house BLASTp pipeline together with the integration of literature search. For Zebrafish and human orthology matching, https://zfin.org/downloads/human_orthos.txt database was used. All the resulting the orthology gene list was compiled to achieve high accuracy in orthology, which is essentially needed for comparative analyses.

We then obtained all human gene IDs from the ClinVar file downloaded in February 2019 and matched them in other species with their counterparts using the homology curation table explained above. We are currently in the process of automating the update of the data sets from ClinVar, gnomAD, COSMIC and PTMs. The codes are also available in our GitHub repository.