Data underlying the article: Pan-cancer in silico analysis of somatic mutations in G-protein coupled receptors: The effect of evolutionary conservation and natural variance
datasetposted on 26.10.2021, 11:05 authored by Brandon BongersBrandon Bongers, Marina Gorostiola GonzalezMarina Gorostiola Gonzalez, Xuesong Wang, Herman W. T. van Vlijmen, Willem Jespers, Hugo Gutiérrez-de-Terán, Kai Ye, Adriaan P. IJzerman, Laura H. Heitman, Gerard van WestenGerard van Westen
This repository contains the datasets and source code supporting the conclusions of the manuscript "Pan-cancer in silico analysis of somatic mutations in G-protein coupled receptors: The effect of evolutionary conservation and natural variance". G protein-coupled receptors (GPCRs) form the most frequently exploited drug target family, moreover they are often found mutated in cancer. Here we used an aggregated dataset of mutations found in cancer patient samples derived from the Genomic Data Commons and compared it to the natural human variance as exemplified by data from the 1000 Genomes project. We investigated the location of these mutations across the protein domains and conserved residues in GPCRs such as the “DRY” motif. We subsequently created a ranking of high scoring GPCRs, using a multi-objective approach (Pareto Front Ranking). In conclusion, this study identifies a list of GPCRs that are prioritized for experimental follow up characterization to elucidate their role in cancer. The computational approach here described can be adapted to investigate the roles in cancer of any protein family.