Data underlying the article: Pan-cancer in silico analysis of somatic mutations in G-protein coupled receptors: The effect of evolutionary conservation and natural variance
doi:10.4121/15022410.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/15022410
doi: 10.4121/15022410
Datacite citation style:
Bongers, Brandon; Gorostiola Gonzalez, Marina; Wang, Xuesong; W. T. van Vlijmen, Herman; Jespers, Willem et. al. (2021): Data underlying the article: Pan-cancer in silico analysis of somatic mutations in G-protein coupled receptors: The effect of evolutionary conservation and natural variance. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/15022410.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
This repository contains the datasets and source code supporting the conclusions of the manuscript "Pan-cancer in silico analysis of somatic mutations in G-protein coupled receptors: The effect of evolutionary conservation and natural variance". G protein-coupled receptors (GPCRs) form the most frequently
exploited drug target family, moreover they are often found mutated in cancer. Here
we used an aggregated dataset of mutations found in cancer patient samples derived
from the Genomic Data Commons and compared it to the natural human variance as
exemplified by data from the 1000 Genomes project. We investigated the location of these
mutations across the protein domains and conserved
residues in GPCRs such as the “DRY” motif. We subsequently created a ranking of
high scoring GPCRs, using a multi-objective approach (Pareto Front Ranking). In conclusion, this study identifies a
list of GPCRs that are prioritized for experimental follow up characterization to
elucidate their role in cancer. The computational approach here described can
be adapted to investigate the roles in cancer of any protein family.
history
- 2021-10-26 first online, published, posted
publisher
4TU.ResearchData
format
/data contains .txt tab-separated files and .csv comma-separated files
/GDC_dataset contains 7a compressed .sql file
/PP_scripts contains .xml files
funding
- STW-Veni #14410
- Oncode Institute funding
organizations
Leiden Academic Centre for Drug Research (LACDR), The NetherlandsLeiden University, The Netherlands
Oncode Institute, The Netherlands
Janssen Pharmaceutica NV, Beerse, Belgium
Uppsala University, Sweden
Xi'an Jiaotong University, China
DATA
files (2)
- 1,542 bytesMD5:
96a5cc396f42a63380ff2cf658e76957
README.txt - 2,859,848,869 bytesMD5:
f4447189b5ac1e54a9a3e50ae9f6df0e
Supplementary_information.zip -
download all files (zip)
2,859,850,411 bytes unzipped