####################################################################################################################################################### Pan-cancer in silico analysis of somatic mutations in G-protein coupled receptors: The effect of evolutionary conservation and natural variance ---------------------------- SUPPLEMENTARY INFORMATION Last updated: October 25 2021 ---------------------------- Authors: BJ Bongers*, M Gorostiola González*, X Wang, HWT van Vlijmen, W Jespers, H Gutiérrez-de-Terán, K Ye, AP IJzerman, LH Heitman, GJP van Westen** * Equal contribution ** Corresponding author (email: gerard@lacdr.leidenuniv.nl) ####################################################################################################################################################### This repository contains the datasets and source code supporting the conclusions of the manuscript; organized in the following directories: /data Contains the raw (/rawdata) and derived data from the PP scripts. /GDC_dataset Contains the GDC v22.0 mySQL dump; as well as the full and simplified schema of the SQL database. /PP_scripts Contains the Acelerys Pipeline Pilot version 18 scripts used to process and analyze the data. They can be used out of the box, but the paths used in the protocols need to point to the correct location. The /GDC_dataset/GDC_22_SQL-dump_20210212.sql.7a and /data/rawdata/uniprot_variants_Oct2020_1000G.txt were the starting point for the GDC and 1000 Genomes datasets.