Comparison of 432 Pseudomonas strains through integration of genomic, functional, metabolic and expression data
Datacite citation style:
Koehorst, J.J. (Jasper Jan) (2018): Comparison of 432 Pseudomonas strains through integration of genomic, functional, metabolic and expression data. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:948c10fe-7ea5-47f3-bdb2-d5b0f908820b
Dataset
Pseudomonas is a highly versatile genus: some species are harmful to humans and plants, while others are widely used for bioengineering and bioremediation. We analysed 432 sequenced Pseudomonas strains by integrating results from a large-scale functional comparison using protein domains with data from six metabolic models, nearly a thousand transcriptome measurements and four large-scale transposon mutagenesis experiments. Through heterogeneous data integration we linked gene essentiality, persistence and expression variability. The pan-genome of Pseudomonas is closed, indicating a limited role of horizontal gene transfer in the evolutionary history of this genus. A large fraction of essential genes are highly persistent, yet non-essential genes still represent a considerable fraction of the core genome. Our results emphasize the power of integrating large-scale comparative functional genomics with heterogeneous data for exploring bacterial diversity and versatility.
history
- 2018-09-26 first online, published, posted
publisher
4TU.Centre for Research Data
format
media types: application/x-gzip, text/plain
organizations
Wageningen University & Research, Department of Agrotechnology and Food Sciences
DATA
files (2)
- README.txt (1,199 bytes), MD5: 810ea579c459f02d1dec91e61241a71c
- pseudomonas.rdf.gz (5,337,078,004 bytes), MD5: 31853fd46675bdc25313623045aea679
total size: 5,337,079,203 bytes unzipped
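
Because pseudomonas.rdf.gz is over 5 GB, it may be worth checking a download against the MD5 checksums listed above before using it. The following is a minimal sketch, not part of the dataset itself: it assumes both files were saved under their original names in one directory, and the helper names md5_of_file and verify_downloads are hypothetical. The file is read in chunks so that the archive never has to fit in memory.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this dataset record.
EXPECTED_MD5 = {
    "README.txt": "810ea579c459f02d1dec91e61241a71c",
    "pseudomonas.rdf.gz": "31853fd46675bdc25313623045aea679",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True only if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results


if __name__ == "__main__":
    # Assumption: the downloaded files are in the current working directory.
    for name, ok in verify_downloads().items():
        print(f"{name}: {'OK' if ok else 'missing or checksum mismatch'}")
```

A mismatch usually points to an interrupted or corrupted transfer, in which case downloading the file again is the simplest remedy.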