Comparison of 432 Pseudomonas strains through integration of genomic, functional, metabolic and expression data (dataset)

Comparison of 432 Pseudomonas strains through integration of genomic, functional, metabolic and expression data

doi: 10.4121/uuid:948c10fe-7ea5-47f3-bdb2-d5b0f908820b

Datacite citation style:

Koehorst, J.J. (Jasper Jan) (2018): Comparison of 432 Pseudomonas strains through integration of genomic, functional, metabolic and expression data. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:948c10fe-7ea5-47f3-bdb2-d5b0f908820b

Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite

Dataset

usage stats

1561

views

446

downloads

categories

keywords

database, genome annotation, protein domains, Pseudomonas, Semantic web

licence

CC BY 4.0

export as...

RefWorks, BibTeX, Reference Manager, Endnote, DataCite, NLM, DC, CFF

by J.J. (Jasper Jan) Koehorst

Pseudomonas is a highly versatile genus containing species that can be harmful to humans and plants while others are widely used for bioengineering and bioremediation. We analysed 432 sequenced Pseudomonas strains by integrating results from a large scale functional comparison using protein domains with data from six metabolic models, nearly a thousand transcriptome measurements and four large scale transposon mutagenesis experiments. Through heterogeneous data integration we linked gene essentiality, persistence and expression variability. The pan-genome of Pseudomonas is closed indicating a limited role of horizontal gene transfer in the evolutionary history of this genus. A large fraction of essential genes are highly persistent, still non essential genes represent a considerable fraction of the core-genome. Our results emphasize the power of integrating large scale comparative functional genomics with heterogeneous data for exploring bacterial diversity and versatility.

history