Data and code underlying the publication: Machine Learning-Assisted Pathway Optimization in Large Combinatorial Design Spaces: a p-Coumaric Acid Case Study
DOI:10.4121/fa0782ab-760d-4fa7-babf-09bdaab0f509.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
DOI: 10.4121/fa0782ab-760d-4fa7-babf-09bdaab0f509
Datacite citation style
van Lent, Paul; Abeel, Thomas; van der Hoek, Rianne; Schmitz, Joep; Moreno Paz, Sara et al. (2025): Data and code underlying the publication: Machine Learning-Assisted Pathway Optimization in Large Combinatorial Design Spaces: a p-Coumaric Acid Case Study. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/fa0782ab-760d-4fa7-babf-09bdaab0f509.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
This repository contains the code needed to reproduce the results of the paper Machine Learning-Assisted Pathway Optimization in Large Combinatorial Design Spaces: a p-Coumaric Acid Case Study. The data consist of:
- NMR screening data of ~3000 engineered S. cerevisiae strains (in code_mlassisted_pca.zip, data/raw/CycleTUD/) and of the follow-up DBTL cycle (data/raw/CycleTUDValidation)
- DNA sequencing data (.fasta format, in 4tu_mlassisted_pca_fasta) and per-contig coverage files.
- Processed count matrices and numerical matrices that were downstream-processed from the DNA sequencing data files using .gff biobricks files.
The code is also available in the GitHub repository:
https://github.com/AbeelLab/ml-assisted-p-coumaric-acid-optimization
History
- 2025-11-18 first online, published, posted
Publisher
4TU.ResearchData
Organizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Science, Intelligent Systems
DSM-Firmenich, Department of Science and Research
Broad Institute of MIT and Harvard, Infectious Disease and Microbiome Program
DATA
Files (2)
- 4tu_mlassisted_pca_fasta.zip, 1,492,793,010 bytes, MD5: 9479b8e1aa08e63222094c0e85d68658
- code_mlassisted_pca.zip, 198,587,496 bytes, MD5: cdd853f72cf692f74612e385566c32ca
Total for all files: 1,691,380,506 bytes unzipped
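
The MD5 checksums listed for the two files can be used to check that a download completed without corruption. The following is a minimal sketch, not part of the published code: the helper names md5_of_file and verify_downloads are hypothetical, and the script assumes the two archives were saved under their original file names in the current directory.

```python
import hashlib
from pathlib import Path

# Checksums as listed on this page (file name -> MD5 hex digest).
EXPECTED_MD5 = {
    "4tu_mlassisted_pca_fasta.zip": "9479b8e1aa08e63222094c0e85d68658",
    "code_mlassisted_pca.zip": "cdd853f72cf692f74612e385566c32ca",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True (match), False (mismatch) or None (missing)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = (md5_of_file(path) == expected) if path.is_file() else None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "not found"}
    for name, status in verify_downloads().items():
        print(f"{name}: {labels[status]}")
```

Reading in chunks keeps memory use low for the larger archive (about 1.5 GB). A reported mismatch usually means the download was incomplete and should be repeated.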





