Data underlying the publication "Assembly and cell-free expression of a partial genome for the synthetic cell"
DOI: 10.4121/4eb28eac-b94c-4e97-9db6-4cd3af73fad2
Datacite citation style
Dataset
*** Assembly and cell-free expression of a partial genome for the synthetic cell ***
Authors: Céline Cleij, Laura Sierra Heras, Ellen Zwiers, Pascale Daran-Lapujade, Christophe Danelon
Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology;
Department of Biotechnology, Delft University of Technology;
Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA
Corresponding authors: Pascale Daran-Lapujade and Christophe Danelon
Contact information: p.a.s.daran-lapujade@tudelft.nl and danelon@insa-toulouse.fr
*** General introduction ***
This dataset contains data collected during experiments for the manuscript "Assembly and cell-free expression of a partial genome for the synthetic cell". Data was collected in 2021-2025.
*** Methodological information ***
Supplementary data 1-4 were prepared in Excel. The overview of relevant mutations in Supplementary data 4.5 is based on mutations in consensus sequences and raw reads obtained from sequencing.
Supplementary data 5: Designed MSG sequences (GenBank) were prepared with the SnapGene software, using the plasmid maps of the sequenced template plasmids and the designed primer sequences.
Supplementary data 6 and 7: Raw Nanopore sequencing reads (FASTQ) were obtained in house using Nanopore sequencing technology (for MSG0.1 and MSG0.2) or by Plasmidsaurus (Eugene, OR, USA) using Nanopore sequencing technology (for MSG1).
Consensus SynChrs sequences (GenBank) for MSG0.1 and MSG0.2 were obtained after de novo assembly of the processed Nanopore sequencing reads using Flye or Canu. If necessary, a consensus SynChr sequence was assembled in SnapGene using information from the Flye and Canu assemblies and raw reads. Consensus sequences (GenBank) for MSG1 were obtained by Plasmidsaurus after processing of the raw reads, and were manually annotated in SnapGene.
All data processing and analysis steps are described in detail in the Methods section of the chapter.
*** Organization of the dataset ***
See README file.
History
- 2025-10-06 first online, published, posted
Publisher
4TU.ResearchDataFormat
Supplementary data 1-4/Excel; Supplementary data 5 Designed sequences/GenBank; Supplementary data 6 & 7 Consensus sequences/GenBank; Supplementary data 6 & 7 Raw reads/FASTQFunding
- BaSyC – Building a Synthetic Cell Gravitation grant (grant code 024.003.019) NWO
- ANR grant (grant code ANR-22-CPJ2-0091-01) Agence Nationale de la Recherche
Organizations
Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of TechnologyTU Delft, Faculty of Applied Sciences, Department of Biotechnology
Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA
DATA
Files (8)
- 5,765 bytesMD5:
b359c12f2738814b9104997982a1a1cc
README.txt - 17,993 bytesMD5:
43a7e39de43dd8e54bc2406027ad14f3
Supplementary data 1 Strains used in this study.xlsx - 28,788 bytesMD5:
3f3a3eb8ef732145da7d62977596d9d6
Supplementary data 2 Plasmids and linear DNA templates used in this study.xlsx - 34,314 bytesMD5:
75b164654032b92d993c89b348ec8cd2
Supplementary data 3 Primers and SHRs used in this study.xlsx - 25,673 bytesMD5:
5502fc97331470ff78f2e92685763d83
Supplementary data 4 SynChrs constructed in this study.xlsx - 167,818 bytesMD5:
76bed6f8ddaf1c9aab2e746acc83b56d
Supplementary data 5 Designed MSG sequences.zip - 8,277,025,544 bytesMD5:
3efd02a061d919e8abba3e5935eb6bfa
Supplementary data 6 Sequencing data after total DNA isolation from S. cerevisiae.zip - 17,300,199 bytesMD5:
099aea25a3e61ba430490eb3ddf10a13
Supplementary data 7 Sequencing data after MSG1 isolation from E. coli.zip -
download all files (zip)
8,294,606,094 bytes unzipped