%0 Generic %A Cleij, Céline %A Sierra Heras, Laura %A Zwiers, Ellen %A Daran-Lapujade, Pascale %A Danelon, Christophe %D 2025 %T Data underlying the publication "Assembly and cell-free expression of a partial genome for the synthetic cell" %U %R 10.4121/4eb28eac-b94c-4e97-9db6-4cd3af73fad2.v1 %K Minimal synthetic cell %K Synthetic genomics %K Cell-free gene expression %K DNA assembly %K PURE system %K Saccharomyces cerevisiae %X

*** Assembly and cell-free expression of a partial genome for the synthetic cell ***


Authors: Céline Cleij, Laura Sierra Heras, Ellen Zwiers, Pascale Daran-Lapujade, Christophe Danelon

Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology;

Department of Biotechnology, Delft University of Technology;

Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA


Corresponding authors: Pascale Daran-Lapujade and Christophe Danelon

Contact information: p.a.s.daran-lapujade@tudelft.nl and danelon@insa-toulouse.fr


*** General introduction ***

This dataset contains data collected during experiments for the manuscript "Assembly and cell-free expression of a partial genome for the synthetic cell". Data was collected in 2021-2025.


*** Methodological information ***

Supplementary data 1-4 were prepared in Excel. The overview of relevant mutations in Supplementary data 4.5 is based on mutations in consensus sequences and raw reads obtained from sequencing.

Supplementary data 5: Designed MSG sequences (GenBank) were prepared with the SnapGene software, using the plasmid maps of the sequenced template plasmids and the designed primer sequences.

Supplementary data 6 and 7: Raw Nanopore sequencing reads (FASTQ) were obtained in house using Nanopore sequencing technology (for MSG0.1 and MSG0.2) or by Plasmidsaurus (Eugene, OR, USA) using Nanopore sequencing technology (for MSG1).

Consensus SynChrs sequences (GenBank) for MSG0.1 and MSG0.2 were obtained after de novo assembly of the processed Nanopore sequencing reads using Flye or Canu. If necessary, a consensus SynChr sequence was assembled in SnapGene using information from the Flye and Canu assemblies and raw reads. Consensus sequences (GenBank) for MSG1 were obtained by Plasmidsaurus after processing of the raw reads, and were manually annotated in SnapGene.


All data processing and analysis steps are described in detail in the Methods section of the chapter.


*** Organization of the dataset ***

See README file.

%I 4TU.ResearchData