Title of the dataset: Data underlying analysis of the microbial community composition in the manuscript " The impact of processing technology on microbial community composition and functional properties of Beninese maize ogi"
Authors: A. K. Carole Sanya, Anita R. Linnemann, Yann E. Madode, Sijmen E. Schoustra, Eddy J. Smid
Corresponding author: Eddy J. Smid
Contact information: eddy.smid@wur.nl

Description: Raw and cleaned data files related to 16S rRNA and ITS genes amplicon sequencing of the DNA extracted directly from maize starch samples after obtention (labelled _0h) and after fermentation (labelled _6h, _8h, _12h, or _24h, corresponding to the duration of the fermentation). The maize starch samples were obtained from starch slurries collected from traditional processors in Benin, West Africa. The fermented maize starch after fermentation is called ogi. The raw data were obtained from Novogene Company Ltd (Cambridge, United kingdom) and processed to generate clean data files, which were consequently used to investigate the microbial community composition in the maize starch and maize ogi samples. Details on the amplicon sequencing and bioinformatic analysis are provided as supplementary method attached to the related manuscript.

This dataset contains four folders:
- Folder 16SRaw_data_after_sequencing: 
All the raw data related to 16S rRNA genes amplicon sequencing of the DNA extracted from the maize starch and maize ogi samples are found here.
- Folder ITSRaw_data_after_sequencing: 
All the raw data related to ITS genes amplicon sequencing of the DNA extracted from the maize starch and maize ogi samples are found here.
Each of these two folders contain five files:
       --> ASV_table.txt: in the format .txt
       This file contains the amplicon sequencing variants (ASVs)
       --> taxa_table.txt: in the format .txt
       This file contains the taxonomic annotations
       --> sample_table.txt: in the format .txt
       This file contains the metadata information describing every samples
       --> phy_tree.nwk: in the format .nwk
       This file contains the phylogenetic information on the relationships of the ASVs
       --> refseq.Fasta: in the format .Fasta
       This file contains the nucleotide sequences in each of the ASVs

- Folder 16S_clean_for_data_analysis: 
All the clean data related to the raw 16S rRNA genes amplicon sequencing data are found here.
- Folder ITS_clean_for_data_analysis: 
All the clean data related to the raw ITS rRNA genes amplicon sequencing data are found here.
Each of these two folders contain three files:
       --> clean_ASV_table.txt: in the format .txt
       This file contains the remaining ASVs after the data were processed
       --> clean_Taxa_table.txt: in the format .txt
       This file contains the taxonomic annotations after the data were processed
       --> Metadata.txt: in the format .txt
       This file is the sample_table.txt file renamed
       
       

