The workflow for constructing the Moutai Microbiome Catalog is provided in the following figure:
Schematic for constructing the Moutai microbiome catalog and the key results. (A) Taxonomic structure analysis of the fermented grain microbiome. This module includes biomarker analysis, network analysis, community assembly process analysis, and Mantel test analysis. (B) Functional and taxonomic annotation of the gene catalog. The number and percentage of genes were calculated by annotation analyses against the mainstream functional databases (Swiss-Prot, UniRef50, TrEMBL, eggNOG, KEGG, COG, GO, EC, CAZy, and CARD). (C) Genome profile and biosynthetic gene clusters (BGCs) as examined by the analytical pipeline. This module includes BGC prediction and annotation, species-level clustering of MAGs, and taxonomic and functional annotation of MAGs. Throughout the figure, green rectangles show the main results, blue parallelograms represent the analytic procedures, and the software applied in each analytic procedure is shown in purple text. Yellow cylinders represent the databases used for MTFGC analysis.
The name, version, and availability of the software used for constructing the Moutai Microbiome Catalog are listed below:
The name, description, and availability of the databases used for constructing the Moutai Microbiome Catalog are listed below:
The raw metagenomic data can be accessed at the GSA database (https://ngdc.cncb.ac.cn/gsub/) under BioProject ID PRJCA018633 for fermented grain samples and PRJCA018634 for starter samples. The data used for constructing MTFGC can be accessed in the National Genomics Data Center (NGDC) database (https://ngdc.cncb.ac.cn/) under BioProject ID PRJCA018633, with accession IDs CRA014449 and CRA012433 for raw data, GWHERCQ00000000 and GWHERDV00000000 for MAGs (MTFGC-Genome), and GWHERDA00000000 and GWHERDZ00000000 for nonredundant genes (MTFGC-Gene). The data used for constructing MTC can be accessed in the NGDC database under BioProject IDs PRJCA018633 and PRJCA018634, with accession IDs CRA012433 and CRA012434 for raw data, GWHERDW00000000 for MAGs (MTC-Genome), and GWHERDX00000000 for nonredundant genes (MTC-Gene).
The R scripts for the bioinformatics analysis of the Moutai microbiome are provided in the directory "scripts/".
The test data are provided in the directory "data/".
The results are provided in the directory "Figures/".
To use the scripts, download the whole package and run them in R.
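As a rough illustration of the download-and-run step, the sketch below defines a hypothetical shell helper, `run_moutai_scripts`, that runs every R script found under "scripts/" of a local copy of the package. It assumes that the script files end in `.R`, that they can run non-interactively with `Rscript`, and that they expect the package root as the working directory; none of this is confirmed by the repository, so check the individual scripts before relying on it.

```shell
# Hypothetical helper (not part of the repository): run each R script in
# <package_dir>/scripts/ in turn, stopping at the first failure.
# Assumptions: scripts end in .R, run non-interactively, and expect the
# package root as the working directory.
run_moutai_scripts() (
    package_dir="${1:-.}"
    # RSCRIPT may be overridden, e.g. RSCRIPT=echo for a dry run.
    rscript="${RSCRIPT:-Rscript}"
    found=0

    cd "$package_dir" || return 1
    for script in scripts/*.R; do
        [ -f "$script" ] || continue   # glob matched nothing
        found=1
        echo "Running $script"
        "$rscript" "$script" || return 1
    done

    if [ "$found" -eq 0 ]; then
        echo "No R scripts found in $package_dir/scripts/" >&2
        return 1
    fi
)

# Example usage (requires git, R, and network access):
#   git clone https://github.com/zhuxue1002/Moutai-Catalog
#   run_moutai_scripts Moutai-Catalog
```

The function body runs in a subshell, so the caller's working directory is left unchanged. Setting `RSCRIPT=echo` prints the scripts that would be run without executing them, which is a cheap way to check the assumptions first.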
For more details, please refer to https://github.com/zhuxue1002/Moutai-Catalog/