• 基于Pacbio第三代测序技术的厚朴基因组测序分析

    Subjects: Biology >> Botany >> Applied botany submitted time 2020-05-28 Cooperative journals: 《广西植物》

    Abstract: Magnolia officinalis is a famous traditional medicinal plant, belonging to the Magnoliaceae family and Magnolia genus and being widely cultivated in China. Its bark, root bark, branch bark, leaves, flowers and fruits could be used as medicine or food. However, the whose genome information is little known for this plant species. In order to obtain the whole genome sequence information of M. officinalis, the leaf DNA was used as the material, and the third-generation sequencing technology of Pacbio Sequel was used to establish its nucleotide sequence database. Then genome assembly, function annotation and evolution analysis were carried out by bioinformatic methods. The experimental results showed that 140.91 Gb third-generation data were obtained after the original sequencing data, with the Read N50 about 13 784 bp. The assembled M. officinalis genome size was 1.68 Gb, Contig N50 being about 222 069 bp, and the integrity of single copy gene being 78.05 %. 98.40% of the genes from the assembled sequence got gene annotation after being compared with functional databases such as NR, KOG and KEGG. The result of KOG gene annotation was that the protein function of M. officinalis concentrated in the general functional prediction only, posttranslational modification, protein turnover, chaperones signal transduction mechanisms. GO functional classification indicated that the genes of M. officinalis concentrated on cell components and biological processes. KEGG analysis found that the M. Officinalis genes mostly involved in metabolic pathways. By comparative genomics analysis, the genomes of Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Poplar trichocarpa, Ginkgo biloba, Amborella trichopoda, Camellia sinensis and Cinnamomum kanehirae were aligned. It was found that 20 801 of 23 424 genes in M. officinalis could be classified into 12 129 families, 515 gene families being unique to M. officinalis. The genetic evolution tree constructed from the genomes of the selected reference species pointed that the M. officinalis (Magnoliaceae) was closely related to Cinnamomum kanehirae (Lauraceae), and the divergence time between the two species was about 122.5 mya. It is the first time to use the third-generation sequencing technology to analyze the whole genome of M. officinalis in the study. The study is conducive to its further development and utilization, and also provides the information for the study of the whole genome of other medicinal plants.