On microbial community of Pyropia haitanensis by metagenomic analysis*

2021-06-15 08:25JunhaoWANGYunxiangMAOGuoyingDUXiaojiaoLIXianghaiTANG
Journal of Oceanology and Limnology 2021年3期

Junhao WANG , Yunxiang MAO ,2, Guoying DU , Xiaojiao LI, Xianghai TANG ,**

1 Key Laboratory of Marine Genetics and Breeding(Ministry of Education), College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China

2 Key Laboratory of Utilization and Conservation of Tropical Marine Bioresource(Ministry of Education), College of Fisheries and Life Science, Hainan Tropical Ocean University, Sanya 572022, China

Abstract Microorganisms plays an important role in the growth of Pyropia haitanensis. To understand the structural and functional diversity of the microorganism community of P. haitanensis (PH40), the associated metabolic pathway network in cluster of orthologous groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG), and carbohydrate-active enzymes (CAZymes) were explored in metagenomic analysis. DNA extraction from gametophytes of P. haitanensis was performed first, followed by library construction, sequencing, preprocessing of sequencing data, taxonomy assignment, gene prediction, and functional annotation. The results show that the predominant microorganisms of P. haitanensis were bacteria (98.98%), and the phylum with the highest abundance was Proteobacteria (54.64%), followed by Bacteroidetes (37.92%). Erythrobacter (3.98%) and Hyunsoonleella jejuensis (1.56%) were the genera and species with the highest abundance of bacteria, respectively. The COG annotation demonstrated that genes associated with microbial metabolism was the predominant category. The results of metabolic pathway annotation show that the ABC transport system and two-component system were the main pathways in the microbial community. Plant growth hormone biosynthesis pathway and multi-vitamin biosynthesis functional units (modules) were the other important pathways. The CAZyme annotation revealed that the starch might be an important carbon source for microorganisms. Glycosyl transferase family 2 (GT2) and glycosyl transferase family 3 (GT3) were the highly abundant families in glucoside transferase superfamily.Six metagenome-assembled genomes containing enzymes involved in the biosynthesis of cobalamin(vitamin B 12) and indole-3-acetic acid were obtained by binning method. They were confirmed to belong to Rhodobacterales and Rhizobiales, respectively. Our findings provide comprehensive insights into the microorganism community of Pyropia.

Keyword: P. haitanensis; metagenomic; microbial community; cluster of orthologous groups (COG);Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways; carbohydrate-active enzymes(CAZymes)

1 INTRODUCTION

Pyropiais an important commercial product that is widely cultivated in the south coastal areas of China(Xu et al., 2015). Previous studies have shown that the growth status ofPyropiais closely related to its microbial diversity. Several researches have focused on the microbial diversity ofPyropiaby using marker genes such as 16S rRNA genes, 18S rRNA genes, and internal transcribed spacer (ITS) genes (Yang et al.,2008; Shen et al., 2013). A recent study investigated the shift ofPyropia-associated bacterial communities by using 16S rRNA gene sequencing and showed that there were more than 300 operational taxonomic units(OTUs) in their samples, distributed in approximately 15 microbial phyla (Yan et al., 2019). The microbial composition ofPyropia, whether healthy or diseased,is complex. Other similar studies investigated the microbial community composition ofPyropiausing amplicon sequencing. However, owing to host contamination, PCR biases may appear in the amplification of 16S rRNA gene.

In this study, to obtain a more accurate microbial information, we performed taxonomic profiling based on shotgun metagenome sequencing of whole community DNA, which provides a comparatively unbiased insight into the microbial composition ofP.haitanensis. In addition, as metagenomics examined the entire genetic material rather than the identification sequences (e.g., 16S rRNA) (Ngara and Zhang, 2018;Shi et al., 2019), metagenome sequencing technology can solve the complex functional problems of microorganisms (Chen et al., 2019; Qiu et al., 2019).The overall functional analysis of the microbial community ofP.haitanensisrevealed functional genes for the synthesis of vitamin B12and indole-3-acetic acid (IAA). Vitamin B12plays an essential role in many biosynthetic pathways. However, the pathway involved in the synthesis of vitamin B12has been found only in prokaryotes (Bertrand et al.,2011), such asPseudomonasdenitrificans(Warren et al., 2002),Salmonellatyphimurium(Roth et al.,1993), andBacillusmegaterium(Raux et al., 1998;Cruz-López and Maske, 2016). The production of the phytohormone IAA is considered a major plantgrowth-promoting (PGP) feature of plant-beneficial bacteria (Bulgarelli et al., 2013; Nelkner et al.,2019).

In a previous study, 15 uncultured microbial genomes were obtained from bovine rumen by binning method (Hess et al., 2011). Similarly, specific information on the microbial species and genomes in the microbial community ofP.haitanensisshould be determined. Therefore, in this study, metagenomic analysis was performed to identify the microbial community composition and function of microorganisms ofP.haitanensis. We identified six high-quality metagenome-assembled genomes(MAGs) associated with the synthesis of vitamin B12and IAA by binning method. In our study, we used metagenomics to analyze the microbial community structure and function of the microorganism community ofP.haitanensis, which enriched the information of microflora structure, fills in the blank of microflora function, and provided a new research idea for the study oflaver symbiotic microbe.

2 MATERIAL AND METHOD

2.1 Experimental material of P. haitanensis

The gametophytes ofP.haitanensis(PH 40) were obtained from a laboratory culture from the Laboratory of Phycological Genetics and Sometic Cell Engineering, Ocean University of China. The gametophytes were continuously cultivated with Provasoli’s enrichment solution medium at 20±3 ℃and light intensity of 20 μmol photons/(m2·s)following a 12-h light and 12-h dark cycle. DNA extraction was performed when the length of a healthy gametophyte was approximately 8–10 cm.

2.2 DNA extraction, library construction, and sequencing

Gametophytes with a length of around 8–10 cm were selected to extract genomic DNA according to the instructions of Plant Genomic DNA Kit, TIANGEN Biotech (Beijing) Co., Ltd. The concentration and quality (A260/A280) of extracted DNA were determined using a NanoDrop ND-2000 spectrophotometer(NanoDrop, Wilmington, DE, USA), and evaluated with a 1% agarose gel. To minimize DNA extraction bias, three replicate DNA isolations were pooled (Feng et al., 2018). Pair-end library building and sequencing were completed at Personal Biotechnology Co., Ltd.(Shanghai, China) according to the standard protocol(http://www.illumina.com/).

2.3 Preprocessing of sequencing data

Trim Galore (V0.5.0) was used to remove adapter and low quality reads on returned data from the company (Gdula et al., 2019). The reads that were compared with theP.haitanensisgenome (including plastids and mitochondrial genomes) were removed using Bowtie2 (V.2.3.2) (Langmead and Salzberg,2012; Cao et al., 2020). PCR repeats were removed using FastUnique (V.1.1) (Xu et al., 2012). The clean reads were matched back to the host genome to estimate contamination from host genome. To obtain the contigs dataset, metaSPAdes (V.3.9.0) was used to perform the assembly of clean reads after quality control (Bankevich et al., 2012). Then, QUAST(V.4.5, http://quast.bioinf.spbau.ru/) was used to obtain the statistics for all assemblies.

2.4 Taxonomy assignment, gene prediction, and functional annotation

The taxonomic assignment of metagenomes wasperformed with KAIJU (http://kaiju.binf.ku.dk/server) using the default parameters. The taxonomic annotation of microorganisms was obtained by inhouse shell scripts, and species abundance was calculated by the reads number. The open reading frames (ORFs) were predicted through Prodigal(V.2.6.3) (Hyatt et al., 2012). The unique gene dataset was obtained using CD-HIT (V.4.8.1) to remove redundant genes (Fu et al., 2012). Then, the ORFs were aligned using the eggNOG database(evolutionary genealogy of genes: Non-supervised Orthologous Groups, Version 4.0) via eggNOG mapper (V.0.3) with the default parameters (Huerta-Cepas et al., 2016), and the corresponding cluster of orthologous groups of protein (COG) was obtained.The KAAS (KEGG Automatic Annotation Server)(Moriya et al., 2007) program was used to predict KEGG pathway annotations by performing GHOSTX search and SBH method against the Kyoto Encyclopedia of Genes and Genomes database(KEGG GENES). Carbohydrate-active enzymes(CAZymes) for sequences of protein with more than 100 bp were annotated by using the dbCAN2 (V7.0)website (Zhang et al., 2018). The software MetaWRAP(V.1.0.5) was used to obtain MAGs by binning method (Uritskiy et al., 2018). The completeness and contamination of MAGs were estimated by CheckM.

Table 1 Basic information statistics of reads assembly by QUAST

3 RESULT

3.1 Sequencing results and data preprocessing

The original data were finally obtained 28 Gb clean data after quality control. In total, 114 086 645 reads were obtained, and the proportion of reads with base quality score greater than 99% (Quality score >20)was 100%. The total alignment rate between clean reads and host genome was less than 0.01%. The SPAdes software was used to assemble data (meta parameters) after quality control, and the fragments with length less than 1 000 bp in the assembly results were removed. After gene prediction and redundancy removal, 63 868 contig sequences and 241 293 genes were obtained. Then, clean reads were mapped back to the contigs, and the reads utilization rate was 90.69%. The results of assembly evaluation by QUAST are presented in Table 1.

3.2 Taxonomic composition of microbial communities

The clean reads were used to query against the taxonomic database (RefSeq non-redundant proteins database, NR) and only the reads with a bit score >75 were extracted for the following analysis. A total of 51 453 077 (45.1%) reads were assigned to the taxonomic database (Fig.1). Sequences annotated to the bacterial community accounted for 98.98% and constituted the main part of the microbiota. In addition,eukaryotes, archaea, and viruses accounted for 0.8%,0.2%, and 0.02% of the microbiota, respectively.

At the phylum level, 171 taxa were obtained; five taxa had an abundance of more than 1%, namely Proteobacteria (54.64%), Bacteroidetes (37.92%),Actinomycetes (1.30%), Fusarium (1.35%), and Firmicutes (1.45%). The relative abundance of 14 taxa was higher than 0.1%, which included Basidiomycota (0.1%) and Ascomycota (0.1%).

At the genus level, 3 159 genera were obtained; 12 genera had a horizontal abundance of more than 1%,120 genera had an abundance of more than 0.1%, and 580 genera had an abundance of more than 0.01%.The relative abundance of 12 genera was greater than 1%, of whichErythrobacter(3.98%),Sphingorhabdus(2.46%),Sulfitobacter(2.37%),Altererythrobacter(1.84%),Marinobacter(1.12%),Hoeflea(1.08%),andLabrenzia(1.56%) belonged to Proteobacteria;andLewinella(1.94%),Hyunsoonleella(1.56%),Aquimarina(1.16%),Flavobacterium(1.01%), andJejuia(1.01%) belonged to Bacteroidetes. At the species level, 16 691 species were obtained. The relative abundance of four species was greater than 1%, of whichHyunsoonleellajejuensis(1.56%) andJejuiapallidilutea(1.01%) belonged to Bacteroidetes;andSphingorhabdusmarina(1.48%) andSphingomonadalesbacteriumEhC05(1.48%)belonged to Proteobacteria. The relative abundance of 91 species was more than 0.1% and that of 1 134 species was more than 0.01%.

Fig.1 Relative abundance of phylum, genus, and species in microorganisms

3.3 COG annotation

The eggNOG mapper was used to map protein sequences to the eggNOG database by running the diamond mode. Notably, 30.95% of the predicted genes identified in the dataset were assigned to putative functions (Fig.2). The classification of potential genes was subsequently conducted by COG analysis.

General function prediction only [R] was the dominant function among the 25 categories, followed by Amino acid transport and metabolism [E],Transcription [K], Signal transduction mechanisms[T], Carbohydrate transport and metabolism [G],and Cell wall/membrane/envelope biogenesis [M](>9 000). The lowest number of genes (<65) were assigned to RNA processing and modification [A],Chromatin structure and dynamics [B], Nuclear structure [Y], and Cytoskeleton [Z]. In addition,when the potential functional genes were matched to the NCBI Taxonomy Database, we found Proteobacteria (Alphaproteobacteria in particular)was the major phylum in every COG category, which consistent with species abundance distribution.

3.4 KEGG function annotation

Fig.3 Distribution of predicted genes in CAZyme classification

In addition to gene function insights, metagenomic analysis can also provide an opportunity to understand high-level functions and utilities of the microbial community ofP.haitanensis. In total, we identif ied 90 247 KOs (KEGG orthologys), 417 pathways, and 130 modules. Sixty-nine pathways with a relative abundance greater than 1.00% were obtained, and they were def ined as dominant pathways. The ko02010 was the most abundant KEGG pathway,which was assigned to ABC transporters (Table 2),followed by two-component system and quorum sensing. There were many pathways of amino acid biosynthesis and metabolism, such as tyrosine, lysine,histidine, valine, leucine, isoleucine, cysteine,methionine, alanine, aspartate, and glutamate. The enzymes of the KEGG Tryptophan Metabolism (map 00380) pathway potentially involved in indole-acetic acid (IAA, auxin) biosynthesis were identified. In particular, the genes for aldehyde dehydrogenase(NAD+), amidase, and monoamine oxidase were present, implying the presence of auxin synthesized by the tryptamine pathway. The miaA enzyme(K00791) gene capable of synthesizing cis-zeatin was identified in the carotenoid biosynthesis pathway(ko00906), and β-carotene produced in this pathway which was the precursor for biosynthesis of vitamin A. Regarding vitamin metabolism, various complete functional units in the bacterial community encoded vitamins, including thiamine biosynthesis (M00127),pyridoxal biosynthesis (M00124), pantothenic acid biosynthesis (M00119), biotin biosynthesis (M00123),cobalamin biosynthesis (M00122), and methylnaphthoquinone biosynthesis (M00116).

3.5 Carbohydrate active enzyme annotation

The Carbohydrate-Active enZYmes Database(CAZy database) can be divided into six categories:glycosyl transferases (GTs), glycoside hydrolases(GHs), auxiliary activities (AAs), carbohydrate esterases (CEs), proteins with carbohydrate-binding modules (CBMs) and polysaccharide lyases (PLs).

After annotation (e-value <1e-5, coverage > 0.35),4 273 genes were annotated by dbCAN2 against CAZy database. GTs (37.22%) and GHs (34.28%)were the dominant CAZyme categories, followed by CEs (15.23%), AAs (5.99%), CBMs (5%), and PLs(2.28%) (Fig.3). In total, 1 591 genes were distributed in diff erent GT families, with Proteobacteria and Bacteroidetes accounting for 64.3%, and 27% of the GT family, respectively.EpibacteriummobileF1926(4%),Hyphomonassp. Mor2 (3.6%), andMethyloteneraversatilis301 (3.35%) were the top three bacterial strains in the GT families. GT2(33.60%), GT4 (24.51%), and GT51 (10.67%) were the three most abundant GTs containing multiple enzymes related to cell wall synthesis. GH was the second dominant category; 1 465 genes were distributed in diff erent GH families, of which Proteobacteria and Bacteroidetes accounted for 50.9% and 37.7% of the GH family, respectively.Seonamhaeicolasp. S2-3 (4.7%),Aquimarinasp.AD10 (4.6%), andAlgibacteralginicilyticus(3%)were top three bacterial strains in the GH family.GH13 (9.33%) and GH23 (9.12%) were the dominant GHs. GH13 is a big family and includes more subfamilies such as hydrolases, transglycosidases, and isomerases (Svensson, 1994; Janec̆ek, 1997;MacGregor et al., 2001). Branching enzyme (Subf 9)was the most abundant enzyme in all GH13 subfamilies, followed by α-glucosidase (Subf 23).The former can convert amylose into amylopectin,while the latter can hydrolyze oligosaccharides rapidly (Bruni et al., 1970). GH23 included lysozyme and chitinase, which could be a possible way by which microbes compete. We also identified the family of enzymes that can degrade agarose, including GH16, GH50, GH86, and GH118. The relative abundance of CEs, AAs, CBMs, and PLs was lower than the above two categories. CE10 was rank one in Carbohydrate Esterase family; however, majority of the members of this family are esterases acting on non-carbohydrate substrates, included arylesterase,carboxyl esterase, acetylcholinesterase,cholinesterase, sterol esterase, and brefeldin A esterase. AA3 family was the most abundant in the auxiliary activity family, and it belonged to the glucose-methanol-choline oxidoreductase family.CBM9 was the most abundant family in the carbohydrate-binding module family, and it has been known as cellulose-binding domain family IX. In the polysaccharide lyase superfamily, PL6 and PL7 were the families with the highest relative abundance.Thirteen PL6 proteins were not classified to any subfamily, and nine proteins were classified to alginate lyase (Subf 1). Similar to the PL6 family,58% of PL7 proteins were classified as alginate lyase(Subf 5). Furthermore, Alphaproteobacteria was a major class in GT family, AA family, and CE family,while Flavobacteriia was higher abundance in the other CAZymes categories.

3.6 Candidate microorganisms producing B 12 and IAA

3.6.1 Cobalamin biosynthesis accomplished in six draft genomes

Several essential genes coding for enzymes involved in vitamin B12biosynthesis were used to predict the MAGs with potential to synthesize cobalamin, including cbiA/cobB encoding cobyrinic acid a,c-diamide synthase, cbiC/cobH encoding precorrin-8x methylmutase, and cobT encoding nicotinate mononucleotide:5,6-dimethylbenzimidazole phosphoribosyltransferase. Each of these genes represents a potential biomarker for vitamin biosynthesis (Bertrand et al., 2011). When these genes were present in fully sequenced bacterial and archaeal genomes, the complete B12biosynthesis pathway was also present; the genes were homologous in both oxygen-requiring (cobB, cobH, and cobT) and nonoxygen-requiring pathways for vitamin synthesis(cbiA, cbiC, and cobT) (Bertrand et al., 2011). Finally,six MAGs were identified as containing the above essential genes (Supplementary Table S1): MAG4,MAG14, MAG21, MAG26, MAG27, MAG30 and the results of MAGs evaluation by CheckM are presented in Table 3.

Then, we obtained taxonomic information for the six MAGs based on the Genome Taxonomy Database(GTDB, 04-RS89) by GTDB-Tk (V.0.3.3) (Table 4).The result shows that all the six MAGs belonged to Alphaproteobacteria, which included Rhodobacteraceae, Rhizobiaceae, and Devosiaceae.Only MAG30 was assigned to species level with close genetic distance withEpibacteriummobile(GCF_001681715.1, ANI=96.77%).

3.6.2 Auxin-producing bacteria

The KEGG annotation results were searched for IAA-related enzymes in all MAGs, and a complete pathway (tryptamine pathway) for IAA synthesis was found in the MAG21 draft genome. The IAA synthesis pathway was involved in tryptophan metabolism(ko00380) pathway; the related enzyme gene information is shown in Table 5.

4 DISCUSSION

In this study, we aimed to determine the function of the microbial community ofP.haitanensisthrough metagenomic analysis for the first time. We obtained metagenome sequence data and assembled the gene sets that represent a valuable reference repository,particularly forPyropia. After obtaining genetic information of all microorganisms in the sample by metagenome sequencing, we investigated the structure and functional potential of the microbial community ofP.haitanensis(Sudarikov et al., 2017).

A stable microbial community is a key factor in maintaining the growth and development ofP.haitanensis. By characterizing the abundance of reads at the phylum, genera, and species levels,metagenome sequencing revealed the microbial abundance and diversity of the gametophytes ofP.haitanensis. Twelve phyla with abundance greater than 0.1% were obtained. Among them, Proteobacteria(54.64%) and Bacteroidetes (37.92%) were the dominant phyla, which comprised more than 75% of the total population. We reviewed 161 macroalgalbacterial studies over the past few decades, and a bacterial core community comprising Proteobacteria(especially Alphaproteobacteria and Gammaproteobacteria), CFB group, Firmicutes, and Actinobacteria species was found to be functionally closely related to the host (Cruz-López and Maske,2016). The result showed that the algal microorganisms were similar to some extent. Therefore, we speculate that Proteobacteria and Bacteroidetes may have important influence on the growth and development ofP.haitanensis.Erythrobacterwas the most abundant genus. They can degrade alkanes, oxidize tellurite, and form tellurite crystals to decrease the concentration of tellurite acid compounds in the environment and reduce the biological toxicity of tellurite acid compounds (Yurkov et al., 1996; Alonso-Gutiérrez et al., 2009). In addition,Erythrobacterhad a strong production capacity of astaxanthin and carotenoids and played an important role in the global ocean carbon cycle and energy metabolism (Noguchi et al., 1992). We detected bacteria that causedPyropiadisease, such asCobetiamarina,Fusariumsp.,Pseudoalteromonascitrea, andPseudoalteromonastetraodonis, but their abundance was extremely low(<0.001%). Yang et al. (2008) proposed that the microorganisms ofPyropia, such asMarinobacter,Planococcus, andMacrococcus, may be regional.However, in our analysis, these three types of bacteria were detected, indicating that the diff erences in microorganisms between regions may be diff erences in terms of abundance rather than species. The present study also showed that although the abundance ofPseudomonas, which is associated withPyropiahealth, was higher than 0.1%, it was not the dominant species. In addition, virus sequences were found in the data (0.02%), among whichCaudoviraleswas the most abundant, accounting for approximately 62.3%of the total virus sequences. Although there were pathogenic bacteria and viruses among microorganisms ofP.haitanensis, their abundance was very low, and they do not necessarily cause algal disease (Feng et al., 2018).

Table 3 Statistics of MAGs information at contig level

Table 4 The GTDB taxonomic information of MAGs

Table 5 The enzyme genes information of IAA synthesis pathway in MAG21

The gametophytes of microorganisms were quite abundant, and many of them had numerous physiological functions. Nelkner et al. (2019) found that Amino acid transport and metabolism (E) was ranked two in terms of abundance (category R was on rank one) in soil microorganisms, and amino acid metabolism may be of key importance for the soil microbiome analyzed. In the present study, we found that most of the functional genes were involved in microbial metabolism, and Amino acid transport and metabolism (E) was ranked two (category R was on rank one) among the 25 functional categories. These findings indicated that amino acids might play an important role between gametophytes and host. On the other hand, the results show that although the rhizospheric microorganism andP.haitanensismicroorganism were in diff erent environmental media, the corresponding functions of the microbial community were universal to a certain extent.

The most abundant KEGG pathway was ko02010,which was assigned to ABC transporter. The current research on microbial ABC transporters showed that they were involved in many biological functions,such as transport ofions, amino acids, nucleotides,polysaccharides, and peptides; bacterial drug resistance; pheromone secretion; and detoxification of heavy metals (Dean and Annilo, 2005; Davidson et al., 2008; Theodoulou and Kerr, 2015). It was also reported that ABC transporter played an important role in plants (Do et al., 2018). The other abundant KEGG pathways were two-component system and quorum sensing, which play important roles in adaptive mechanisms of microorganisms to the environment, such as colonization, nutrient acquisition, and collective defense (Kleerebezem et al., 1997; Hmelo, 2017).

The abundance of GT was higher than that of GH.The abundance of CEs, AAs, CBMs, and PLs was lower than that of GHs. Among all GHs, amylase GH13 (9.33%) and lysozyme GH23 (9.12%) were the most abundant CAZymes. We also found that the presence of agarase and alginate lyase from the GH family and PL family, which indicated that the host might provide diverse carbon sources for the microflora.Lysozyme usually plays a role in maintaining the stability of the microflora (Li et al., 2019).

Most carbohydrate enzymes belonged to Proteobacteria and Bacteroidetes (especially Alphaproteobacteria and Flavobacteriia), implying that Proteobacteria not only play an important role in species abundance, but also play a pivotal role in function.

Metagenome assembly and binning were performed to reconstruct genomes of unknown and abundant microbial community members. By using binning method, six MAGs were obtained, including MAG4,MAG14, MAG21, MAG26, MAG27, and MAG30,which are potentially involved in cobalamin biosynthesis and IAA synthesis. Vitamin B12, a structurally complex and functionally important vitamin, is one of the essential vitamins required for the growth ofP.haitanensis. It has been reported thatLingulodiniumpolyedrumis a vitamin B1and B12auxotroph and may acquire both vitamins from the associated bacterial community, especially Proteobacteria having high abundance (Croft et al.,2005). Similarly,P.haitanensiswas auxotrophic for vitamin B12(Croft et al., 2005; Bertrand et al., 2011).The fact that vitamin B12can only be formed by bacteria and archaea implies that vitamin B12-producing microorganisms play an important role inP.haitanensis(Watanabe, 2007; Bertrand et al., 2011;Helliwell, 2017; Wichard and Beemelmanns, 2018).On the other hand, bacteria that produce IAA were generally considered PGP microbiome members(Bulgarelli et al., 2013; Nelkner et al., 2019). By putting the whole draft genomes to GTDB, we speculated MAGs as new strains in Rhodobacteraceae,Rhizobiaceae, and Devosiaceae. All MAGs with completeness >90% and contamination <1.5% can provide a strategy for the isolation, culture, and functional identification of microorganisms ofP.haitanensis.

Under the influence of experimental materials and sequencing data, we conducted a systematic study on the microflora of gametophytes ofP.haitanensis, and a deeper study on the complex relationship between host and microflora will be reflected in subsequent studies.

5 CONCLUSION

The microbial community ofP.haitanensisincludes not only prokaryotes but also fungi and viruses or bacteriophages. In this study, we comprehensively analyzed the microbial species diversity ofP.haitanensisand systematically analyzed their functions. We found six MAGs associated with the synthesis of vitamin B12and IAA,and they can added to the microorganism genome database ofP.haitanensis. The obtained genome information for the new candidateP.haitanensisbeneficial microbial species may guide the development of rational isolation strategies. We used metagenomic to analysis the microorganism community ofP.haitanensis, which enriched the information of microflora structure and function, and provided a new research idea for investigating laver symbiotic microbe.

6 DATA AVAILABILITY STATEMENT

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.