Genome Sequence of a Marine Carotenoid Producing Yeast Rhodotorula mucilaginosa CYJ03

2020-03-10 15:06:48TANGWeiWANGYueCAIYaliLIUShiboZHANGJunandHEZengguo
Journal of Ocean University of China 2020年2期

TANG Wei , WANG Yue CAI Yali LIU Shibo ZHANG Jun and HE Zengguo ,

1) School of Medicine and Pharmacy, Ocean University of China, Qingdao 266000, China

2) Marine Microbiological Engineering & Research Center, Marine Biomedical Research Institute of Qingdao,Qingdao 266000, China

Abstract Carotenoids are valuable pigments that have been widely used in food, pharmaceutical, animal breeding and cosmetics industries. Due to the increasing demand for carotenoids of natural origin, the trend for production of carotenoids by red yeast has become popular. Strain Rhodotorula mucilaginosa CYJ03 was isolated from northern Yellow Sea of China for its carotenoid producing potential. It was found that the whole genome of CYJ03 was 19.03 Mb in size and contained 6301 protein-coding genes including a gene cluster for the carotenoids biosynthesis. The genome sequence would be valuable for exploring the potential biological properties of CYJ03, as well as for facilitating the molecular genetic analysis and the manipulation of carotenoids accumulation in this strain, and for the development of it as an engineered host for carotenoid production.

Key words Rhodotorula mucilaginosa CYJ03; genome; carotenoid; biosynthesis pathway; gene cluster

1 Introduction

Rhodotorula mucilaginosa is a species of red-pigmented yeast in the phylum Basidiomycota, family Cryptococcaceae, subfamily Rhodotorulodae, genus Rhodotorula. The red-pigment is usually attributed to the carotenoids accumulated intracellularly (Hawksworth et al.,1996). With the increasing demand for natural carotenoids, the production of carotenoids by yeast fermentation has become the most interesting method. Meanwhile,attentions to red yeast such as R. mucilaginosa and other red yeasts has been on rise over the years (Frengova and Beshkova, 2009; Mata-Gómez et al., 2014).

R. mucilaginosa strains were isolated from different habitats, such as ocean, soil, animals, plants, and industrial minerals (Wirth and Goldani, 2012). In addition to the biosynthesis of carotenoids, the production of a variety of other metabolites of values, such as unsaturated fatty acids, indoleacetic acid, extracellular polysaccharides, and β-glucan, were also documented by the strains of this species, respectively (Singh et al., 2013; Ignatova et al., 2015; Ma et al., 2017). The biological functions of those metabolites on antioxidant, immune enhancement,anti-tumor activity have been well documented. Thus far R. mucilaginosa has been already used to produce carotenoids as well as applied as natural fruit preservative to inhibit plant pathogenic fungi (Akhtyamova and Sattarova,2013; Zhang et al., 2014), and to modify mineral surface in copper biotransformation (Salvadori et al., 2014). In short, R. mucilaginosa has huge application potential to be explored in a broad market aspects covering the fields of food, medicine, energy, animal husbandry, biological control, as well as biomining industries.

Even though the application potential has been well revealed, the published genomic resource of R. mucilaginosa is still scarce. To date, merely five genome sequences of R. mucilaginosa strains, i.e., C2.5t1, RIT389,IIPL32, JGTA-S1 and ATCC 58901, have been published since 2015 (Deligios et al., 2015; Saha and Seal, 2015;Dasgupta et al., 2017; Gan et al., 2017; Landolfo et al.,2018). In order to explore the full potential of the metabolites afore-mentioned, all of the supportive knowledge pool on genomic sequencing and analysis are still in need.

Recently, our group obtained a red marine yeast isolate CYJ03 from the water sample of the northern Yellow Sea of China, and it was identified as R. mucilaginosa. This strain exhibited strong tolerances to high concentrations of NaCl, glucose as well as low temperature. A set of techniques has been established aimed at high density fermentation for the mass production of carotenoids (unpublished data). In this study, the genome sequence of R.mucilaginosa CYJ03 was reported and analyzed, and the carotenoid biosynthesis pathway and gene cluster were investigated. These data can be used to explore genes and protein functions for biochemical and biotechnological features, and to regulate metabolic pathways by genetic engineering and metabolic engineering for producing valuable metabolites.

2 Methods

2.1 Strain, Culture Condition and Identification

R. mucilaginosa CYJ03 (CGMCC 16302) was used in the current study, the strain was grown in yeast extract peptone dextrose (YPD) medium at 28℃ for 18 h. Colonies and cell morphology were observed by spread plate method and scanning electron microscopy (Hitachi S-4800, Japan), respectively. ITS1-5.8S rRNA-ITS2 sequences were analyzed to match to the data base of strains in Rhodotorula, the phylogenetic tree was constructed with the MEGA 7 software by Neighbor-Joining method (Tamura et al., 2004).

2.2 DNA Preparation, Genome Sequencing and Assembly

The total genomic DNA of R. mucilaginosa CYJ03 was extracted using NanoMagTMfast fungal genomic DNA preparation kit (Tianjin Charme Co., Tianjin, China)and the genome sequencing was performed using the PacBio Sequel System (Pacific Biosystems, Menlo Park,USA). Subreads were assembled and self-corrected using the Falcon Assembler (Chin et al., 2016), and the consensus sequences were obtained, which was based on the Overlap-Layout-Consensus algorithm. Then the error was corrected using the Genomic Consensus and Pilon, respectively.

2.3 Gene Prediction and Annotation

Gene prediction was performed using GeneMark-ES(v4.33) (Lomsadze et al., 2005). For gene prediction,assembly scaffolds were masked using RepeatMasker(v4.0.7). The genome assembly was annotated using the software Diamond (at an E value of 10-5) (Akhter et al.,2012) with BLASTXalgorithm against NR, Swissprot,and GO term databases, KEGG for metabolic pathways,KOG for eukaryotic clusters of orthologs, eggNOG for orthologous groups of proteins (Koonin et al., 2004;Minoru et al., 2004; Huerta-Cepas et al., 2016).

2.4 Carotenoid Biosynthesis Pathway and Gene Cluster Analysis

Proteins predicted to be involved in different steps of the carotenoid biosynthetic pathway were annotated by R.mucilaginosa CYJ03 genomic sequences against KEGG database. And the carotenoid biosynthesis pathway was mapped based on the data obtained. The secondary metabolite biosynthesis gene clusters were identified, annotated, and compared in fungi genomes using the online software antiSMASH (Medema et al., 2011).

3 Results and Discussion

3.1 Identification of R. mucilaginosa CYJ03 and Phylogenetic Analysis

CYJ03 could grow aerobically on YPD agar characterized by forming orange and smooth colonies with moist appearance (Fig.1A). Under scanning electron microscopy the cells of CYJ03 were 30-35 µm in length and 20-22 µm in width, ellipsoid in shape with budding observed (Fig.1B).

The phylogenetic relationship of CYJ03 to other Rhodotorula species was analyzed based on the small subunit ribosomal RNA gene analysis. It was found that CYJ03 and the type strain R. mucilaginosa CBS 316 were clustered in the same clade (Fig.2). The identification of CYJ03 as R. mucilaginosa was subsequently confirmed using whole genome nucleotide sequencing.

Fig.1 (A) Colony morphology of R. mucilaginosa CYJ03 grown on YPD agar, (B) Scanning electron microscopy of CYJ03 at 5 kX magnification.

Colonies of Rhodotorula species are usually distinctive by forming colors, ranging from red to colorless, mostly pink and red when grown on SDA (Sabouraud’s Dextrose Agar) (Hernández-Almanza et al., 2014). These pigments produced are beneficial in protecting the producer cell structures form oxidative stress, via the blocking certain radiations with the wavelengths from 620 to 750 nm. The orange pigment of CYJ03 may be then carotenoid-like substances as to be found in other strains (Buzzini et al.,2007).

Fig.2 Phylogenetic analysis of CYJ03 and related strains, Saccharomyces cerevisiae CBS 1175 was used as the root. The phylogenetic tree was constructed based on ITS1-5.8S rRNA-ITS2 sequences. Bootstrap percentages from 1000 replications are shown on the branches (values below 50% are not shown). GenBank accession number of each strain’s sequence is in parentheses. T, type strain.

3.2 Genome Characteristics

Whole-genome sequencing was performed by OE Biotech (Shanghai OE Biotech Co., Ltd., Shanghai, China)using PacBio sequencing technology, with 252156 total reads ranging from 51 bp to 59861 bp for a total of 1 billion 324 million sequenced bases (Table 1). The total size of R. mucilaginosa CYJ03 is 19037214 bp containing in 88 contigs. The largest contig measured 891449 bp, and the N50length was 420192 bp. G+C content was 60.49%.A total of 6301 genes were predicted by applying Gene-Mark-ES for coding genes predicting as well as for genomes assembling (Alexandre et al., 2005). Followed functional annotation were carried out with the software Diamond to assign the NR, KOG, GO, Swissprot, egg-NOG and KEGG databases to 6191 (98.25%), 3683(58.45%), 4144 (65.77%), 4241 (67.31%), 4797 (76.13%)and 2671 (42.39%) of the predicted proteins respectively.

As indicated by the predicted genes and the corresponding protein functions, the genome of CYJ03 was facilitated with a series of genes potentially associated with some specific yet interesting biological characteristics, such as zinc binding, UV radiation resistance, lipid synthesis, carotenoid biosynthesis, and other biotransformation.

Compared to other species in the same genus, the whole genome information of R. mucilaginosa is relatively scarce. So far only five genome sequences of R.mucilaginosa isolates have been published, among them C2.5t1, RIT389, IIPL32 and JGTA-S1 were submitted to the NCBI database, whereas strain ATCC 58901 was sent to JGI Genome Portal(https://genome.jgi.doe.gov/Rhomuc1/Rhomuc1.home.ht ml). The source, genome information and functions of these five strains and CYJ03 are detailed in Supplementary Table S1 (not showen). In this work, the whole genome sequence of R. mucilaginosa CYJ03 was submitted and published to the NCBI database.

Table 1 Genome assembly and annotation statistics of R.mucilaginosa CYJ03

3.3 Carotenoid Biosynthetic Pathway Depicting

In industry, red yeast is often used to produce carotenoids. The production of carotenoids is very important,they are used as natural colorants or nutritional supplements, and some carotenoids are also precursors of vitamin A, such as β-carotene, γ-carotene, torulene and torularhodin (Mata-Gómez et al., 2014). However, due to the lack of information about the carotenoid biosynthetic pathways and molecular characteristics of the Rhodotorula species may hinder these potential strains from increasing their carotenoid production through genetic or metabolic engineering. Phaffia rhodozyma has done a better job in this respect, and its carotenoid biosynthesis pathway has been relatively clear (Rodríguez-Sáiz, 2010).

Based on the genomic data obtained, the carotenoid biosynthetic pathway of R. mucilaginosa CYJ03, was mapped and depicted using the KEGG pathway analysis and annotation (Fig.3). The biosynthesis of carotenoid begins with the formation of geranylgeranyl pyrophosphate (GGPP) followed by a series of enzymatic reactions of acetyl-CoA addition. Two molecules of GGPP were condensed into phytoene which was the first colorless carotenoid, this step was catalyzed by phytoene synthase[EC: 2.5.1.32]. Then the phytoene desaturase [EC: 1.3.99.30] catalyzed three stepwise desaturation reactions for the formation of neurosporene. Neurosporene was then converted to lycopene, which was followed by two independent cyclization branch routes leading to the two final products β-carotene and torulene, respectively. In the reported carotenoid biosynthetic pathway, torulene is further covert to torularhodin by hydroxylation and oxidation, but torularhodin was not found in KEGG mapping(Kot et al., 2018).

Fig.3 Diagram of the carotenoid biosynthetic pathway of CYJ03. crtB, phytoene synthase; AL1, phytoene desaturase; AL2, phytoene synthase/lycopene cyclase.

Combined with the results of gene annotation, it was found that in the carotenoid biosynthetic pathway by CYJ03 three genes, crtB, AL1 and AL2, were involved and encoded phytoene synthase, phytoene desaturase, and phytoene synthase/lycopene beta-cyclase, respectively. It is worth mentioning that AL2 gene encoded a bifunctional protein which exerts both phytoene synthase and lycopene cyclase activities. This finding is similar to what has been observed in several other fungal species (Arrach et al., 2001; Sanz et al., 2011).

3.4 Carotenoid Biosynthetic Gene Cluster of R. mucilaginosa CYJ03

In this study the scanning of CYJ03 genome ended with the finding of twenty-five gene clusters corresponding to secondary metabolite biosynthesis by using antiSMASH software. Three out of 25 of the clusters were found to be involved with terpenes biosynthesis, with 13 clusters being identified as the one in charge of carotenoid biosynthesis. The important genes of this cluster have been annotated, and the results were all shown in Table 2.

It is found that multiple genes of cluster 13 have high homology with what have been identified in other fungi strains. In different fungi the carotenoid gene clusters arrangement varied in layout (Fig.4). Compared to the observations in other fungi, the six main carotenoid biosynthetic genes of CYJ03 were closely located in a compacted array. The six genes encoded a series enzymes for carotenoid biosynthesis namely phytoene desaturase(AL1), putative oligopeptide transporter, putative carotenoid oxygenase, phytoene synthase/lycopene cyclase(AL2), SGS-domain-containing protein and DUF1212 family protein, respectively. In general the carotenoid biosynthetic gene cluster of CYJ03 shares low similarity to that of other fungi, e.g., cluster 13 of CYJ03 only share merely 27% similarity compared to Rhodosporidium toruloides strain CECT1137.

The use of -omics and genetic engineering tools to study carotenoid metabolism mechanisms and improve synthesis efficiency is emerging with a panel of key biosynthetic steps elucidated. Landolfo et al. (2018) studied the expression of CAR2 and CAR1 genes from R. mucilaginosa C2.5t1, which encode phytoene dehydrogenase and phytoene synthase/lycopene cyclase at the late steps of carotenogenic pathway, respectively. The results showed that there was no clear induction trend of these two genes, and no significant variations were observed during growth. Therefore, the amount of carotenoids produced by R. mucilaginosa is more likely determined by the regulation of enzyme activity rather than the regulation at transcript levels, e.g., induction of the biosynthesis of relevent genes (Wozniak et al., 2011). Prado-Cabrero et al. (2007) targeted deleted the carX gene from Fusarium fujikuroi, which encodes a putative carotenoid oxygenase involved in the formation of neurosporaxanthin (a major carotenoid). The deletion of carX gene did not impede neurosporaxanthin biosynthesis, but did significantly evaluate the total carotenoid production. It was concluded that carotenoid oxygenase participates in the regulation of carotenoid synthesis through a negative feedback mechanism. In Sporidiobolus pararoseus NGR,transcriptional levels analysis of genes encoding phytoene desaturase and lycopene cyclase/phytoene synthase were investigated by applying salt stress conditions. The results showed that their relative expressions were significantly up-regulated, which resulted in the accumulation of 3,4-didehydrolycopene and brought about subsequent increase in the production of torulene and torularhodin (Li et al., 2017).

Fig.4 Alignments of homologous carotenoid biosynthetic gene clusters between R. mucilaginosa CYJ03 and other strains.This map was drawn by using the antiSMASH online software, the ClusterBlast was performed. The top10 strains matching with CYJ03 gene cluster are displayed. Homologous genes are marked with the same colour, and numbers in parentheses represent similarity. Numbers indicate the individual genes/enzymes. Names and gene numbers from the R. mucilaginosa CYJ03 genome are listed in Table 2.

Table 2 Genes of the carotenoid biosynthetic gene cluster in R. mucilaginosa CYJ03

With the established CYJ03 genome data in the future the role of key genes for carotenoid biosynthesis of CYJ03 will be further studied and hopefully the techniques for the overproduction of carotenoids will be available through the sophisticated bioengineering manipulations.

4 Conclusions

A carotenoid-producing red yeast strain CYJ03 was isolated from ocean samples and it was subsequently identified as R. mucilaginosa. Whole-genome sequencing was completed, and in-depth analysis of the carotenoid biosynthesis pathway and the relevant gene clusters were conducted, respectively. The elucidation of carotenoid biosynthesis pathway and the relevant key genes have extended the knowledge of carotenoids biosynthesis by Rhodotorula. In addition, the release of the whole genome sequence of R. mucilaginosa CYJ03 enables the exploration of application potential of the metabolites of interest. In the future it would be of great values to develop genetic manipulation tools to facilitate the over production of carotenoids and other valuable metabolites.

Nucleotide Sequence Accession Numbers

Rhodotorula mucilaginosa CYJ03 genome sequencing project has been deposited in NCBI under BioProject (No.PRJNA506114) and BioSample SAMN10448795. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession RZHN00000 000. The version described in this paper is version RZHN 01000000.

Acknowledgements

The study is supported by the Postdoctoral Applied Research Project of Qingdao.