Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Expanded transcriptomic view of strawberry fruit ripening through meta-analysis

  • Gibum Yi ,

    Contributed equally to this work with: Gibum Yi, Hosub Shin

    Roles Conceptualization, Investigation, Writing – original draft, Writing – review & editing

    gibumyi@cnu.ac.kr (GY); ejinlee3@snu.ac.kr (EJL)

    Affiliation Department of Bio-Environmental Chemistry, College of Agriculture and Life Sciences, Chungnam National University, Daejoen, Korea

  • Hosub Shin ,

    Contributed equally to this work with: Gibum Yi, Hosub Shin

    Roles Data curation, Investigation

    Affiliation Department of Agriculture, Forestry and Bioresources, College of Agriculture and Life Sciences, Seoul National University, Seoul, Korea

  • Kyeonglim Min,

    Roles Investigation

    Affiliation Department of Agriculture, Forestry and Bioresources, College of Agriculture and Life Sciences, Seoul National University, Seoul, Korea

  • Eun Jin Lee

    Roles Conceptualization, Supervision, Writing – review & editing

    gibumyi@cnu.ac.kr (GY); ejinlee3@snu.ac.kr (EJL)

    Affiliations Department of Agriculture, Forestry and Bioresources, College of Agriculture and Life Sciences, Seoul National University, Seoul, Korea, Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Korea

Abstract

Strawberry is an important fruit crop and a model for studying non-climacteric fruit ripening. Fruit ripening and senescence influence strawberry fruit quality and postharvest storability, and have been intensively studied. However, genetic and physiological differences among cultivars preclude consensus understanding of these processes. We therefore performed a meta-analysis by mapping existing transcriptome data to the newly published and improved strawberry reference genome and extracted meta-differentially expressed genes (meta-DEGs) from six cultivars to provide an expanded transcriptomic view of strawberry ripening. We identified cultivar-specific transcriptome changes in anthocyanin biosynthesis-related genes and common changes in cell wall degradation, chlorophyll degradation, and starch metabolism-related genes during ripening. We also identified 483 meta-DEGs enriched in gene ontology categories related to photosynthesis and amino acid and fatty acid biosynthesis that had not been revealed in previous studies. We conclude that meta-analysis of existing transcriptome studies can effectively address fundamental questions in plant sciences.

Introduction

Strawberry (Fragaria × ananassa Duch.) is an important and nutritious fresh fruit crop for human consumption. As its popularity has increased, so have research and breeding efforts. Strawberry has an octoploid origin resulting from the combination of four diploid species, F. iinumae, F. nipponica, F. viridis, and F. vesca, and this genomic complexity has made genetic and genomic studies inefficient until recently [1].

Ripening is a complex process integrating development and senescence. Fruits can be divided into two types, climacteric and non-climacteric, based on their respiration and ethylene fluctuation during ripening. Climacteric fruit such as tomato have a respiratory burst and ethylene peak at the onset of ripening and have been studied intensively. By contrast, non-climacteric fruit produce a very small amount of ethylene with no increased rate of respiration. Strawberry is used as a model for non-climacteric fruit ripening because of its commercial importance and experimental advantages [2]. Expressed sequence tag (EST)-based transcript analysis, microarray analysis, and recent next-generation sequencing (NGS)-based transcriptome analysis have been performed to understand the molecular mechanisms of strawberry ripening [35], with metabolic and proteomic analysis supporting this effort [68]. Physiological changes during ripening affect texture, acidity, color, flavor, and aroma, occurring alongside molecular changes in plant hormone signaling, cell wall loosening, sugar transport, and anthocyanin biosynthesis [69].

Strawberry is asexually propagated by runners, and breeding efforts focus on selecting superior plants rather than establishing inbred lines. Intensive breeding programs have been implemented since the 1990s in many countries; dominant cultivars differ by region and are genetically diverse, with complex relationships between cultivars [10]. Research in strawberry has been performed using a wide range of cultivars. Understanding the genetic relationships among current cultivars is therefore necessary for combined analysis. The recently published, chromosome-based genome of cultivar ‘Camarosa’ enables comparison of previously published transcriptome data to a common reference genome [1].

Despite accumulating data showing transcriptome changes during strawberry development, the effects of the genetic backgrounds in different cultivars has not been investigated; thus, it is difficult to conclude whether changes are cultivar-specific or not. Furthermore, use of different reference genomes or de novo assembled transcriptomes along with different gene names means that data from the same developmental stage or obtained under similar conditions are not easily comparable.

Meta-analysis is being applied in the plant science field to provide an expanded view of specific biological questions that cannot be answered in a single experiment [11, 12]. However, application of meta-analysis is relatively limited in plant biology compared to that in medical science or environmental sciences. There are likely two main reasons for this. First, it is easier to produce a randomized sample design and high enough number of biological replicates using plant samples compared to animal or human samples. Second, identical treatments are imposed much less frequently in plant science studies than in medical science. However, even in plant science research, transcriptome data is produced from a relatively small number of biological replicates, generally two to three, owing to the cost per sample. Meta-analyses have the potential to increase the usefulness of studies involving a limited number of samples by compiling multiple studies, as in the application of multiple transcriptome data sets to elucidate concordant changes by identifying meta-DEGs (differentially expressed genes) [11, 12]. Furthermore, meta-analysis can be applied to RNA sequencing (RNA-Seq) data from different species, or under different biotic and abiotic stresses, to address a broad range of questions [13, 14].

The recently published, high-quality genome of strawberry, which has 805 Mb of sequence, covering the 28 expected chromosome-level pseudomolecules [1], allows meta-analysis to be used for systematic comparison of different studies. Here, we compiled previous transcriptome data from various cultivars grown under different conditions and present an expanded transcriptomic view of ripening in strawberry. We used meta-analysis of transcriptome data for six cultivars from three independent studies to deliver an in-depth view of the results, which provide further information on strawberry ripening to the research community.

Methods

Transcriptome data analysis and meta-analysis

Data from three publicly available transcriptome studies were used for meta-analysis (Table 1). The transcriptome data include the whole-fruit transcriptomes of six different cultivars at two coinciding developmental stages, Big Green (BG) and Fully Red (FR). RNA-Seq raw data were downloaded from the NCBI SRA database and low-quality reads (Q < 20) were filtered out using FASTX-Toolkit. Filtered reads were mapped to the Fragaria × ananassa ‘Camarosa’ Genome Assembly v1.0 (https://www.rosaceae.org) using Tophat v2.1.1 [15] with default parameters, and the number of mapped reads was counted using ht-seq-count from HTSeq [16]. For single analyses, differentially expressed genes (DEGs) were identified using the Bioconductor package edgeR 3.30.3 [17] with minimum FPKM (Fragments Per Kilobase of transcript per Million fragments mapped) > 0.3, false discovery rate (FDR) < 0.05, and log2 fold change > 1. Meta-analysis was performed following the method of Cohen et al. [18] with minor modification. Briefly, p-values from single analyses were combined using the Fisher’s sum of logs method using the R package metap v1.1 [19] and multiple tests were performed using the p.adjust function in R with the ‘fdr’ method. Meta-DEGs were identified with median FPKM > 0.3 and absolute value of median log2 fold changes > 1 for all cultivars within the analysis, and an adjusted p-value < 0.01.

Principal component analysis (PCA) was performed using the R package FactoMineR 1.32 [20] with the whole-transcriptome FPKM values of all samples from six cultivars.

Gene ontology (GO) term and KEGG pathway enrichment analysis

Fisher’s exact test was performed using TopGo 2.18 [21] in the R package for GO term enrichment tests. Adjusted p-values were calculated using the p.adjust function in R with the ‘fdr’ method, and significantly enriched GO terms were identified with FDR < 0.05. Enrichment test for KEGG (Kyoto Encyclopedia of Genes and Genomes) [22] pathways was performed using Fisher’s exact test, and p-values were adjusted using the FDR method. Significantly enriched pathways were determined with FDR < 0.05 and odds ratio > 1.

Single-nucleotide polymorphism (SNP) detection and genetic relationship analysis

Filtered reads were mapped to the Fragaria × ananassa ‘Camarosa’ Genome Assembly v1.0 (https://www.rosaceae.org) using Burrows-Wheeler aligner v0.7.17-r1188 [23] with the ‘mem’ algorithm. SAMtools v0.1.19 [24] was used for calling variant and homozygous SNPs for all six cultivars covered by ≥ 3 reads per sample. A total of 7,002 SNPs were identified, and a neighbor-joining tree was reconstructed using MEGA X [25].

Motif search for the meta-DEGs

De novo motif analysis was performed with 1 kb upstream sequence of the transcription start site of META DEGs using MEME software with “-revcomp -mod zoops -objfun de” parameters [26]. The second-order Markov background model was constructed based on upstream sequences of all genes and the model was used as control. Enriched motifs were filtered with E-value threshold of 0.05. The possible binding site of transcription factor was searched using TOMTOM [27] with JASPAR CORE (plant) database 2018 [28].

Results

The six cultivars have distinct characteristics

During the BG to FR stages, strawberry fruit undergoes de-greening and red coloration among other physiological changes [5]. The two stages are visually separated from the other developmental stages and have been selected in many previous studies investigating the ripening process [29, 30]. More DEGs are generated from BG vs. FR than from sub-stages between BG and FR [29].

For meta-analysis of strawberry fruit ripening, we collected publicly available RNA-Seq data for BG and FR stages, for which the largest dataset is available (Table 1). We characterized the six cultivars based on their RNA-Seq results since the meta-analysis assumed that the six cultivars are distinct from each other and have the same weight for analysis. PCA based on their fruit transcriptome profiles at the two developmental stages, BG and FR, showed clustering of samples where PC1 and PC2 explained 36% of the variation. The two different developmental stages were relatively well separated by PC1 and PC2. Furthermore, there seemed to be two clusters of cultivars at both developmental stages. ‘Kingsberry’ and ‘Sunnyberry’ were clustered together and the other cultivars were clustered with each other. Most of the replicates were closely located, with those of ‘Toyonoka’ and ‘Sunnyberry’ at the BG stage showing a greater distance (Fig 1A). Transcriptome profiles tended to show greater variation at the BG stage compared to the FR stage, as determined from the distribution of samples on the PCA score plot (Fig 1A), and as observed in a previous study using ‘Kingsberry’ and ‘Sunnyberry’ [30].

thumbnail
Fig 1. Transcriptomic characteristics of the six strawberry cultivars.

(A) PCA based on transcriptome profiles. FPKM values for all genes were scaled by unit variance with the R package FactoMineR. Samples from the two developmental stages, BG and FR, are colored green and red, respectively. Areas bounded by green and red lines indicate 95% confidence area for BG and FR, respectively. (B) Genetic distance among the six cultivars and the reference cultivar ‘Camarosa’ based on 7,002 SNPs. Numbers on branches indicate percentage bootstrap support from 1,000 replications. Bar, nucleotide substitution rate for the SNP loci.

https://doi.org/10.1371/journal.pone.0252685.g001

We further compared cultivars based on SNPs in genic sequences using the transcriptome data. ‘Xiaobai’ was first introduced from a somatic variant of ‘Benihoppe’ [4], and as expected the genetic distance of these two cultivars was small (Fig 1B). However, their transcript profiles at the BG stage were distinct enough to consider them different cultivars. ‘Kingsberry’ and ‘Sunnyberry’ showed a relatively close genetic relationship from the SNP genotypes (Fig 1B); these cultivars are from the same breeding institute and share partial ancestry [30]. ‘Kingsberry’ also showed a relationship to ‘Benihoppe’ and ‘Xiaobai’ (Fig 1B), which share the maternal parent ‘Akihime’. ‘Toyonoka’ showed similarity to the reference cultivar ‘Camarosa’. These cultivars were genetically diverse to cover certain amount of genetic diversities of strawberry [4, 30]. From these analyses, we concluded that there was sufficient genetic variation among these six samples to consider them separate cultivars and perform further analyses.

Reanalysis of publicly available strawberry transcriptome data

The mapping rate for ‘Kingsberry’ and ‘Sunnyberry’ was slightly improved to 66.5%, compared to 63.6% using the old reference genome [30, 31], showing the improvement in the reference genome (S1 Table) [1]. Furthermore, there was no significant difference in mapping rate among cultivars analyzed in this study, indicating similar quality of RNA-Seq data (S1 Table). The reference genome contains 108,087 protein-coding genes [1], among which 44,061 (40.8%) genes were not transcribed (FPKM < 0.3) in any of the samples of the six cultivars and 11,968 (11.1%) genes were expressed only at a basal level (FPKM < 1) in only one of the samples.

We analyzed correlation coefficients (R2) among biological replicates and among cultivars (S2 Table). The average R2 among biological replicates was 0.939, ranging from 0.868 to 0.998, showing high concordance among replicates. The highest correlations between cultivars were observed for ‘Xiaobai’ FR and ‘Benihoppe’ FR samples. Notably, the ‘Sunnyberry’ BG sample showed higher correlation to the FR samples of ‘Benihoppe’, ‘Xiaobai’, and ‘Snow princess’ than to the BG samples.

We identified more DEGs between FR and BG samples than previous studies; for example, ‘Sunnyberry’ had 4,656 DEGs between these two stages in a previous study [30] but we detected 10,033. There were 1.5–2.4 times more DEGs down-regulated in FR compared to BG than up-regulated in the six cultivars. Different reference genomes and criteria can easily change the number of DEGs detected; however, the ratio between up- and down-regulated DEGs was consistent across data sets (Fig 2A). Numbers of up- and down-regulated DEGs in FR compared to BG in the six cultivars are shown as Venn diagrams (Fig 2B and 2C). ‘Toyonoka’ and ‘Snow princess’ had the largest number of cultivar-specific down- and up-regulated DEGs, respectively. ‘Xiaobai’ and ‘Benihoppe’ had the largest number of two-cultivar-specific up-regulated DEGs, whereas ‘Toyonoka’ and ‘Snow princess’ shared the largest number of two-cultivar-specific down-regulated DEGs (Fig 2B and 2C). Over half (51.5%) of the up-regulated DEGs were single-cultivar specific whereas only 35.2% of the down-regulated DEGs were single-cultivar specific (Fig 2B and 2C). By contrast, 45.3% of the down-regulated DEGs were commonly detected in three or more cultivars, whereas only 27.8% of the up-regulated DEGs were commonly detected in three or more cultivars (Fig 2B and 2C). Down-regulated DEGs tended to be common to several cultivars whereas up-regulated DEGs tended to be cultivar specific.

thumbnail
Fig 2. Meta-DEGs and DEGs from the six strawberry cultivars.

(A) Numbers of up- and down-regulated DEGs between big green (BG) and fully red (FR) stages are shown as red and green bars, respectively. (B) Venn diagram of the up-regulated DEGs (FR vs. BG) in the six cultivars. Numbers in each section represent the specific or common DEGs among six cultivars. (C) Venn diagram of the down-regulated DEGs (FR vs. BG) in the six cultivars.

https://doi.org/10.1371/journal.pone.0252685.g002

We applied GO enrichment analysis to the DEGs of the six cultivars to identify functions important during ripening (Tables 2 and 3). Down-regulated DEGs in the six cultivars shared common GO terms whereas up-regulated DEGs had few common terms. ‘photosynthesis’ (GO:0015979) and ‘cell wall biogenesis’ (GO:0042546) were the two most highly enriched GO terms among down-regulated DEGs for all six cultivars. Other terms related to cell wall loosening and sugar metabolism were commonly associated with DEGs from all six cultivars (Tables 2 and 3).

thumbnail
Table 2. Enriched GO terms for up-regulated meta-DEGs and FDR values for up-regulated DEGs (FR vs. BG) in the six cultivars.

https://doi.org/10.1371/journal.pone.0252685.t002

thumbnail
Table 3. Enriched GO terms for down-regulated meta-DEGs and FDR values for down-regulated DEGs (FR vs. BG) in the six cultivars.

https://doi.org/10.1371/journal.pone.0252685.t003

Cultivar specificity was more apparent in GO analysis of DEGs up-regulated at the FR compared to BG stage. ‘Benihoppe’, ‘Xiaobai’, and ‘Snow princess’ showed similar GO term enrichment whereas ‘Sunnyberry’ and ‘Kingsberry’ shared a distinct set of enriched GO terms. We also observed these differences among cultivars in the PCA clustering (Fig 1). Fewer up-regulated DEGs than down-regulated DEGs and less-conserved GO term enrichment among the up-regulated DEGs indicate that strawberry ripening from the BG to FR stages is active having a large number of genes with decreased expression. Furthermore, these ripening processes are quite variable among different cultivars.

Meta-analysis reveals expanded transcriptome changes during ripening

We identified 12,339 meta-DEGs by meta-analysis of the six cultivars consisting of 3,248 up- and 9,091 down-regulated DEGs in FR compared with BG samples (Fig 2A). As expected from individual analysis of the six cultivars, there were 2.8 times more down-regulated meta-DEGs than up-regulated meta-DEGs. Between 54.5 and 79.0% of DEGs in the six cultivars were retained as meta-DEGs (S3 Table).

The highly enriched GO terms associated with meta-DEGs were ‘organic acid metabolic process’ (GO:0006082) and ‘fatty acid metabolic process’ (GO:0006631). However, these terms were enriched only in two or three of the cultivars, respectively. For DEGs down-regulated at FR compared to BG stage, ‘photosynthesis’ (GO:0015979) and ‘cell wall biogenesis’ (GO:0042546) were the two most highly enriched GO terms among meta-DEGs, as for the six cultivars (Table 3). Other terms related to cell wall loosening such as ‘cell wall polysaccharide metabolic process’ (GO:0010383) and ‘cellular carbohydrate metabolic process’ (GO:0044262), which were associated with DEGs from all six cultivars, were also enriched among the meta-DEGs. KEGG analysis revealed similar findings to GO analysis, with ‘photosynthesis,’ ‘glyoxylate and dicarboxylate metabolism,’ ‘cutin, suberine, and wax biosynthesis,’ and ‘starch and sucrose metabolism’ being enriched pathways. We identified clear, common changes in starch metabolism, chlorophyll degradation, and cell wall degradation during the ripening process in all six cultivars.

Of the meta-DEGs, 2,150 were detected in all six cultivars, 695 genes were differentially expressed in only one of the six cultivars, whereas 483 were not detected as DEGs in any of the six individual cultivars (S4 Table). These 483 meta-DEGs were assumed to be novel DEGs. When GO enrichment analysis was performed on these novel DEGs, ‘photosynthesis, light reaction’ (GO:0019684), ‘cellular modified amino acid biosynthetic process’ (GO:0042398), and ‘fatty acid biosynthetic process’ (GO:0006633) terms were associated with DEGs down-regulated in FR compared with BG samples, and ‘response to auxin’ (GO:0009733) was associated with up-regulated DEGs. These newly detected meta-DEGs provide an extended view of the ripening process. Thus, meta-analysis can be used for gene identification by increasing statistical significance with a greater number of samples.

We further performed motif search for the whole meta-DEGs and for the 484 newly identified DEGs to find possible conserved transcriptional regulation for the meta-DEGs and whether the conserved motifs are shared in the newly identified DEGs or not. The meta-DEGs shared many motifs found in ERF genes and ABA related genes, suggesting hormone-responsive transcriptional regulations are largely controlling transcriptome changes in strawberry fruit ripening (S1 Fig). Furthermore, the newly identified meta-DEGs also shared such motifs indicating the genes likely have altered expression during strawberry fruit ripening (S1 Fig).

Changes in anthocyanin biosynthesis-related genes

Coloration of strawberry fruit during ripening is one of the most prevalent and economically important changes and has been intensively investigated. The color component is mainly anthocyanin, a water-soluble flavonoid compound, with pelargonidin being the major anthocyanin accumulated during ripening [32]. Structural and regulatory genes in the anthocyanin biosynthesis pathway have been investigated in many crop species including strawberry. MYB, bHLH, and WD40 repeat proteins are assembled into an MBW complex and control anthocyanin biosynthesis [33].

There are 1,213 MYB-annotated genes, among which 719 were expressed in fruit tissue and 423 were differentially expressed in at least one of the six cultivars. The meta-DEGs included 191 MYB genes, 155 down-regulated and 36 up-regulated at FR compared with BG stage. FaMYB1 (AF401220) and FaMYB10 (EU155162) are ripening-related transcription factors, known to be negative or positive regulators of anthocyanin biosynthesis [3, 34, 35]. We identified three orthologs for each of the FaMYB1 and FaMYB10 genes in the F. × ananassa reference genome (https://www.rosaceae.org) with high significance (E-value < 10−8) in BLAST searches: FxaC_19g15290, FxaC_20g18010, and FxaC_18g28180 for FaMYB1 and FxaC_3g25830, FxaC_4g15020, and FxaC_2g30690 for FaMYB10, respectively. Two of the MYB1 orthologs and all of the MYB10 orthologs were up-regulated meta-DEGs (Fig 3). The three FaMYB10 genes were designated as FaMYB10-1 (FxaC_4g15020), FaMYB10-2 (FxaC_2g30690), FaMYB10-3B (FxaC_3g25830) and the FaMYB10-2 was recently confirmed to be involved in coloration of strawberry fruit [36].

thumbnail
Fig 3.

Heatmap of anthocyanin biosynthesis regulatory genes (A) and structural genes (B). Up- and down-regulated meta-DEGs between big green (BG) and fully red (FR) stages are colored red and green, respectively. Average FPKM values for developmental stages of the six cultivars are color-scaled the same for each gene ortholog. The numbers for FPKM values were presented if necessary. CHS: chalcone synthase; CHI: chalcone isomerase; F3H: flavanone3 hydroxylase; DFR: dihydroflavonol 4-reductase; ANS: anthocyanidin synthase; UFGT: UDP-glucose:flavonoid 3-O-glycosyltransferase.

https://doi.org/10.1371/journal.pone.0252685.g003

A total of 113 bHLH orthologs have been identified in the F. vesca genome, and their transcription profiles have been investigated in three different-colored cultivars, ‘Benihoppe’, ‘Snow princess’, and ‘Xiobai’ [37]. From this analysis, Zhao et al. suggested that seven bHLH genes are involved in anthocyanin biosynthesis [37]. We examined transcript profiles of these seven bHLH genes in three more strawberry cultivars. There are 454 genes annotated as bHLH genes in the F. × ananassa reference genome, which is about four times as many as in F. vesca, consistent with the octoploid origin of F. × ananassa. Among these, 282 genes were expressed (FPKM > 0.3) in at least one of the fruit samples and 190 genes were differentially expressed between BG and FR in at least one of the six cultivars. For bHLH genes, we identified 97 meta-DEGs comprising 76 up-regulated and 21 down-regulated genes at the FR compared to BG stage. Among the seven bHLH genes suggested to be involved in fruit anthocyanin biosynthesis [37], all three orthologs of FvbHLH27 and one ortholog for FvbHLH40 were up-regulated. By contrast, one ortholog of the FvbHLH80 and three FvbHLH98 orthologs were down-regulated (Fig 3A).

The other component of the MBW complex is WD40 repeat-containing proteins. There were 1,028 genes annotated as WD40 repeat-containing proteins of which 137 were not expressed in samples from any of the six cultivars and 693 did not show any changes in expression between the BG and FR stages in the six cultivars. We identified 60 meta-DEGs: 33 genes up-regulated and 27 down-regulated in FR compared with BG samples. Since there have been few studies on these WD40 repeat genes, whose expressions corresponded to other MBW complex genes were selected as possible candidates that were up-regulated in most of the cultivars. From the expression analysis of the six cultivars and meta-analysis, we propose that the WD40 genes listed in Fig 3, especially FxaC_21g40710, which was significantly up-regulated in all cultivars except ‘Snow princess’, could be components of an MBW complex with FaMYB1 and FabHLH27 or FabHLH40. If this is the case, the WD40 gene could be also critical for white flesh color in the cultivar ‘Snow princess’ along with FaMYB10 [38]. Further analysis and experimental evidences would be necessary for testing this possibility.

We also observed changes in expression of anthocyanin biosynthesis structural genes encoding enzymes such as chalcone synthase (CHS), chalcone isomerase (CHI), flavanone 3-hydroxylase (F3H), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS), and UDP-glucose: flavonoid 3-O-glycosyltransferase (UFGT) (Fig 3B). Clear differences between white- and red-fruit cultivars were observed in genes encoding the first three enzymes, CHS, CHI, and F3H. Orthologs of these genes were up-regulated in all red cultivars at the FR stage compared to BG but down-regulated in ‘Snow princess’. ‘Xiaobai’, which has red skin but white flesh, showed a relatively smaller increase in expression of genes encoding CHI and F3H at the FR stage than the red-skin and red-flesh cultivars. The similar response of these three structural genes supports the idea that a regulatory gene is the determinant of white-color fruit cultivars and this could be a WD40 gene, a component of the MYB-bHLH-WD40 (MBW) complex.

Other aspects of strawberry ripening

MADS box genes are related to many aspects of plant development as well as fruit ripening [39]. There are 255 MADS-annotated genes in the strawberry reference genome and 97 of them were expressed in samples of at least one stage from the six cultivars with FPKM value higher than 0.3. Among these, 54 genes were differentially expressed in at least one of the six cultivars and 34 genes were selected as meta-DEGs with 26 down-regulated and eight up-regulated at FR compared with BG stage. However, FxaC_13g22210, corresponding to FaMADS1 (GQ398009) involved in strawberry fruit ripening, was up-regulated in ‘Snow princess’ and ‘Toyonoka’ only [40]. FaMADS9 ortholog FxaC_22g08610 was also up-regulated in ‘Snow princess’ and ‘Toyonoka’ only [39]. These results suggest that MADS box genes are more likely involved in fruit development than ripening and support the idea that different cultivars exhibit inconsistency in their development and coloration.

Min et al. suggest that several transcription factors are involved in fruit ripening and possibly related to postharvest storability [30]. These candidates are NAC83, WRKY40, and WRKY48, encoded by genes corresponding to FxaC_13g22700, FxaC_24g33610, and FxaC_14g18300 in the new strawberry reference genome. We also determined these to be meta-DEGs, supporting the possibility that these genes are involved in ripening and postharvest storability in a wide range of cultivars.

A relationship between ripening and ubiquitin-mediated proteolysis was proposed and investigated [29]. A number of ubiquitination-related genes were also included among the meta-DEGs, such as ubiquitin-activating enzyme (E1), ubiquitin-conjugating enzyme (E2), and ubiquitin-protein ligase (E3). Among the 21, 203, and 359 genes annotated as E1, E2, and E3, respectively, we selected 0, 14, and 36 as meta-DEGs. These results support the relationship between ripening and ubiquitination as previously suggested [29].

Discussion

We performed a meta-analysis to obtain an in-depth view of strawberry ripening using publicly available strawberry RNA-Seq data. We mapped raw reads of previous data to the new strawberry reference genome [1] and elucidated DEGs in six cultivars as well as meta-DEGs.

Genetic relationships among the six cultivars and reference cultivar ‘Camarosa’ were investigated from SNP genotypes based on genic sequences. Due to the high level of polyploidy and heterozygotic nature of asexually propagated strawberry cultivars, a limited number of loci could be genotyped; however, to the best of our knowledge, this is the largest genome-wide genotype comparison among F. × ananassa cultivars. Since strawberry breeding history is relatively short and cultivar diversity is limited, this information will be valuable for planning breeding programs and designing future experiments with different strawberry cultivars [41].

Enriched GO terms associated with the DEGs showed consistent results with previous studies but also revealed the characteristics of the six cultivars. Notably, enriched GO terms associated with down-regulated DEGs were more consistent among meta-DEGs and the six cultivars than those of up-regulated DEGs. Moreover, the meta-DEGs allowed us an expanded view of changes at different ripening stages, which could not be discovered from a single study. We additionally identified 483 meta-DEGs not revealed in any of the single-cultivar studies along with 695 meta-DEGs detected in only one of the six cultivars. This was possible by increasing the statistical significance of the meta-analysis with an increased number of cultivars.

We selected fruit coloration as a comparable phenotype in the six cultivars and showed that meta-analysis could empower investigation of the underlying mechanism. From the meta-analysis we were able to find a candidate gene (FxaC_21g40710) for a WD40 protein, which possibly forms an MBW complex with FaMYB1 and FabHLH27 or FabHLH40. We also propose that WD40 genes could be color-determinant genes in the six strawberry cultivars. Further investigations of this WD40 candidate should be performed.

Color can be determined qualitatively and is subject to little environmental effect; thus, it is appropriate for the combined analysis of cultivars grown under different conditions. Phenotyping applied together or a standard phenotype index for the six cultivars would make the meta-DEGs more powerful for understanding the transcriptomic contribution to the phenotype. Furthermore, standard gene ID and accumulated expression data should be combined with a user-friendly interface for future strawberry research.

In summary, our meta-analysis of public transcriptome data provides an expanded view of strawberry ripening including common and cultivar-specific transcriptome changes. This work demonstrates that meta-analysis of existing transcriptome data can provide a deep understanding of specific processes not revealed by a single study. As the number of transcriptome studies increases and data accumulate, systematic analysis using meta-analysis will be necessary for maximizing data utility and addressing biological questions.

Supporting information

S1 Fig. Significantly enriched motifs in promoters of meta-DEGs.

Enriched motifs sequences were searched in all meta-DEGs (12,339 genes) (A) and newly identified meta-DEGs (483 genes) (B).

https://doi.org/10.1371/journal.pone.0252685.s002

(TIF)

S1 Table. RNA-Seq reads and mapping summary for studies used in this analysis.

https://doi.org/10.1371/journal.pone.0252685.s003

(DOCX)

S2 Table. Pearson’s correlation coefficient (R2) between samples based on their expression profiles.

https://doi.org/10.1371/journal.pone.0252685.s004

(DOCX)

S3 Table. Number of DEGs in each study and DEGs retained after meta-analysis.

https://doi.org/10.1371/journal.pone.0252685.s005

(DOCX)

S4 Table. The 483 meta-DEGs from meta-analysis of the six cultivars.

https://doi.org/10.1371/journal.pone.0252685.s006

(XLSX)

Acknowledgments

The authors thank Jong Seung Kim MD, PhD (Chonbuk National University Hospital) for his technical advice for meta-analysis.

References

  1. 1. Edger PP, Poorten TJ, VanBuren R, Hardigan MA, Colle M, McKain MR, et al. Origin and evolution of the octoploid strawberry genome. Nat Genet. 2019;51: 541–547. pmid:30804557
  2. 2. Giovannoni JJ. Genetic regulation of fruit development and ripening. Plant Cell. 2004;16: S170–S180. pmid:15010516
  3. 3. Aharoni A, O’Connell AP. Gene expression analysis of strawberry achene and receptacle maturation using DNA microarrays. J Exp Bot. 2002;53: 2073–2087. pmid:12324531
  4. 4. Lin Y, Jiang L, Chen Q, Li Y, Zhang Y, Luo Y, et al. Comparative transcriptome profiling analysis of red- and white-fleshed strawberry (Fragaria x ananassa) provides new insight into the regulation of anthocyanins pathway. Plant Cell Physiol. 2018;59: 1844–1859. pmid:29800352
  5. 5. Wang QH, Zhao C, Zhang M, Li YZ, Shen YY, Guo JX. Transcriptome analysis around the onset of strawberry fruit ripening uncovers an important role of oxidative phosphorylation in ripening. Sci Rep. 2017;7: 41477. pmid:28195221
  6. 6. Baldi P, Orsucci S, Moser M, Brilli M, Giongo L, Si-Ammour A. Gene expression and metabolite accumulation during strawberry (Fragaria x ananassa) fruit development and ripening. Planta. 2018;248: 1143–1157. pmid:30066220
  7. 7. Bianco L, Lopez L, Scalone AG, Di Carli M, Desiderio A, Benvenuto E, et al. Strawberry proteome characterization and its regulation during fruit ripening and in different genotypes. J Proteomics. 2009;72: 586–607. pmid:19135558
  8. 8. Zhang J, Wang X, Yu O, Tang J, Gu X, Wan X, et al. Metabolic profiling of strawberry (Fragaria x ananassa Duch.) during fruit development and maturation. J Exp Bot. 2011;62: 1103–1118. pmid:21041374
  9. 9. Moya-Leon MA, Mattus-Araya E, Herrera R. Molecular events occurring during softening of strawberry fruit. Front Plant Sci. 2019;10: 615. pmid:31156678
  10. 10. Hancock JF, Sjulin TM, Lobos GA. “Strawberries” in Temperate fruit crop breeding: germplasm to genomics ed J. F. Hancock (Springer: Heidelberg, 2008).
  11. 11. Rest JS, Wilkins O, Yuan W, Purugganan MD, Gurevitch J. Meta-analysis and meta-egression of transcriptomic responses to water stress in Arabidopsis. Plant J. 2016;85: 548–560. pmid:26756945
  12. 12. Zhao J, Sauvage C, Zhao J, Bitton F, Bauchet G, Liu D, et al. Meta-analysis of genome-wide association studies provides insights into genetic control of tomato flavor. Nat Commun. 2019;10: 1534. pmid:30948717
  13. 13. Shaar-Moshe L, Hubner S, Peleg Z. Identification of conserved drought-adaptive genes using a cross-species meta-analysis approach. BMC Plant Biol. 2015;15: 111. pmid:25935420
  14. 14. Sudmant PH, Alexis MS, Burge CB. Meta-analysis of RNA-seq expression data across species, tissues and studies. Genome Biol. 2015;16: 287. pmid:26694591
  15. 15. Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25: 1105–1111. pmid:19289445
  16. 16. Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31: 166–169. pmid:25260700
  17. 17. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26: 139–140. pmid:19910308
  18. 18. Cohen SP, Leach JE. Abiotic and biotic stresses induce a core transcriptome response in rice. Sci Rep. 2019;9: 6273. pmid:31000746
  19. 19. Dewey M. metap: Meta-analysis of significance values. R package version 1.1. (2019).
  20. 20. Le S, Josse J, Husson F. FactoMineR: An R package for multivariate analysis. J Stat Softw. 2008;25: 1–18.
  21. 21. Alexa A, Rahnenfuhrer J. topGO: enrichment analysis for gene ontology. R package version 2.34.0 (2018).
  22. 22. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28: 27–30. pmid:10592173
  23. 23. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25: 1754–1760. pmid:19451168
  24. 24. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009;25: 2078–2079. pmid:19505943
  25. 25. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Mol Biol Evol. 2018;35: 1547–1549. pmid:29722887
  26. 26. Bailey TL, Elkan C. (1994). Fitting a mixture model by expectation maximization to discover motifs in bipolymers.
  27. 27. Gupta S, Stamatoyannopoulos JA, Bailey TL, Noble WS. Quantifying similarity between motifs. Genome Biol. 2007;8: 1–9. pmid:17324271
  28. 28. Khan A, Fornes O, Stigliani A, Gheorghe M, Castro-Mondragon JA, Van Der Lee R, et al. JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic Acids Res. 2018;46: D260–D266. pmid:29140473
  29. 29. Hu P, Li G, Zhao X, Zhao F, Li L, Zhou H. Transcriptome profiling by RNA-Seq reveals differentially expressed genes related to fruit development and ripening characteristics in strawberries (Fragaria x ananassa). PeerJ. 2018;6: e4976. pmid:29967718
  30. 30. Min K, Yi G, Kim HK, Hong Y, Choi JH, Lee EJ. Comparative transcriptome and metabolome analyses of two strawberry cultivars with different storability. PLOS ONE. 2020;15: e0242556. pmid:33264316
  31. 31. Hirakawa H, Shirasawa K, Kosugi S, Tashiro K, Nakayama S, Yamada M, et al. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species. DNA Res. 2014;21: 169–181. pmid:24282021
  32. 32. Almeida JRM, D’Amico E, Preuss A, Carbone F, de Vos CHR, Deiml B, et al. Characterization of major enzymes and genes involved in flavonoid and proanthocyanidin biosynthesis during fruit development in strawberry (Fragaria x ananassa). Arch Biochem. 2007;465: 61–71. pmid:17573033
  33. 33. Xu W, Dubos C, Lepiniec L. Transcriptional control of flavonoid biosynthesis by MYB-bHLH-WDR complexes. Trends Plant Sci. 2015;20: 176–185. pmid:25577424
  34. 34. Lin-Wang K, McGhie T, Wang M, Liu Y, Warren B, Storey R, et al. Engineering the anthocyanin regulatory complex of strawberry (Fragaria vesca). Front Plant Sci 2014;5: 651. pmid:25477896
  35. 35. Medina-Puche L, Cumplido-Laso G, Amil-Ruiz F, Hoffmann T, Ring L, Rodríguez-Franco A, et al. MYB10 plays a major role in the regulation of flavonoid/phenylpropanoid metabolism during ripening of Fragaria × ananassa fruits. J Exp Bot. 2014;65: 401–417. pmid:24277278
  36. 36. Castillejo C, Waurich V, Wagner H, Ramos R, Oiza N, Muñoz P, et al. Allelic variation of MYB10 is the major force controlling natural variation in skin and flesh color in strawberry (Fragaria spp.) fruit. Plant Cell. 2020;32: 3723–3749. pmid:33004617
  37. 37. Zhao F, Li G, Hu P, Zhao X, Li L, Wei W, et al. Identification of basic/helix-loop-helix transcription factors reveals candidate genes involved in anthocyanin biosynthesis from the strawberry white-flesh mutant. Sci Rep. 2018;8: 2721. pmid:29426907
  38. 38. Wang H, Zhang H, Yang Y, Li M, Zhang Y, Liu J, et al. The control of red colour by a family of MYB transcription factors in octoploid strawberry (Fragaria × ananassa) fruits. Plant Biotechnol J. 2020;18:1169–1184. pmid:31647169
  39. 39. Vallarino JG, Merchante C, Sánchez-Sevilla JF, Balaguer MA, Pott DM, Ariza MT, et al. Characterizing the involvement of FaMADS9 in the regulation of strawberry fruit receptacle development. Plant Biotechnol J. 2020;18: 929–943. pmid:31533196
  40. 40. Pimentel P, Salvatierra A, Moya-León MA, Herrera R. Isolation of genes differentially expressed during development and ripening of Fragaria chiloensis fruit by suppression subtractive hybridization. J Plant Physiol. 2010;167: 1179–1187. pmid:20413181
  41. 41. Gil-Ariza DJ, Amaya I, López-Aranda JM, Sánchez-Sevilla JF, Botella MÁ, Valpuesta V. Impact of plant breeding on the genetic diversity of cultivated strawberry as revealed by expressed sequence tag-derived simple sequence repeat markers. J Amer Soc Hort Sci. 2009;134: 337–347.