Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The R2R3-MYB gene family in banana (Musa acuminata): Genome-wide identification, classification and expression patterns

Abstract

The R2R3-MYB genes comprise one of the largest transcription factor gene families in plants, playing regulatory roles in plant-specific developmental processes, defense responses and metabolite accumulation. To date MYB family genes have not yet been comprehensively identified in the major staple fruit crop banana. In this study, we present a comprehensive, genome-wide analysis of the MYB genes from Musa acuminata DH-Pahang (A genome). A total of 285 R2R3-MYB genes as well as genes encoding three other classes of MYB proteins containing multiple MYB repeats were identified and characterised with respect to structure and chromosomal organisation. Organ- and development-specific expression patterns were determined from RNA-Seq data. For 280 M. acuminata MYB genes for which expression was found in at least one of the analysed samples, a variety of expression patterns were detected. The M. acuminata R2R3-MYB genes were functionally categorised, leading to the identification of seven clades containing only M. acuminata R2R3-MYBs. The encoded proteins may have specialised functions that were acquired or expanded in Musa during genome evolution. This functional classification and expression analysis of the MYB gene family in banana establishes a solid foundation for future comprehensive functional analysis of MaMYBs and can be utilized in banana improvement programmes.

Introduction

Banana (Musa spp.), including dessert and cooking types, is a staple fruit crop for a major world population, especially in developing countries. The crop is grown in more than 100 countries throughout the tropics and sub-tropics, mainly in the African, Asia-Pacific, and Latin American and Caribbean regions [1]. Bananas provide an excellent source of energy and are rich in certain minerals and in vitamins A, C and B6. Furthermore, this perennial, monocotyledonous plant provides an important source of fibre, sugar, starch and cellulose (used for paper, textiles). Bananas have also been considered as a useful tool to deliver edible vaccines [2]. Certain agronomic traits, such as stress and pest resistance as well as fruit quality, are thus of considerable interest. Banana improvement through breeding exercises has been challenging for various reasons. Therefore, genetic engineering-based optimisations hold great promise for crop improvement. For this purpose, candidate gene targets need to be identified. The release of a high quality banana genome sequence [3] provides an useful resource to understand functional genomics of important agronomic traits and to identify candidate genes to be utilized in banana improvement programmes.

Almost all biological processes in eukaryotic cells or organisms are influenced by transcriptional control of gene expression. Thus, the regulatory level is a good starting point for genetic engineering. Regulatory proteins are involved in transcriptional control, alone or complexed with other proteins, by activating or repressing (or both) the recruitment of RNA polymerase to promoters of specific genes [4]. These proteins are called transcription factors. As expected from their substantial regulatory complexity, transcription factors are numerous and diverse [5]. By binding to specific DNA sequence motifs and regulating gene expression, transcription factors control various regulatory and signaling networks involved in the development, growth and stress response in an organism.

One of the widest distributed transcription factor families in all eukaryotes is the MYB (myeloblastosis) protein family. In the plant kingdom, MYB proteins constitute one of the largest transcription factor families. MYB proteins are defined by a highly conserved MYB DNA-binding domain, mostly located at the N-terminus of the protein. The MYB domain generally consists of up to four imperfect amino acid sequence repeats (R) of about 50–53 amino acids, each forming three alpha–helices [summarised in 6]. The second and third helices of each repeat build a helix–turn–helix (HTH) structure with three regularly spaced tryptophan (or hydrophobic) residues, forming a hydrophobic core [7]. The third helix of each repeat was identified as the DNA recognition helix that makes direct contact with DNA [8]. During DNA contact, two MYB repeats are closely packed in the major groove, so that the two recognition helices bind cooperatively to the specific DNA recognition sequence motif. In contrast to vertebrates genomes, which only encode MYB transcription factors with three repeats, plants have different MYB domain organisations, comprising one to four repeats [6, 9]. R2R3-MYBs, which are MYB proteins with two repeats (named according to repeat numbering in vertebrate MYBs), are particularly expanded in plant genomes. Copy numbers range from 45 unique R2R3-MYBs in Ginkgo biloba [10] to 360 in Mexican cotton (Gossypium hirsutum) [11]. The expansion of the R2R3-MYB family was coupled with widening in the functional diversity of R2R3-MYBs, considered to regulate mainly plant-specific processes including secondary metabolism, stress responses and development [6]. As expected, R2R3-MYBs are involved in regulating several biological traits, for example wine quality, fruit color, cotton fibre length, pollinator preferences and nodulation in legumes.

In this study, we have used genomic resources to systematically identify members of the M. acuminata (A genome) R2R3-MYB gene family. We used knowledge from other plant species, including the model plant A. thaliana, leading to a functional classification of the banana R2R3-MYB genes based on the MYB phylogeny. Furthermore, RNA-Seq data was used to analyse expression in different M. acuminata organs and developmental stages and to compare expression patterns of closely grouped co-orthologs. The identification and functional characterization of the R2R3-MYB gene family from banana will provide an insight into the regulatory aspects of different biochemical and physiological processes, as those operating during fruit ripening as well as response to various environmental stresses. Our findings offer the first step towards further investigations on the biological and molecular functions of MYB transcription factors with the selection of genes responsible for economically important traits in Musa, which can be utilized in banana improvement programmes.

Material and methods

Search for MYB protein coding genes in the M. acuminata genome

A consensus R2R3-MYB DNA binding domain sequence [12] (S1 Table) was used as protein query in tBLASTn [13] searches on the M. acuminata DH-Pahang genome sequence (version 2) (https://banana-genome-hub.southgreen.fr/sites/banana-genome-hub.southgreen.fr/files/data/fasta/version2/musa_acuminata_v2_pseudochromosome.fna) in an initial search for MYB protein coding genes. To confirm the obtained amino acid sequences, the putative MYB sequences were manually analysed for the presence of an intact MYB domain. All M. acuminata MYB candidates from the initial BLAST were inspected to ensure that the putative gene models encode two or more (multiple) MYB repeats. The identified gene models were analysed to map them individually to unique loci in the genome and redundant sequences were discarded from the data set to obtain unique MaMYB genes. The identified MaMYB genes were matched to the automatically annotated genes from the Banana Genome Hub database [14]. The open reading frames of the identified MaMYB were manually inspected and, if available, verified by mapping of RNA-Seq data to the genomic sequence. Resulting MaMYB annotation is provided in the supporting information: Multi-FASTA files with MaMYB CDS sequences (S1 File) and MaMYB peptide sequences (S2 File), general feature format (GFF) file (S3 File) describing the MaMYB genes for use in genome viewers/browers.

Genomic distribution of MaMYB genes

The MaMYB genes were located on the corresponding chromosomes by the MapGene2chromosome web v2 (MG2C) software tool (http://mg2c.iask.in/mg2c_v2.0/) according to their position information of a physical map, available from the DH-Pahang genome annotation (V2).

Phylogenetic analyses

Protein sequences of 133 A. thaliana MYBs were obtained from TAIR (http://www.arabidopsis.org/). We also considered other multiple MYB-repeat proteins from A. thaliana in the phylogenetic analysis to determine orthologs in the M. acuminata genome: five AtMYB3R, AtMYB4R1 and AtCDC5. Additionally, 43 well-known, functionally characterised landmark plant R2R3-MYB protein sequences were collected from GenBank at the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/).

Phylogenetic trees were constructed from ClustalΩ [15] aligned MYB domain sequences (293 MaMYBs, 132 AtMYBs and 43 plant landmark MYBs) using MEGA7 [16] with default settings. A mojority rule Maximum Likelihood (ML) consensus tree inferred from 1000 bootstrap replications was calculated. M. acuminata MYB proteins were classified according to their relationships with corresponding A. thaliana and landmark MYB proteins.

Expression analysis from RNA-Seq data

RNA-Seq data sets were retrieved from the Sequence Read Archive (https://www.ncbi.nlm.nih.gov/sra) via fastq-dump v2.9.6 (https://github.com/ncbi/sra-tools) (S2 Table). STAR v2.5.1b [17] was applied for the mapping of reads to the Pahang v2 reference genome sequence [18] using previously described parameters [19]. featureCounts [20] was applied for quantification of the mapped reads per gene based on the Pahang v2 annotation. Previously developed Python scripts were deployed for the calculation of normalized expression values (https://github.com/bpucker/bananaMYB) [19].

Results and discussion

The first annotated reference genome sequence of M. acuminata (A genome) became available in 2012 [3]. It was obtained from a double haploid (DH) plant of the Pahang cultivar, derived through haploid pollen and spontaneous chromosome doubling from the wild subspecies Musa acuminata ssp. malaccensis [3]. This wild subspecies was involved in the domestication of the vast majority of cultivated bananas and its genetic signature is commonly found in dessert and cooking bananas (ProMusa, http://www.promusa.org). An improved version (DH-Pahang version 2) of the genome assembly and annotation was presented in 2016 comprising 450.7 Mb (86% of the estimated size) from which 89.5% are assigned to one of the 11 chromosomes, predicted to contain 35,276 protein encoding genes [18]. This M. acuminata DH-Pahang version 2 genome sequence provides the platform of this study.

Identification and genomic distribution of M. acuminata MYB genes

A consensus R2R3-MYB DNA binding domain sequence (deduced from Arabidopsis, grape and sugarbeet R2R3-MYBs, S1 Table) was used as protein query in tBLASTn searches on the DH-Pahang version 2 genome sequence to comprehensively identify MYB protein coding genes in M. acuminata. The resulting putative MYB sequences were proven to map to unique loci in the genome and were confirmed to contain an intact MYB domain. This ensured that the gene models contained two or more (multiple) MYB repeats. We identified a set of 285 R2R3-MYB proteins and nine multiple repeat MYB proteins distantly related to the typical R2R3-MYB proteins: six R1R2R3-MYB (MYB3R) proteins, two MYB4R proteins and one CDC5-like protein from the M. acuminata genome sequence (Table 1).

thumbnail
Table 1. List of annotated MYB genes in the Musa acuminata (DH Pahang) genome sequence.

https://doi.org/10.1371/journal.pone.0239275.t001

The number of R2R3-MYB genes is one of the highest among the species that have been studied to date, ranging from 45 in Ginkgo biloba [10] over 157 in Zea mays [25] and 249 in Brassica napus [26] to 360 in Gossypium hirsutum [11]. This is probably due to three whole-genome duplications (γ 100 Myr ago and α, β 65 Myr ago) that occurred during Musa genome evolution [3, 27]. The number of atypical multiple repeat MYB genes identified in M. acuminata is in the same range as those reported for most other plant species, up to six MYB3R and up to two MYB4R and CDC5-like genes.

The 285 MaR2R3-MYB genes identified constitute approximately 0.81% of the 35,276 predicted protein-coding M. acuminata genes and 9.0% of the 3,155 putative M. acuminata transcription factor genes [18]. These were subjected to further analyses. The identified MaR2R3-MYB genes were named following the nomenclature of the locus tags provided in the DH-Pahang version 2 genome annotation (Table 1). A keyword search in the NCBI database (http://www.ncbi.nlm.nih.gov/) revealed no evidence for M. acuminata MYB genes not present in Table 1.

Six publications dealing with M. acuminata MYB genes were identified: one study describes the elevated expression of nine MaMYB genes in transgenic banana plants overexpressing the NAC domain transcription factor MusaVND1 (vascular related NAC domain) indicating a role of these MaMYBs in the regulation of secondary wall deposition [22]. MYBS3 (Ma08_25960) was found to be differentially expressed between cold-sensitive and cold-tolerant bananas [24]. Another publication described the M. acuminata R2R3-MYB gene Ma05_03690 being upregulated in the early response to the endoparasitic root-knot nematode Meloidogyne incognita in roots [28]. MusaMYB31 (Ma01_02850) was identified as a negative regulator of lignin biosynthesis and the general phenylpropanoid biosynthesis pathway [21]. MaMYB3 (Ma06_11140) was found to repress starch degradation in fruit ripening [23] and MaMYB4 (Ma01_19610) was recently described to control fatty acid desaturation [29].

On the basis of the DH-Pahang version 2 annotation, 291 of the 294 MaMYB genes could be assigned to the eleven chromosomes. The chromosomal distribution of MaMYB genes on the pseudochromosomes is shown in Fig 1 and revealed that M. acuminata MYB genes are distributed across all chromosomes.

thumbnail
Fig 1. Distribution of MaMYB genes on the eleven M. acuminata chromosomes.

https://doi.org/10.1371/journal.pone.0239275.g001

Gene structure analysis revealed that most MaR2R3-MYB genes (155 of 285; 54.4%) follow the previously reported rule of having two introns and three exons, and display the highly conserved splicing arrangement that has also been reported for other plant species [25, 30, 31]. A total of 23 (8.1%) MaR2R3-MYB genes have two exons and seven (2.5%) were single exon genes. 67 (23.5%) MaR2R3-MYB genes have four exons, 21 (7.4%) five exons, eight (2.8%) six exons and two of each with seven and twelve exons, respectively. The complex exon-intron structure of Ma06_12110 and Ma10_18840 is conserved in their A. thaliana orthologs AtMYB88 and AtMYB124/FOUR LIPS (FLP) containing ten and eleven exons, respectively. This supports their close evolutionary relationship, but also suggests the conservation of this intron pattern in evolution since the monocot-dicot split 140–150 Myr ago [32].

Chromosomes are drawn to scale. The positions of centromeres (grey ovals) are roughly estimated from repeat distribution data. The chromosomal positions of the MaMYB genes (given in DH Pahang version 2 annotation ID) are indicated. R2R3-MYB genes are given in black letters, MYB genes with more than two MYB repeats are given in blue letters. The number of MYB genes on each chromosome is given below the respective chromosome.

Phylogenetic analysis of the M. acuminata MYB family

With the aim to explore the putative function of the predicted M. acuminata MYBs, we assigned them to plant MYB proteins with known function. For this, we chose primarily data from A. thaliana, which is the source of most functional MYB characterisations. From comparable studies, MYB function appears conserved across MYB clades, suggesting that closely related MYBs recognise similar/same target genes and possess cooperative, overlapping or redundant functions.

To unravel the relationships, we constructed a phylogenetic tree with 468 MYB domain amino acid sequences of MYB proteins. We used 293 MaMYBs (omitting the CDC5-like MaMYB), the complete A. thaliana MYB family (132 members, including 126 R2R3-MYB, five MYB3R and one MYB4R) and 43 functionally well characterised landmark R2R3-MYBs from other plant species. The phylogenetic tree topology allowed us to classify the analysed MYBs into one MYB3R clade, one MYB4R clade and 42 R2R3-MYB protein clades (Fig 2).

Most R2R3-MYB clades (31 of 42) include variable numbers of MYB proteins from Arabidopsis and banana, indicating that the appearance of most MYB genes in these two species predates the monocot-dicot split as observed in other studies [33]. Several of these clades also contain landmark MYBs from other plant species. The two clades 4 and 28 only contain Arabidopsis R2R3-MYB members, while seven clades (9, 13, 19, 21, 26, 29 and 32, displayed with white background in Fig 2) only contain banana R2R3-MYB members. Additionally, two clades, 24 with monocot anthocyanin biosynthesis regulators and 27 with the strawberry landmark flavonoid biosynthesis repressor FaMYB1 [34], do not contain any A. thaliana MYB (Fig 2).

ML consensus tree inferred from 1000 bootstraps with 468 MYB domain amino acid sequences of MYB proteins from Musa accuminata (Ma), Arabidopsis thaliana (At) and landmark MYBs from other plant species built with MEGA7. Clades are labeled with different colors and functional annotations are given. The numbers at the branches give bootstrap support values from 1000 replications. SCW, secondary cell wall.

The lineage specificity of some MYB clades could indicate that these clades may have been lost or gained in a single order or species during plant evolution, as indicated by other studies [3537]. For example, clade 28 lacked M. acuminata orthologs, but includes the A. thaliana R2R3-MYBs AtMYB0/GLABRA1 and AtMYB66/WEREWOLF, which have been identified as being involved in the formation of trichomes and root hairs from epidermal cells [38, 39]. Similar observations have been made in maize (monocot) and sugarbeet (eudicot, caryophyllales) [30, 31], both not containing clade 28 orthologs, while grape (eudicot, rosid) and poplar (eudicot, rosid) do [12, 40]. It has been hypothesized that GLABRA1-like MYB genes have been acquired in rosids after the rosid-asterids split [41]. The absence of MaMYBs in clade 28 is consistent with this hypothesis, since monocots branched off before the separation of asterids and rosids in eudicots. Clade 4 also lacks banana R2R3-MYBs. This clade contains the glucosinolate biosynthesis regulators AtMYB28, AtMYB34 and AtMYB51 [42, 43]. The absence of MaMYBs in this clade is concordant with the fact that glucosinolates are only present in the Brassicaceae family. This clade is thought to have originated from a duplication event before the divergence of the genus Arabidopsis from Brassica [44].

The seven clades containing only MaMYBs were manually inspected by applying BLAST searches at the NCBI protein database in order to identify high similarity to functionally characterized landmark plant R2R3-MYBs. In no case could a landmark MYB be identified. Consequently, these clades could be described as a lineage-specific expansion in M. acuminata, reflecting a species-, genus- or order-specific evolutionary change. These MaMYB proteins may have specialised functions that were acquired or expanded in M. acuminata during genome evolution. Further research will be needed to decipher the biological roles of these MaMYB genes.

R2R3-MYBs may interact with basic helix-turn-helix (bHLH)-type transcription factors, together with WD-repeat (WDR) proteins, forming a trimeric MBW complex. These R2R3-MYBs are defined by a bHLH-binding consensus motif [D/E]Lx2[R/K]x3Lx6Lx3R [45] found in all bHLH-interacting R2R3-MYBs. A search in the MaMYB proteins for the mentioned bHLH-interaction motif identified 30 MaMYBs containing this motif (Table 1) and thus putatively interacting with bHLH proteins. 26 of these 30 MaMYBs were all functionally assigned to clades containing (potentially) known bHLH-interacting R2R3-MYBs: nine in clade 22 (repressors phenylpropanoid, sinapate-, lignin regulators), six in clade 25 (proanthocyanidin regulators), four in clade 23 (general flavonoid, trichome regulators), four in clade 27 (flavonoid repressors) and three in clade 24 (monocot anthocyanin regulators). Four MaMYBs, Ma02_19650, Ma04_28300, Ma08_15820 and Ma11_21820 were all found in the M. acuminata-specific clade 26. These MaMYBs may interact with bHLH-type transcription factors. The close relation to flavonoid biosynthesis regulators suggests that they may act similarly to R2R3-MYB proanthocyanidin regulators. Further research is needed to determine if they regulate a Musa-specific biosynthesis pathway.

Expression profiles for M. acuminata MYB genes in different organs and developmental stages

Since it is not unusual for large transcription factor families in higher organisms to have redundant functions, a particular transcription factor needs to be studied and characterised in the context of the whole family. In this regard, the gene expression pattern can provide a hint to gene function. In this study we used publicly-available RNA-Seq reads from the Sequence Read Archive (S2 Table) to analyse the expression of the 293 MaMYB genes in different organs and developmental stages: embryonic cell suspension, seedling, root and young, adult and old leaf, pulp stages S1-S4 and peel stages S1-S4. Filtered RNA-Seq reads were aligned to the reference genome sequence and the number of mapped reads per annotated transcript were quantified and compared across the analysed samples to calculate normalised RNA-Seq read values which are given in Table 2.

thumbnail
Table 2. The expression profiles of MaR2R3-MYB genes in different organs and developmental stages based on RNA-Seq data.

https://doi.org/10.1371/journal.pone.0239275.t002

Our expression analyses revealed that M. acuminata MYBs have diverse expression patterns in different organs. Many of the MaMYBs exhibited low transcript abundance levels with expression in only one or a few organs. This is consistent with other transcription factor genes, typically found to be expressed in this manner due to functional specificity and diversity. The highest number of expressed MaMYB genes (221; 75.4%) is observed in roots, followed by pulp (209; 71.3%), leaf (203; 69.3%), peel (196; 66.9%) and embryonic cell suspension (129; 44%). The fewest MaMYB genes are expressed in seedlings (98; 33.4%) in the considered dataset. 40 MaMYB genes (13.7%) were expressed in all samples analysed (albeit with varying expression levels), which suggested that these MaMYBs play regulatory roles at multiple developmental stages in multiple tissues. 13 MaMYB genes (4.4%) lacked expression information in any of the analysed samples, possibly indicating that these genes are expressed in other organs (e.g. pseudostem, flower, bract), specific cells, at specific developmental stages, under special conditions or are pseudogenes. 280 MaMYBs (95.6%) are expressed in at least one analysed sample, although the transcript abundance of some genes was very low. Some MaMYB genes were expressed in all analysed RNA-Seq samples at similar levels (e.g. R2R3-MYBs Ma06_g33440 and Ma11_g06880) while others show variance in transcript abundance with low (no) levels in one or several organs and high levels in others (or vice versa). For example, Ma02_g16570, Ma03_g09310, Ma07_g11110, Ma10_g01730, Ma10_g05260 and Ma10_g09100 show organ-specific expression, as their transcripts were exclusively detected in leaves, which hints to leaf-specific functionality. Ma03_g07840 and Ma07_g19470 were found to be predominantly expressed in pulp, showing expression also in other analysed organs, but not in the seedling. Overall, these results suggests that the corresponding MaMYB regulators are limited to distinct organs, tissues, cells or conditions.

Some paralogous MaMYB genes clustered in the genome (Tables 1 and 2) showed different expression profiles, while other clustered paralogous MaMYB genes did not. For example, the clustered R2R3-MYB genes Ma07_g19880 and Ma07_g19890 (both in the proanthocyanidin-related cluster 25): while Ma07_g19880 is expressed in pulps, peels, roots, and leaves, Ma07_g19890 is nearly not expressed in the analysed organs. The expression pattern of Ma03_g07840 and Ma03_g07850 (also both in the proanthocyanidin-related cluster 25) is, in contrast, very similar, with low expression in root, leaf, pulp and peel, but no expression in embyonic cells and roots. These results could point to functional redundancy of the genes Ma03_g07840 and Ma03_g07850, while Ma07_g19880 and Ma07_g19890 could be (partly) involved in distinct, tissue-specific aspects of specialised mebabolite biosynthesis. The functional categorisation and its expression domains make Ma07_g19880 a good candidate to encode a proanthocyanidin biosynthesis regulator in developing banana fruits (pulp and peel), and thus an excellent target for genetic manipulation of proanthocyanidin content, known to influence senescence of banana fruit [46], to maintain the freshness of harvested banana fruit.

Conclusions

The present genome-wide identification, chromosomal organisation, functional classification and expression analyses of M. acuminata MYB genes provide a first step towards cloning and functional dissection to decode the role of MYB genes in this economically interesting species. Further, knowledge of selected MYB genes can be biotechnologically utilized for improvement of fruit quality, yield, disease resistance, tolerance to biotic and abiotic stresses, and the biosynthesis of pharmaceutical compounds.

Supporting information

S1 Table. MYB domain consensus sequence of R2R3-MYBs, used for tBLASTn searches.

https://doi.org/10.1371/journal.pone.0239275.s001

(DOCX)

S2 Table. RNA-Seq raw data used for expression analysis.

https://doi.org/10.1371/journal.pone.0239275.s002

(DOCX)

S1 File. Multi-FASTA file with MaMYB CDS sequences.

https://doi.org/10.1371/journal.pone.0239275.s003

(FASTA)

S2 File. Multi-FASTA file with MaMYB peptide sequences.

https://doi.org/10.1371/journal.pone.0239275.s004

(FASTA)

S3 File. MaMYB genes General Feature Format (GFF) file.

For use in genome viewers/browsers on the M. accuminata pseudochromosomes.

https://doi.org/10.1371/journal.pone.0239275.s005

(GFF)

Acknowledgments

We are grateful to Melanie Kuhlmann for excellent technical assistance. We thank Nathanael Walker-Hale for language editing.

References

  1. 1. Frison EA, Sharrock SL. The economic, nutritional and social importance of bananas in the world. In: Cea Picq, editor. Bananas and Food Security. Montpellier, France: INIBAP; 1999. p. 21–35.
  2. 2. Langridge WHR. Edible Vaccines. Scientific American Sp. 2006;16:46–53.
  3. 3. DHont A, Denoeud F, Aury JM, Baurens FC, Carreel F, Garsmeur O, et al. The banana (Musa acuminata) genome and the evolution of monocotyledonous plants. Nature. 2012;488(7410):213–7. pmid:22801500
  4. 4. Fuda NJ, Ardehali MB, Lis JT. Defining mechanisms that regulate RNA polymerase II transcription in vivo. Nature. 2009;461(7261):186–92. pmid:19741698
  5. 5. Riechmann JL, Heard J, Martin G, Reuber L, Jiang CZ, Keddie J, et al. Arabidopsis transcription factors: Genome-wide comparative analysis among eukaryotes. Science. 2000;290(5499):2105–10. pmid:11118137
  6. 6. Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L. MYB transcription factors in Arabidopsis. Trends in Plant Science. 2010;15(10):573–81. pmid:20674465
  7. 7. Ogata K, Kanei-Ishii C, Sasaki M, Hatanaka H, Nagadoi A, Enari M, et al. The cavity in the hydrophobic core of Myb DNA-binding domain is reserved for DNA recognition and trans-activation. Nature Structural Biology. 1996;3(2):178–87. pmid:8564545
  8. 8. Jia L, Clegg MT, Jiang T. Evolutionary dynamics of the DNA-binding domains in putative R2R3-MYB genes identified from rice subspecies indica and japonica genomes. Plant Physiology. 2004;134(2):575–85. pmid:14966247
  9. 9. Stracke R, Werber M, Weisshaar B. The R2R3-MYB gene family in Arabidopsis thaliana. Current Opinion in Plant Biology. 2001;4:447–56. pmid:11597504
  10. 10. Liu X, Yu W, Zhang X, Wang G, Cao F, Cheng H. Identification and expression analysis under abiotic stress of the R2R3-MYB genes in Ginkgo biloba L. Physical and Molecular Biology of Plants. 2017;23(3):503–16.
  11. 11. Salih H, Gong W, He S, Sun G, Sun J, Du X. Genome-wide characterization and expression analysis of MYB transcription factors in Gossypium hirsutum. BMC Genetics. 2016;17(1):129. pmid:27613381
  12. 12. Matus JT, Aquea F, Arce-Johnson P. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes. BMC Plant Biology. 2008;8:83. pmid:18647406
  13. 13. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. Journal of Molecular Biology. 1990;215(3):403–10. pmid:2231712
  14. 14. Droc G, Larivière D, Guignon V, Yahiaoui N, This D, Garsmeur O, et al. The banana genome hub. Database (Oxford). 2013;2013(bat035).
  15. 15. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Molecular Systems Biology. 2011;7:539. pmid:21988835
  16. 16. Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Molecular Biology and Evolution. 2016;33(7):1870–4. pmid:27004904
  17. 17. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. pmid:23104886
  18. 18. Martin G, Baurens FC, Droc G, Rouard M, Cenci A, Kilian A, et al. Improvement of the banana "Musa acuminata" reference sequence using NGS data and semi-automated bioinformatics methods. BMC Genetics. 2016;17(243).
  19. 19. Haak M, Vinke S, Keller W, Droste J, Rückert C, Kalinowski J, et al. High Quality de Novo Transcriptome Assembly of Croton tiglium. Frontiers in Molecular Biosciences. 2018;5:62. pmid:30027092
  20. 20. Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30(7):923–30. pmid:24227677
  21. 21. Tak H, Negi S, Ganapathi TR. Overexpression of MusaMYB31, a R2R3 type MYB transcription factor gene indicate its role as a negative regulator of lignin biosynthesis in banana. PLoS ONE. 2017;12(2):e0172695. pmid:28234982
  22. 22. Negi S, Tak H, Ganapathi TR. Cloning and functional characterization of MusaVND1 using transgenic banana plants. Transgenic Research. 2015;24(3):571–85. pmid:25523085
  23. 23. Fan ZQ, Ba LJ, Shan W, Yun-Yi X, Wang-Jin L, Jian-Fei K, et al. A banana R2R3-MYB transcription factor MaMYB3 is involved in fruit ripening through modulation of starch degradation by repressing starch degradation-related genes and MabHLH6. The Plant Journal. 2018;96(6):1191–1205. pmid:30242914
  24. 24. Yang QS, Gao J, He WD, Dou TX, Ding LJ, Wu JH, et al. Comparative transcriptomics analysis reveals difference of key gene expression between banana and plantain in response to cold stress. BMC Genomics. 2015;16:446. pmid:26059100
  25. 25. Du H, Feng BR, Yang SS, Huang YB, Tang YX. The R2R3-MYB transcription factor gene family in maize. PLoS One. 2012;7(6):e37463. pmid:22719841
  26. 26. Hajiebrahimi A, Owji H, Hemmati S. Genome-wide identification, functional prediction, and evolutionary analysis of the R2R3-MYB superfamily in Brassica napus. Genome. 2017;60(10):797–814. pmid:28732175
  27. 27. Wu W, Yang YL, He WM, Rouard M, Li WM, Xu M, et al. Whole genome sequencing of a banana wild relative Musa itinerans provides insights into lineage-specific diversification of the Musa genus. Scientific Reports. 2016;6(31586):srep31586.
  28. 28. Castañeda NEN, Alves GSC, Almeida RM, Amorim EP, Fortes Ferreira C, Togawa RC, et al. Gene expression analysis in Musa acuminata during compatible interactions with Meloidogyne incognita. Annals of Botany. 2017;119(5):915–30. pmid:28130221
  29. 29. Song C, Yang Y, Yang T, Ba L, Zhang H, Han Y, et al. MaMYB4 recruits histone deacetylase MaHDA2 and modulates the expression of omega-3 fatty acid desaturase genes during cold stress response in banana fruit. Plant & cell physiology. 2019; 60(11):2410–2422. pmid:31340013
  30. 30. Du H, Yang SS, Liang Z, Feng BR, Liu L, Huang YB, et al. Genome-wide analysis of the MYB transcription factor superfamily in soybean. BMC Plant Biology. 2012;12(106):106.
  31. 31. Stracke R, Holtgräwe D, Schneider J, Pucker B, Rosleff Sörensen T, Weisshaar B. Genome-wide identification and characterisation of R2R3-MYB genes in sugar beet (Beta vulgaris). BMC Plant Biology. 2014;14(249).
  32. 32. Chaw SM, Chang CC, Chen HL, Li WH. Dating the monocot-dicot divergence and the origin of core eudicots using whole chloroplast genomes. Journal of Molecular Evolution. 2004;58(4):424–41. pmid:15114421
  33. 33. Du H, Liang Z, Zhao S, Nan MG, Phan Tran LS, Lu K, et al. The Evolutionary History of R2R3-MYB Proteins Across 50 Eukaryotes: New Insights Into Subfamily Classification and Expansion. Scientific Reports. 2015;5(11037).
  34. 34. Aharoni A, De Vos CHR, Wein M, Sun Z, Greco R, Kroon A, et al. The strawberry FaMYB1 transcription factor suppresses anthocyanin and flavonol accumulation in transgenic tobacco. The Plant Journal. 2001;28(3):319–32. pmid:11722774
  35. 35. Baum D. Reading a Phylogenetic Tree: The Meaning of Monophyletic Groups. Nature Education. 2008;1(1):190.
  36. 36. Zhong Y, Yin H, Sargent DJ, Malnoy M, Cheng ZM. Species-specific duplications driving the recent expansion of NBS-LRR genes in five Rosaceae species. BMC Genomics. 2015;16(77):s12864-015-1291-0.
  37. 37. Mehta RS, Bryant D, Rosenberg NA. The probability of monophyly of a sample of gene lineages on a species tree. Proceedings of the National Academy of Sciences of the United States of America. 2016;113(29):8002–9. pmid:27432988
  38. 38. Oppenheimer DG, Herman PL, Sivakumaran S, Esch J, Marks MD. A myb gene required for leaf trichome differentiation in Arabidopsis is expressed in stipules. Cell. 1991;67:483–93. pmid:1934056
  39. 39. Lee MM, Schiefelbein J. WEREWOLF, a MYB-related protein in Arabidopsis, is a position-dependent regulator of epidermal cell patterning. Cell. 1999;24(5):473–83.
  40. 40. Wilkins O, Nahal H, Foong J, Provart NJ, Campbell MM. Expansion and diversification of the Populus R2R3-MYB family of transcription factors. Plant Physiology. 2009;149(2):981–93. pmid:19091872
  41. 41. Brockington SF, Alvarez-Fernandez R, Landis JB, Alcorn K, Walker RH, Thomas MM, et al. Evolutionary analysis of the MIXTA gene family highlights potential targets for the study of cellular differentiation. Molecular Biology and Evolution. 2013;30(3):526–40. pmid:23188591
  42. 42. Sonderby IE, Hansen BG, Bjarnholt N, Ticconi C, Halkier BA, Kliebenstein DJ. A systems biology approach identifies a R2R3 MYB gene subfamily with distinct and overlapping functions in regulation of aliphatic glucosinolates. PLoS One. 2007;2(12):e1322. pmid:18094747
  43. 43. Gigolashvili T, Berger B, Mock HP, Muller C, Weisshaar B, Flugge UI. The transcription factor HIG1/MYB51 regulates indolic glucosinolate biosynthesis in Arabidopsis thaliana. The Plant Journal. 2007;50(5):886–901. pmid:17461791
  44. 44. Yanhui C, Xiaoyuan Y, Kun H, Meihua L, Jigang L, Zhaofeng G, et al. The MYB transcription factor superfamily of Arabidopsis: expression analysis and phylogenetic comparison with the rice MYB family. Plant Molecular Biology. 2006;60(1):107–24. pmid:16463103
  45. 45. Zimmermann IM, Heim MA, Weisshaar B, Uhrig JF. Comprehensive identification of Arabidopsis thaliana MYB transcription factors interacting with R/B-like BHLH proteins. The Plant Journal. 2004;40:22–34. pmid:15361138
  46. 46. Chen J, Li F, Li Y, Wang Y, Wang C, Yuan D, et al. Exogenous procyanidin treatment delays senescence of harvested banana fruit by enhancing antioxidant responses and in vivo procyanidin content. Postharvest Biology and Technology. 2019;158.