GmFT2a Polymorphism and Maturity Diversity in Soybeans

  • Bingjun Jiang ,

    Contributed equally to this work with: Bingjun Jiang, Yanlei Yue

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

  • Yanlei Yue ,

    Contributed equally to this work with: Bingjun Jiang, Yanlei Yue

    Affiliations MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China, Mudanjiang Branch of Heilongjiang Academy of Agricultural Sciences, Mudanjiang, Heilongjiang, China

  • Youfei Gao,

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

  • Liming Ma,

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

  • Shi Sun,

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

  • Cunxiang Wu,

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

  • Wensheng Hou,

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

  • Hon-Ming Lam,

    Affiliation Center for Soybean Research, State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong

  • Tianfu Han

    tianfuhan@hotmail.com

    Affiliation MOA Key Laboratory of Soybean Biology (Beijing), Institute of Crop Sciences, The Chinese Academy of Agricultural Sciences, Beijing, China

Abstract

Background

Soybean is a short-day crop of agricultural, ecological, and economic importance. The sensitive photoperiod responses significantly limit its breeding and adaptation. GmFT2a, a putative florigen gene with different transcription profiles in two cultivars (late-maturing Zigongdongdou and early-maturing Heihe 27) with different maturity profiles, is key to flowering and maturation. However, up to now, its role in the diverse patterns of maturation in soybeans has been poorly understood.

Methods

Eighty varieties, including 19 wild accessions, covering 11 of all 13 maturity groups, were collected. They were planted in pots and maintained under different photoperiodicity conditions (SD, short day; LD, long day; and ND, natural day). The day to first flowering was recorded and the sensitivity to photoperiod was investigated. Polymorphisms in the GmFT2a coding sequence were explored by searching the known SNP database (NCBI dbSNP). The GmFT2a promoter regions were then cloned from these varieties and sequenced. Further polymorphism and association analyses were conducted.

Results

These varieties varied greatly in time to first flowering under ND and exhibited a continuous distribution of photoperiod sensitivity, which suggested rich diversity in flowering time. Furthermore, although GmFT2a had only one known synonymous SNP in the coding sequence, there were 17 haplotypes of the GmFT2a promoter region, of which HT06 was by far the most abundant. Further association analysis found some SNPs that might be associated with day to first flowering and photoperiod sensitivity.

Conclusion

Although GmFT2a is a key flowering gene, GmFT2a polymorphism does not appear to be responsible for maturity diversity in soybean.

Introduction

Different soybean cultivars exhibit different maturity patterns and sensitivities to photoperiod, which are related to their adaptation to different ecological environments. For practical reasons, soybean breeders have categorized soybean cultivars into different "maturity groups". For instance, soybeans in North America were classified into 13 maturity groups (MG): MG000 to MGX [1,2]. Chinese soybean researchers, on the other hand, have divided cultivars into 12 MGs based on the environments and planting patterns in China [3,4]. Commercial cultivars with desirable traits that belong to a particular MG are often limited in their geographical range of cultivation. It is therefore important to gain a better understanding of the genetic control of photoperiodism and maturity in soybean.

Photoperiod responses and maturity patterns in soybean are quantitative traits controlled by multiple genes or loci. To date, nine maturity loci have been reported: E1-E8 and J [5-13]. These loci have been comprehensively reviewed by Xia et al. [14]. They play different roles under different photoperiods, with stronger effects under long-day and weaker effects under short-day conditions [15]. Four of these loci have been characterized at the molecular level using map-based or candidate-based cloning. E1 encodes a soybean-specific potential transcription factor, Glyma06g23040 [16]; E2 encodes a GIGANTEA homologue, GmGIa [17]; and E3 and E4 encode the phytochromes GmPhyA3 [18] and GmPhyA2 [19]. However, the genes corresponding to the five remaining loci have not been identified, and the exact functions of the four identified loci remain unclear.

The key flowering gene Flowering Locus T (FT) in Arabidopsis thaliana encodes a putative florigen that is an integrating factor of the flowering regulation network [20,21]. Soybean has at least 10 FT-like genes [22,23], among which, GmFT2a and GmFT5a could functionally promote flowering in A. thaliana [23,24]. Furthermore, it was observed that GmFT2a overexpression could induce early flowering in transgenic soybean [24]. Therefore, it is surprising that these two important flowering genes have not been considered as candidate genes for the five unidentified maturity loci. GmFT2a is regulated by photoperiod differentially in two cultivars exhibiting different photoperiod sensitivities (photoperiod-sensitive Zigongdongdou and photoperiod-insensitive Heihe 27) [24], suggesting that the function of GmFT2a might be related to the regulation of maturity. While it is speculated that the expression of GmFT2a may be developmentally regulated [24], the promoter region of GmFT2a has not been thoroughly analyzed.

The release of the soybean reference genome [25] has provided a new platform for breeding and molecular research. In addition, the resequencing of 31 wild and cultivated soybean genomes (designated as 31-Soybean Resequencing Project in this paper) has further characterized genome-wide genetic variations [26]. These studies may provide tools to address the question of how the polymorphisms in GmFT2a and its flanking sequences might function in the diversification of flowering and maturity time in soybeans.

In this study, the soybean accessions examined were originally cultivated or collected in China and North America and cover most MGs, from MG000 to MGVIII. Sequence polymorphisms in the GmFT2a coding sequence and the GmFT2a promoter were analyzed, and the possible roles of GmFT2a and its potential application in breeding are discussed.

Materials and Methods

1: Plant materials and photoperiod treatments

Eighty varieties were collected in China and North America (Table S1 in File S1). Soybean seeds were planted in soil in 10-liter pots and grown under natural day (ND) conditions. After germination, seedlings of uniform size were selected so that each pot finally contained five uniform plants. The seedlings were grown in natural sunlight until the cotyledons opened, and were then separated into groups and grown under different photoperiods (LD, 16 h light/8 h dark; SD, 12 h light/12 h dark; and ND). Additional details regarding plant growth and treatments were as reported before [27]. The day to first flowering of each plant was recorded as the number of days from the expansion of unifoliates to first flowering (DEUFF), and 15 plants in three pots were investigated for each variety under each treatment. Photoperiod sensitivity (PS) was calculated as described previously [28].
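The PS formula is given in [28] and is not restated in this paper. As a hedged illustration only, the sketch below assumes PS = (DEUFF under LD − DEUFF under SD) / DEUFF under LD; this definition is an assumption, chosen because it closely reproduces the PS values reported in Table 1 from the SD and LD means. The function and variable names are hypothetical.

```python
def photoperiod_sensitivity(deuff_sd, deuff_ld):
    """Assumed PS definition: relative delay of first flowering under LD versus SD."""
    if deuff_ld <= 0:
        raise ValueError("deuff_ld must be positive")
    return (deuff_ld - deuff_sd) / deuff_ld


# Mean DEUFF under SD, mean DEUFF under LD, and the PS reported in Table 1.
TABLE1_SUBSET = {
    "CS02": (22.6, 23.9, 0.056),
    "CS20": (22.0, 46.9, 0.531),
    "CS38": (24.9, 88.9, 0.720),
    "H05": (22.4, 116.4, 0.810),
}


def compare_with_table(rows):
    """Return {variety: (PS computed from the means, PS reported in Table 1)}."""
    return {
        name: (round(photoperiod_sensitivity(sd, ld), 3), reported)
        for name, (sd, ld, reported) in rows.items()
    }


for name, (computed, reported) in compare_with_table(TABLE1_SUBSET).items():
    print(f"{name}: computed {computed:.3f}, reported {reported:.3f}")
```

Small differences between computed and reported values are expected, because this sketch works from rounded treatment means rather than from per-plant records.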

2: DNA Extraction, PCR, and Sequencing

Genomic DNA was isolated using the TianGen (Beijing, China) New Plant Genomic DNA Isolation Kit (DP320). Two PCR primers, GmFT2a-5-N2300 (5’-AAGTAAATTATTTTCCCCTTATTTCCTATC-3’) and GmFT2a-3-P165 (5’-CAAAGTATAGAAGTTCCTGAGGTCATCA-3’), were used to amplify the GmFT2a promoter region. The resulting PCR product was cloned into the pMD18-T simple vector (Takara, Dalian, China) or the pZeroBack/blunt vector (TianGen, Beijing, China). Sanger sequencing was then performed at the National Key Facility for Crop Gene Resources, Institute of Crop Science, The Chinese Academy of Agricultural Sciences, China. In addition to the vector-specific sequencing primers, the primers GmFT2a-5-N1655 (5’-ACAGTGCATGTGGGAGGCAAATCGGCATAT-3’) and GmFT2a-3-N350 (5’-CACATCCCTTCCATCTTCTCATTTTCTC-3’) were used in DNA sequencing. All new sequencing data (List S1 in File S1) have been deposited in GenBank.

3: Bioinformatics analysis

The genomic sequences were aligned using ClustalW 2.0.9 [29]. The alignment was adjusted manually and input into MEGA 5 [30] for calculation of nucleotide diversity and Tajima’s D statistics. It was also input into TASSEL [31] to estimate linkage disequilibrium and identify SNP-trait associations by generating a general linear model (GLM). The phylogenetic relationships among the 17 haplotypes were inferred using the NJ method in MEGA 5 [30].
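The diversity statistics named above can be illustrated with a minimal sketch of the standard textbook formulas for nucleotide diversity (π), the Watterson estimator (θ), and Tajima's D. This is not MEGA's implementation: it assumes a gap-free alignment of equal-length sequences, and the function name and toy alignment are hypothetical.

```python
from itertools import combinations
from math import sqrt


def diversity_stats(seqs):
    """Per-site pi, per-site Watterson theta, and Tajima's D for a gap-free alignment."""
    n, length = len(seqs), len(seqs[0])
    if n < 2 or any(len(s) != length for s in seqs):
        raise ValueError("need at least two aligned sequences of equal length")

    # S: number of segregating (polymorphic) columns.
    seg_sites = sum(1 for i in range(length) if len({s[i] for s in seqs}) > 1)
    if seg_sites == 0:
        raise ValueError("Tajima's D is undefined without segregating sites")

    # k: mean number of pairwise differences over all sequence pairs.
    pairs = list(combinations(seqs, 2))
    k = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)

    # Constants from Tajima's variance formula.
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    theta_w = seg_sites / a1
    d = (k - theta_w) / sqrt(e1 * seg_sites + e2 * seg_sites * (seg_sites - 1))
    return {"S": seg_sites, "pi": k / length, "theta": theta_w / length, "tajimas_d": d}


# Toy alignment (hypothetical): three singleton variants, i.e. an excess of rare alleles.
toy = ["AAAA", "AAAT", "AATA", "ATAA"]
print(diversity_stats(toy))
```

An excess of low-frequency variants, as in the toy alignment, pulls π below θ and therefore gives a negative D.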

Results

The selected soybean population exhibited a continuous spectrum of photoperiod sensitivities

The plants investigated in this study were collected/cultivated in China and North America. The selected cultivars include a wide range of maturity types, from MG000 to MGVIII, comprising 11 of the 13 defined maturity groups [1] and representing a diverse population adapted to different geographic regions. As shown in Table 1 and Figure 1, the DEUFF varied from 19.7 to 32.8 days under SD conditions, from 20.0 to 122.3 days under ND conditions, and from 22.9 to 116.4 days under LD conditions, except for two accessions, CS36 and ZG, which failed to flower at all under LD conditions. The variation of DEUFF was reduced under SD conditions and enhanced under LD conditions (Figure 1), which was not surprising for the SD crop soybean. The population also exhibited a rich diversity of PS, increasing from 0.056 (CS02) to 0.81 (H05) (Figure 1).

Cultivar  MG    DEUFF (SD)  DEUFF (ND)  DEUFF (LD)  PS
CS01      000   21.4±1.1    20.4±0.7    22.9±2.4    0.063
CS02      000   22.6±1.1    20.9±1.4    23.9±2.2    0.056
CS03      00    21.4±1.5    24.7±1.7    26.2±3.3    0.182
CS06      00    21.4±2.0    20.0±0.6    23.2±0.9    0.081
CS07      0     21.7±2.1    22.0±2.5    27.0±5.4    0.195
CS08      0     20.6±1.3    20.1±0.6    22.9±1.7    0.098
CS09      0     24.1±3.2    25.5±0.9    28.1±4.1    0.142
CS10      -     22.6±2.7    29.8±1.9    32.9±3.4    0.314
CS12      I     22.8±1.7    28.5±2.1    33.2±1.6    0.314
CS13      I     24.8±1.6    32.5±1.5    42.7±3.4    0.420
CS14      I     25.1±3.0    27.9±2.3    34.6±3.1    0.275
CS15      II    22.2±2.3    28.6±2.1    33.1±2.5    0.330
CS16      II    24.8±1.9    28.7±2.6    34.7±4.7    0.285
CS19      II    26.7±2.1    31.2±2.6    40.1±3.7    0.335
CS20      III   22.0±1.9    31.2±3.2    46.9±5.2    0.531
CS21      III   26.2±2.1    35.6±6.0    46.3±0.5    0.434
CS22      III   25.1±0.4    31.5±0.8    40.5±1.6    0.380
CS23      III   25.3±2.1    39.4±1.0    49.9±3.1    0.493
CS24      IV    21.3±1.6    34.1±6.0    46.7±4.2    0.544
CS25      IV    23.7±2.7    36.7±3.1    47.9±3.6    0.505
CS26      IV    23.2±2.2    45.2±2.5    52.5±5.0    0.559
CS29      V     25.0±0.8    60.2±1.2    83.3±0.6    0.700
CS30      V     27.9±1.4    67.1±1.4    87.9±1.1    0.683
CS31      V     26.4±1.3    67.2±0.8    86.9±0.8    0.696
CS32      VI    29.5±1.0    67.0±0.5    84.0±0.0    0.649
CS33      VI    26.3±2.0    68.1±1.3    89.0±2.3    0.705
CS34      VI    24.1±0.8    82.1±3.3    94.7±0.5    0.745
CS35      VI    25.0±0.8    83.8±1.4    94.1±0.6    0.734
CS36      VII   24.7±0.8    83.6±0.5    NaN         NaN
CS37      VII   25.1±0.7    72.6±1.3    88.6±1.1    0.717
CS38      VII   24.9±0.4    73.9±2.2    88.9±1.9    0.720
CS39      VII   24.9±0.6    84.6±1.1    89.0±1.7    0.720
CS40      VIII  26.9±1.3    83.2±1.1    108.2±8.9   0.752
CS41      VIII  26.7±2.4    83.9±1.4    90.6±0.5    0.706
CS43      VIII  27.6±0.7    73.9±3.1    88.4±0.5    0.688
CS47      -     23.1±0.9    30.2±1.4    37.1±5.0    0.376
CS48      -     23.1±1.7    28.9±2.2    37.8±6.5    0.388
CS52      -     22.9±0.8    31.9±1.9    47.2±3.8    0.515
CS54      -     24.1±1.1    44.1±1.8    54.3±3.0    0.556
CS58      -     27.0±0.8    45.0±2.7    50.2±1.9    0.462
CS59      -     28.0±1.7    49.5±1.4    61.3±3.2    0.543
H01       -     20.7±0.7    48.3±2.0    82.6±0.9    0.750
H02       -     23.5±0.5    71.1±2.1    80.1±2.1    0.710
H03       -     21.2±1.1    29.5±1.0    82.1±0.8    0.740
H04       -     25.5±1.2    71.4±1.6    94.6±1.0    0.730
H05       -     22.4±1.0    75.9±1.6    116.4±1.2   0.810
H06       -     19.9±0.8    28.8±1.1    54.2±2.3    0.630
H07       -     22.6±0.6    78.6±0.7    115.1±1.3   0.800
H08       -     21.7±0.6    28.7±1.0    56.1±3.7    0.610
H09       -     22.1±0.7    58.9±0.9    72.0±5.3    0.690
H10       -     21.3±0.6    25.5±1.1    51.5±6.1    0.590
H11       -     19.9±0.8    49.2±6.2    61.5±4.6    0.680
H12       -     21.8±0.7    63.1±1.5    88.0±0.0    0.750
H13       -     19.7±0.7    37.0±0.8    81.4±2.6    0.760
H14       -     20.1±0.8    58.5±1.1    79.0±0.7    0.750
H15       -     21.3±0.6    58.0±0.7    88.8±1.3    0.760
H16       -     22.5±0.7    64.5±1.2    106.3±1.1   0.790
HH        -     21.2±1.7    22.8±0.9    23.6±1.3    0.102
ZG        -     32.8±0.8    122.3±0.8   NaN         NaN

Table 1. Days from expansion of unifoliates to first flowering (DEUFF) and photoperiod sensitivity (PS).

MG, maturity group (-, none listed); SD, short day; ND, natural day; and LD, long day. NaN, no flowering under LD.
Figure 1. Days to first flowering under different photoperiod conditions and photoperiod sensitivities of different soybean varieties.

Left, the number of days from expansion of unifoliates to first flowering (DEUFF) under different photoperiod conditions (SD, short day; ND, natural day; and LD, long day). Right, sorted photoperiod sensitivities of different soybean varieties.

https://doi.org/10.1371/journal.pone.0077474.g001

The GmFT2a coding sequence is highly conserved

Investigation of the SNP data from the 31-Soybean Resequencing Project [26] revealed that there was only one SNP site in the GmFT2a coding sequence. It is a synonymous A/T SNP (named ss249156869) located at position 30746204 on chromosome Gm16 (Figure S1 in File S1). Further resequencing of over-100 cultivated soybean genomes did not identify any SNP in the GmFT2a coding sequence (unpublished data). Therefore, the GmFT2a coding sequence is highly conserved and the diversity in flowering time and maturation time in soybeans is not a result of polymorphism in the coding region of GmFT2a.

The GmFT2a promoter region harbors rich polymorphisms

The GmFT2a promoter (about 2.3 kb) from each of the 80 soybean accessions used in this study was cloned and Sanger sequenced (available in GenBank under accession numbers KF573201 - KF573362). Considering that the soybean genome is palaeopolyploid and contains 10 FT-like genes [23,32], we confirmed each sequencing result with BLAST against the genome of Williams 82 (www.phytozome.org). The nucleotide diversity was analyzed with TASSEL v2.1 [31]. In total, 15 SNPs and 16 InDels were detected in the 2,489 aligned base pairs, with six InDels and two SNPs contained within two longer InDels (Table S2 in File S1). For the whole sequenced population, an average difference of 4.7 SNPs per kilobase (π=0.0047) was found between two samples (Table 2). For the subpopulations of North American cultivars, Chinese cultivars, and wild soybeans from China, the values were 4.4, 4.8, and 5.4 SNPs per kilobase, respectively (Table 2). This showed that the GmFT2a promoter region is more diverse in the soybeans from China than in those from North America. In contrast, the Watterson estimator (θ) value was higher in the North American subpopulation than in the other two subpopulations (Table 2). Tajima’s D values were all significantly negative (P<0.001) in the whole population and in each subpopulation (Table 2). Leaving aside the two SNP sites located inside InDels, the 13 independent SNP sites were compared with those found in the 31-Soybean Resequencing Project [26]. Ten sites were common, three sites were newly found, and seven sites were missing. The missing sites were each close to either an insertion or a deletion (Table 3).

              Whole      Cultivated (N. America)  Cultivated (China)  Wild
π             0.0047     0.00441                  0.0048              0.0054
θ             0.04551    0.03129                  0.01529             0.01735
Tajima's D    −2.91836   −2.93295                 −2.60104            −2.52622

Table 2. Summary of DNA polymorphic sites in the GmFT2a promoter region.

π, average nucleotide differences per site between the two sequences; θ, Watterson estimator; Tajima’s D, test for neutral selection (significant at P<0.001).
Position  31-Soybean Resequencing  GmFT2a promoter sequence
18        30739527                 S17
163       30739672                 S162
225       30739733                 -
321       30739826                 S320
456       30739961                 S455
676       30740180                 -
1150      30740650                 S1149
1459      30740952                 S1458
1494      30740984                 -
1581      30741026                 S1580
1737      30741179                 -
1740      30741180                 -
1743      30741183                 -
1744      30741184                 -
1845      30741283                 S1844
1912      -                        S1912
1931      30741349                 S1930
1945      30741363                 S1944
2033      -                        S2032
2229      -                        S2228

Table 3. Comparison of SNP sites from the 31-Soybean Resequencing Project and those seen in the GmFT2a promoter sequences from the present study.
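The tally of common, newly found, and missing sites can be checked directly from the rows of Table 3. The sketch below transcribes the table and classifies each position by which of the two sources reports it; the variable and function names are hypothetical.

```python
from collections import Counter

# (alignment position, coordinate in the 31-Soybean Resequencing data, site in this study)
TABLE3 = [
    (18, 30739527, "S17"), (163, 30739672, "S162"), (225, 30739733, None),
    (321, 30739826, "S320"), (456, 30739961, "S455"), (676, 30740180, None),
    (1150, 30740650, "S1149"), (1459, 30740952, "S1458"), (1494, 30740984, None),
    (1581, 30741026, "S1580"), (1737, 30741179, None), (1740, 30741180, None),
    (1743, 30741183, None), (1744, 30741184, None), (1845, 30741283, "S1844"),
    (1912, None, "S1912"), (1931, 30741349, "S1930"), (1945, 30741363, "S1944"),
    (2033, None, "S2032"), (2229, None, "S2228"),
]


def classify_sites(rows):
    """Count sites seen in both data sets, only in this study, or only in the resequencing data."""
    counts = Counter()
    for _position, resequencing, promoter in rows:
        if resequencing is not None and promoter is not None:
            counts["common"] += 1
        elif promoter is not None:
            counts["new"] += 1
        else:
            counts["missing"] += 1
    return dict(counts)


print(classify_sites(TABLE3))
```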


GmFT2a promoter region exhibits 17 haplotypes

Although there were many polymorphisms in the GmFT2a promoter region, no linkage disequilibrium was detected in this region (Figure 2 and Figure S2 in File S1). A total of 17 haplotypes (HT01-HT17), with ten SNP sites and six InDels, were found in these 80 accessions (Table 4). HT06 was the major haplotype, accounting for 62 accessions; more than two-thirds of the population. In the wild soybeans (H01-H16, J1-J3), 14 haplotypes were included; that is, HT01, HT02, HT04, HT05, HT06, HT07, HT08, HT09, HT10, HT11, HT12, HT13, HT14, and HT16 (Table 5). The cultivated soybeans included six haplotypes, HT02, HT03, HT04, HT06, HT15, and HT17, of which haplotypes HT03, HT15, and HT17 were not found in the wild accessions (Table 5). These haplotypes were further analyzed phylogenetically. An NJ tree showed the division of the haplotypes into two major clusters. One cluster contained 11 haplotypes; that is, HT02, HT03, HT04, HT05, HT06, HT07, HT08, HT10, HT11, HT12, and HT14; the other cluster included six haplotypes, HT01, HT09, HT13, HT15, HT16, and HT17 (Figure 3).

Figure 2. Linkage disequilibrium decay: GmFT2a promoter region.

https://doi.org/10.1371/journal.pone.0077474.g002

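Haplotype  S17  S162  D231  S320  S1149  S1458  D1496  S1580  D1737  S1844  S1849  D1849  S1912  S1930  D2014  D2263  Variety number
HT01       A    A     4     A     T      T      1      A      0      G      -      20     G      G      3      0      1
HT02       A    C     0     C     C      A      44     T      2      G      -      20     A      A      3      0      1
HT03       A    C     0     C     T      A      44     -      2      G      -      20     A      A      3      0      1
HT04       A    C     0     C     T      A      44     G      2      G      -      20     A      A      3      0      2
HT05       A    C     0     C     T      A      44     T      0      G      -      20     A      A      3      0      1
HT06       A    C     0     C     T      A      44     T      2      G      -      20     A      A      3      0      62
HT07       A    C     4     C     C      A      44     T      0      G      -      20     G      G      3      0      1
HT08       A    C     4     C     C      T      1      A      0      A      C      0      G      G      3      0      2
HT09       A    C     4     C     C      T      1      A      0      G      -      20     G      G      4      0      1
HT10       A    C     4     C     T      A      44     G      0      G      -      20     G      G      3      0      1
HT11       A    C     4     C     T      A      1      A      0      G      -      20     G      G      3      0      1
HT12       A    C     4     C     T      T      1      A      0      A      -      20     G      G      3      0      1
HT13       A    C     4     C     T      T      1      A      0      G      -      20     G      G      0      0      1
HT14       A    C     4     C     T      T      1      T      0      A      -      20     G      G      3      0      1
HT15       G    T     4     A     T      T      1      A      0      G      -      20     G      G      4      10     5
HT16       G    T     4     A     T      T      1      A      0      G      -      20     G      G      4      0      1
HT17       G    T     4     C     T      T      1      A      0      A      T      0      G      G      3      0      4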

Table 4. Haplotypes of the GmFT2a promoter region in 80 varieties.
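Conceptually, a haplotype table of this kind is obtained by collapsing accessions that carry identical alleles at every scored site. The sketch below illustrates that grouping step on toy data; it is not the authors' pipeline, the accession names and allele calls are hypothetical, and it assumes one allele vector per accession (accessions with two haplotypes would need one entry per sequenced clone).

```python
from collections import defaultdict


def call_haplotypes(alleles_by_accession):
    """Group accessions sharing the same allele vector; label groups by decreasing size."""
    groups = defaultdict(list)
    for accession, alleles in alleles_by_accession.items():
        groups[tuple(alleles)].append(accession)
    # Largest group first; ties broken by the allele vector so labels are deterministic.
    ordered = sorted(groups.items(), key=lambda item: (-len(item[1]), item[0]))
    return {
        f"hap{index}": (alleles, sorted(members))
        for index, (alleles, members) in enumerate(ordered, start=1)
    }


# Hypothetical calls at three sites (two SNPs and one InDel length).
toy_calls = {
    "acc1": ("A", "C", "0"),
    "acc2": ("A", "C", "0"),
    "acc3": ("G", "T", "4"),
    "acc4": ("A", "C", "4"),
}

for label, (alleles, members) in call_haplotypes(toy_calls).items():
    print(label, alleles, members)
```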

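Group                  Variety  MG    Haplotype
N. American cultivars  CS01     000   HT06
N. American cultivars  CS02     000   HT06
N. American cultivars  CS03     00    HT06
N. American cultivars  CS04     00    HT06
N. American cultivars  CS05     00    HT06
N. American cultivars  CS06     00    HT06
N. American cultivars  CS07     0     HT06
N. American cultivars  CS08     0     HT06
N. American cultivars  CS09     0     HT06
N. American cultivars  CS10     -     HT06
N. American cultivars  CS12     I     HT06
N. American cultivars  CS13     I     HT06
N. American cultivars  CS14     I     HT06
N. American cultivars  CS15     II    HT06
N. American cultivars  CS16     II    HT06
N. American cultivars  CS17     II    HT06
N. American cultivars  CS18     II    HT06
N. American cultivars  CS19     II    HT06
N. American cultivars  CS20     III   HT06
N. American cultivars  CS21     III   HT17
N. American cultivars  CS22     III   HT06
N. American cultivars  CS23     III   HT06
N. American cultivars  CS24     IV    HT06
N. American cultivars  CS25     IV    HT03, HT06
N. American cultivars  CS26     IV    HT06
N. American cultivars  CS27     -     HT06
N. American cultivars  CS28     V     HT06
N. American cultivars  CS29     V     HT06
N. American cultivars  CS30     V     HT06
N. American cultivars  CS31     V     HT02, HT06
N. American cultivars  CS32     VI    HT06
N. American cultivars  CS33     VI    HT04, HT06
N. American cultivars  CS34     VI    HT15
N. American cultivars  CS35     VI    HT06
N. American cultivars  CS36     VII   HT06
N. American cultivars  CS37     VII   HT06
N. American cultivars  CS38     VII   HT06
N. American cultivars  CS39     VII   HT06
N. American cultivars  CS40     VIII  HT06
N. American cultivars  CS41     VIII  HT15
N. American cultivars  CS42     VIII  HT15
N. American cultivars  CS43     VIII  HT06
N. American cultivars  JU       -     HT17
N. American cultivars  WM82     -     HT06
Chinese cultivars      CS46     -     HT06
Chinese cultivars      CS47     -     HT06
Chinese cultivars      CS48     -     HT06
Chinese cultivars      CS49     -     HT06
Chinese cultivars      CS50     -     HT06
Chinese cultivars      CS51     -     HT15
Chinese cultivars      CS52     -     HT06
Chinese cultivars      CS53     -     HT06
Chinese cultivars      CS54     -     HT06
Chinese cultivars      CS58     -     HT06
Chinese cultivars      CS59     -     HT15
Chinese cultivars      CS60     -     HT17
Chinese cultivars      CS61     -     HT06
Chinese cultivars      CS62     -     HT06
Chinese cultivars      CS63     -     HT06
Chinese cultivars      HH       -     HT06
Chinese cultivars      ZG       -     HT17
Wild soybeans          H01      -     HT05, HT06
Wild soybeans          H02      -     HT01
Wild soybeans          H03      -     HT06
Wild soybeans          H04      -     HT06
Wild soybeans          H05      -     HT06
Wild soybeans          H06      -     HT06
Wild soybeans          H07      -     HT10, HT11
Wild soybeans          H08      -     HT13
Wild soybeans          H09      -     HT07
Wild soybeans          H10      -     HT09
Wild soybeans          H11      -     HT06
Wild soybeans          H12      -     HT08
Wild soybeans          H13      -     HT06
Wild soybeans          H14      -     HT04, HT06
Wild soybeans          H15      -     HT06
Wild soybeans          H16      -     HT08
Wild soybeans          J1       -     HT12, HT14
Wild soybeans          J2       -     HT16
Wild soybeans          J3       -     HT06

Table 5. Haplotypes of the GmFT2a promoter region in 80 soybean varieties.

MG, maturity group (-, none listed).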
Figure 3. Neighbor-Joining tree depicting the phylogenetic relationships between 17 haplotypes of the GmFT2a promoter region.

https://doi.org/10.1371/journal.pone.0077474.g003

Several SNPs show some relationship with DEUFF and photoperiod sensitivity

Association analysis was done using the GLM (Table 6). At a significance level of p<0.01, SNP S17 showed a relationship with DEUFF under SD and ND; SNPs S162 and S1849 were associated with DEUFF under SD; and InDel D272 was associated with DEUFF under LD. Association with PS was also analyzed (Table 6).

Traits: DEUFF, SD; DEUFF, ND; DEUFF, LD; PS (a p-value and an R2 value are reported per trait)
Site    p-value R2 (one pair per associated trait)
S17     0.0004 0.1972   0.0091 0.1135
S162    0.0021 0.1972   0.0185 0.1328
D231    0.0135 0.1023   0.0299 0.0829   0.0217 0.0921
D272    0.0254 0.2816   0.0037 0.3608   0.0001 0.4595
S320    0.0437 0.0695
D776    0.0377 0.1105
S1458   0.0416 0.0709
D1496   0.0453 0.1047   0.0383 0.1138
D1737   0.0482 0.1026   0.0371 0.1149
S1844   0.0465 0.0678
S1849   0.0036 0.1818
D1849   0.0465 0.0678
S1912   0.0135 0.1023   0.0299 0.0829   0.0217 0.0921
S1930   0.0135 0.1023   0.0299 0.0829   0.0217 0.0921

Table 6. General linear model association of SNP and InDel sites.

DEUFF, days from expansion of unifoliates to first flowering; SD, short day; ND, natural day; LD, long day; PS, photoperiod sensitivity.
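For a single marker, a general linear model with genotype as the only factor reduces to a one-way analysis of variance: R2 is the share of trait variance explained by allele class, and the F statistic tests whether the class means differ. The sketch below illustrates that calculation only. It is not TASSEL's implementation, it omits any additional model terms, and the toy genotypes and trait values are hypothetical.

```python
from collections import defaultdict


def single_marker_glm(genotypes, trait):
    """Return (R2, F) for a one-factor linear model: trait ~ allele class."""
    if len(genotypes) != len(trait):
        raise ValueError("genotypes and trait must have the same length")
    groups = defaultdict(list)
    for allele, value in zip(genotypes, trait):
        groups[allele].append(value)
    n, k = len(trait), len(groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two allele classes and residual degrees of freedom")

    grand_mean = sum(trait) / n
    # Between-class and within-class sums of squares.
    ss_between = sum(len(v) * (sum(v) / len(v) - grand_mean) ** 2 for v in groups.values())
    ss_within = sum((x - sum(v) / len(v)) ** 2 for v in groups.values() for x in v)
    if ss_within == 0:
        raise ValueError("no residual variance; F is undefined")

    r_squared = ss_between / (ss_between + ss_within)
    f_stat = (ss_between / (k - 1)) / (ss_within / (n - k))
    return r_squared, f_stat


# Hypothetical alleles at one SNP and days to first flowering for six accessions.
toy_genotypes = ["A", "A", "A", "G", "G", "G"]
toy_deuff = [20.0, 21.0, 22.0, 30.0, 31.0, 32.0]
r2, f_stat = single_marker_glm(toy_genotypes, toy_deuff)
print(f"R2 = {r2:.4f}, F = {f_stat:.1f}")
```

A p-value follows from the F distribution with (k − 1, n − k) degrees of freedom, for example via `scipy.stats.f.sf` where SciPy is available; it is left out here to keep the sketch dependency-free.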

Discussion

The soybean population in the present study harbors rich diversity in DEUFF and PS

The soybean population investigated in this study was diverse in terms of geographic source, since the specimens were collected from North America and China and included 11 maturity groups, from MG000 to MGVIII. The samples therefore covered almost all of the 13 MGs [1]. The population showed a rich diversity of DEUFF under the three photoperiod conditions (LD, SD, and ND). The days to first flowering varied much more under LD than under the other two photoperiods (Table 1 and Figure 1). More importantly, DEUFF was distributed continuously across the population: under natural day conditions, varieties reached first flowering every few days from 20 to 85 days after the expansion of unifoliates (Table 1). As for PS, wild soybeans were on average more sensitive than cultivated ones. The PS of wild soybeans varied from 0.590 to 0.810, while that of cultivated soybeans varied from 0.056 to 0.752. This greater range of variation in cultivated soybeans facilitates their adaptation to different ecological environments. The entire population likewise showed a continuous spectrum of PS, from 0.056 to 0.81 (Table 1 and Figure 1). Therefore, the population studied here is richly diversified not only in terms of geographic source but also in terms of phenotype with regard to DEUFF and PS.

GmFT2a is under strong selection

The coding sequence of GmFT2a is highly conserved. A search of the known SNP data from the 31-Soybean Resequencing Project [26] revealed only one synonymous A/T SNP site (ss249156869) in the GmFT2a coding sequence (Figure S1 in File S1). Furthermore, in an ongoing genome resequencing project involving over 100 cultivars, no SNP was found in the GmFT2a coding sequence (unpublished data). Therefore, polymorphism in the GmFT2a coding sequence, which is under such strong selection, is probably not causally related to the diversity of flowering and maturity in soybeans. Given that GmFT2a is involved in flowering transition and maintenance in soybean [24] and that its homolog FT is an integrating factor of the flowering regulation network in A. thaliana [20,21], GmFT2a function is likely to be essential for soybean adaptation.

Unlike the GmFT2a coding sequence, the GmFT2a promoter region is highly diversified. The degree of diversification of this region differs among subpopulations (North American cultivars, Chinese cultivars, and wild soybeans). Wild soybeans were more diversified than cultivated ones, with a higher pairwise nucleotide diversity parameter (π) value (Table 2). Furthermore, the 19 wild soybeans examined in the present study included 14 of the 17 haplotypes, while the 61 cultivated soybeans examined included only six of the 17 haplotypes, supporting the assessment of a higher level of diversification in wild soybeans (Table 5). Examination of all 80 varieties revealed 31 polymorphic sites in the GmFT2a promoter region. It is interesting that there was no significant linkage disequilibrium in such a narrow region, whereas Lu et al. detected linkage disequilibrium in rice Ghd7 [33]. This indicated that the GmFT2a promoter region was highly polymorphic. Considering that GmFT2a is a putative florigen gene that plays central roles in the flowering regulation network, this high degree of polymorphism might facilitate the adaptation of soybeans to different environments and requirements.

The GmFT2a promoter region is also under strong selection. The whole population and each of the three subpopulations considered individually all had significantly negative Tajima’s D values (Table 2). This suggested that, like cultivated soybeans, wild soybeans might also be under positive selection. The negative values might, however, also result from low-frequency mutations or population expansion, and more evidence is required in order to define the selection model. Haplotype analysis also provides evidence for strong selection. A total of 17 haplotypes were defined using 16 stringently selected polymorphic sites. These haplotypes were not evenly distributed. Haplotype HT06 was by far the most predominant. It was found in 62 varieties (10 out of 19 wild soybeans, 13 out of 17 Chinese cultivars, and 39 out of 44 North American cultivars), covering all maturity groups, from MG000 to MGVIII. This also suggests that GmFT2a might be under high selection pressure, indicating a high degree of risk when selecting GmFT2a haplotypes during breeding. On the other hand, high profit tends to stem from high risk. CS59, the currently predominant and widely adapted cultivar Zhonghuang 13, carries HT15; its wide adaptation might have resulted from the selection of HT15. GmFT2a might function as an engine. The development of a strong and suitable engine might be the key to increasing production potential and adaptability.

Polymorphism of GmFT2a is not related to maturity diversity

Further association analysis with GLM did not show a significant association between GmFT2a polymorphism and maturity diversity, which is consistent with the idea that GmFT2a is under strong selection. At the level of p<0.01, SNP S17 showed a relationship with the day to first flowering under SD and ND, while SNPs S162 and S1849 showed such a relationship only under SD. The PLACE program (http://www.dna.affrc.go.jp/PLACE/) identified a CIACADIANLELHC element (CAANNNNATC) near SNP S17 (G/A). Near SNP S1849 (T/C), the program found an IBOXCORENT element (GATAAGR) [34,35]. The CIACADIANLELHC element is associated with circadian expression and the IBOXCORENT element with light-responsive regulation; both are related to photoperiod responses in soybeans. However, more evidence is needed to elucidate how these SNPs function in regulating the photoperiod response. Considering that GmFT2a is under high selection pressure, polymorphism in this gene does not appear to be responsible for maturity diversity.
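A motif scan of the kind PLACE performs can be sketched by translating each IUPAC consensus into a regular expression and searching the sequence. The example below uses the two consensus strings quoted above; everything else is an assumption for illustration: the function names are hypothetical, the sequence is a toy string rather than the GmFT2a promoter, and only the forward strand is scanned.

```python
import re

# IUPAC codes needed for the two consensus strings (N: any base, R: purine).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}

MOTIFS = {"CIACADIANLELHC": "CAANNNNATC", "IBOXCORENT": "GATAAGR"}


def motif_to_regex(motif):
    """Translate an IUPAC consensus string into a regular expression."""
    return "".join(IUPAC[base] for base in motif.upper())


def find_motif(sequence, motif):
    """Return 0-based start positions of all (possibly overlapping) forward-strand matches."""
    pattern = re.compile(f"(?={motif_to_regex(motif)})")
    return [match.start() for match in pattern.finditer(sequence.upper())]


# Toy sequence (hypothetical) containing one copy of each element.
toy_promoter = "TTCAAGTCAATCGGGATAAGATT"
for name, consensus in MOTIFS.items():
    print(name, consensus, find_motif(toy_promoter, consensus))
```

To ask whether a SNP alters an element, the same scan can be run on each allelic version of the flanking sequence and the hit lists compared.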

There are nine maturity loci, E1-E8 and J [14]. E5-E8 and J have not been identified at the molecular level. GmFT2a is under highly stringent selection pressure, indicating that it probably does not correspond to one of the five unknown maturity loci. In addition, GmFT2a has nine paralogous genes, and little is known about these other nine genes [23,24]. Together, the nine maturity loci, GmFT2a and its relatives, and other flowering genes make up a complicated and elaborate flowering regulation network. There are many selection sites in the network that could be utilized in breeding new soybean varieties with good adaptation. GmFT2a functions downstream of other flowering genes, integrating flowering signals to regulate flowering. The predominance of HT06 indicates a core function of GmFT2a as an engine in the network. Therefore, it would be rather difficult to select GmFT2a directly in soybean breeding. Nevertheless, future breeding should pay more attention to GmFT2a as a key element in approaches to breaking the bottleneck of soybean breeding.

Supporting Information

File S1.

A Word document with supplementary materials, including Table S1 and S2, Figure S1 and S2 and List S1.

https://doi.org/10.1371/journal.pone.0077474.s001

(DOC)

Acknowledgments

We thank Randall L. Nelson (Pathology and Genetics Research Unit and Department of Crop Sciences, Soybean/Maize Germplasm, Agricultural Research Service, United States Department of Agriculture, University of Illinois, Urbana, Illinois, United States of America), Lijuan Qiu (Institute of Crop Sciences, Chinese Agricultural Academy of Sciences, China) and Zhangxiong Liu (Institute of Crop Sciences, Chinese Agricultural Academy of Sciences, China) for providing some soybean varieties.

Author Contributions

Conceived and designed the experiments: BJ TH. Performed the experiments: BJ YY YG LM. Analyzed the data: BJ YY YG. Contributed reagents/materials/analysis tools: SS CW WH HL. Wrote the manuscript: BJ TH.

References

  1. 1. Hartwig EE (1970) Growth and reproductive characteristics of soybeans (Glycine max (L.) Merr.) grown under short-day conditions. Trop Sci 12: 47-53.
  2. 2. Zhang LX, Kyei-Boahen S, Zhang J, Zhang MH, Freeland TB et al. (2007) Modifications of optimum adaptation zones for soybean maturity groups in the USA. Crop Manag Available: http://dx.org/10.1094/CM-2007-0927-01-RS.
  3. 3. Wang G (1981) The research about the ecological division of soybean cultivars in China. Sci Agr Sin: 39-46.
  4. 4. Hao G, Chen X, Pu M (1992) Maturity groups of soybean cultivars in China. Acta Agron Sin: 275-281.
  5. 5. Bernard RL (1971) Two major genes for time of flowering and maturity in soybeans. Crop Sci 11: 242-244. doi:https://doi.org/10.2135/cropsci1971.0011183X001100020022x.
  6. 6. Buzzell RI (1971) Inheritance of a soybean flowering response to fluorescent-daylength conditions. Can J Genet Cytol 13: 703-707.
  7. 7. Bernard RL (1972) Two genes affecting stem termination in soybeans. Crop Sci 12: 235-239. doi:https://doi.org/10.2135/cropsci1972.0011183X001200020028x.
  8. 8. Buzzell RI, Voldeng HD (1980) Inheritance of insensitivity to long daylength. Soyb Genet Newsl 7: 26-29.
  9. 9. McBlain BA, Bernard RL (1987) A new gene affecting the time of flowering and maturity in soybeans. J Hered 78: 160-162.
  10. 10. Bonato ER, Vello NA (1999) E6, a dominant gene conditioning early flowering and maturity in soybeans. Genet Mol Biol 22: 229-232.
  11. 11. Cober ER, Voldeng HD (2001) A new soybean maturity and photoperiod-sensitivity locus linked to E1 and T. Crop Sci 41: 698-701. doi:https://doi.org/10.2135/cropsci2001.413698x.
  12. 12. Cober ER, Molnar SJ, Charette M, Voldeng HD (2010) A new locus for early maturity in soybean. Crop Sci 50: 524-527. doi:https://doi.org/10.2135/cropsci2009.04.0174.
  13. 13. Ray JD, Hinson K, Mankono JEB, Malo MF (1995) Genetic control of a long-juvenile trait in soybean. Crop Sci 35: 1001-1006. doi:https://doi.org/10.2135/cropsci1995.0011183X003500040012x.
  14. 14. Xia Z, Zhai H, Liu B, Kong F, Yuan X et al. (2012) Molecular identification of genes controlling flowering time, maturity, and photoperiod response in soybean. Plant Syst Evol 298: 1217-1227. doi:https://doi.org/10.1007/s00606-012-0628-2.
  15. 15. Wang Y, Wu C, Zhang X, Wang Y, Han T et al. (2008) Effects of soybean major maturity genes under different photoperiods. Acta Agron Sin: 1160-1168.
  16. 16. Xia Z, Watanabe S, Yamada T, Tsubokura Y, Nakashima H et al. (2012) Positional cloning and characterization reveal the molecular basis for soybean maturity locus E1 that regulates photoperiodic flowering. Proc Natl Acad Sci U S A 109: E2155-E2164. doi:https://doi.org/10.1073/pnas.1117982109. PubMed: 22619331.
  17. 17. Watanabe S, Xia Z, Hideshima R, Tsubokura Y, Sato S et al. (2011) A map-based cloning strategy employing a residual heterozygous line reveals that the GIGANTEA gene is involved in soybean maturity and flowering. Genetics 188: 260-395.
  18. Watanabe S, Hideshima R, Xia Z, Tsubokura Y, Sato S et al. (2009) Map-based cloning of the gene associated with the soybean maturity locus E3. Genetics 182: 1251-1262. doi:https://doi.org/10.1534/genetics.108.098772. PubMed: 19474204.
  19. Liu B, Kanazawa A, Matsumura H, Takahashi R, Harada K et al. (2008) Genetic redundancy in soybean photoresponses associated with duplication of the phytochrome A gene. Genetics 180: 995-1007. doi:https://doi.org/10.1534/genetics.108.092742. PubMed: 18780733.
  20. Lazakis CM, Coneva V, Colasanti J (2011) ZCN8 encodes a potential orthologue of Arabidopsis FT florigen that integrates both endogenous and photoperiod flowering signals in maize. J Exp Bot 62: 4833-4842. doi:https://doi.org/10.1093/jxb/err129. PubMed: 21730358.
  21. Bernier G, Périlleux C (2005) A physiological overview of the genetics of flowering time control. Plant Biotechnol J 3: 3-16. doi:https://doi.org/10.1111/j.1467-7652.2004.00114.x. PubMed: 17168895.
  22. Thakare D, Kumudini S, Dinkins RD (2011) The alleles at the E1 locus impact the expression pattern of two soybean FT-like genes shown to induce flowering in Arabidopsis. Planta 234: 933-943. doi:https://doi.org/10.1007/s00425-011-1450-8. PubMed: 21681526.
  23. Kong F, Liu B, Xia Z, Sato S, Kim B et al. (2010) Two coordinately regulated homologs of FLOWERING LOCUS T are involved in the control of photoperiodic flowering in soybean. Plant Physiol 154: 1220-1231.
  24. Sun H, Jia Z, Cao D, Jiang B, Wu C et al. (2011) GmFT2a, a soybean homolog of FLOWERING LOCUS T, is involved in flowering transition and maintenance. PLOS ONE 6: e29238. doi:https://doi.org/10.1371/journal.pone.0029238. PubMed: 22195028.
  25. Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T et al. (2010) Genome sequence of the palaeopolyploid soybean. Nature 463: 178-183. doi:https://doi.org/10.1038/nature08670. PubMed: 20075913.
  26. Lam HM, Xu X, Liu X, Chen W, Yang G et al. (2010) Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nat Genet 42: 1053-1059. doi:https://doi.org/10.1038/ng.715. PubMed: 21076406.
  27. Wu C, Ma Q, Yam KM, Cheung MY, Xu Y et al. (2006) In situ expression of the GmNMH7 gene is photoperiod-dependent in a unique soybean (Glycine max [L.] Merr.) flowering reversion system. Planta 223: 725-735.
  28. Fei Z, Wu C, Sun H, Hou W, Zhang B et al. (2009) Identification of Photothermal Responses in Soybean by Integrating Photoperiod Treatments with Planting-Date Experiments. Acta Agron Sin: 1525-1531.
  29. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA et al. (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23: 2947-2948. doi:https://doi.org/10.1093/bioinformatics/btm404. PubMed: 17846036.
  30. Tamura K, Peterson D, Peterson N, Stecher G, Nei M et al. (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28: 2731-2739. doi:https://doi.org/10.1093/molbev/msr121. PubMed: 21546353.
  31. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y et al. (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23: 2633-2635. doi:https://doi.org/10.1093/bioinformatics/btm308. PubMed: 17586829.
  32. Thakare D, Kumudini S, Dinkins RD (2010) Expression of flowering-time genes in soybean E1 near-isogenic lines under short and long day conditions. Planta 231: 951-963. doi:https://doi.org/10.1007/s00425-010-1100-6. PubMed: 20091337.
  33. Lu L, Yan W, Xue W, Shao D, Xing Y (2012) Evolution and association analysis of Ghd7 in rice. PLOS ONE 7: e34021. doi:https://doi.org/10.1371/journal.pone.0034021. PubMed: 22666315.
  34. Higo K, Ugawa Y, Iwamoto M, Korenaga T (1999) Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res 27: 297-300.
  35. Prestridge DS (1991) SIGNAL SCAN: a computer program that scans DNA sequences for eukaryotic transcriptional elements. CABIOS 7: 203-206. PubMed: 2059845.