Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Development of diagnostic SNP markers for quality assurance and control in sweetpotato [Ipomoea batatas (L.) Lam.] breeding programs

Correction

21 May 2020: The PLOS ONE Staff (2020) Correction: Development of diagnostic SNP markers for quality assurance and control in sweetpotato [Ipomoea batatas (L.) Lam.] breeding programs. PLOS ONE 15(5): e0233828. https://doi.org/10.1371/journal.pone.0233828 View correction

Abstract

Quality assurance and control (QA/QC) is an essential element of a breeding program’s optimization efforts towards increased genetic gains. Due to auto-hexaploid genome complexity, a low-cost marker platform for routine QA/QC in sweetpotato breeding programs is still unavailable. We used 662 parents of the International Potato Center (CIP)’s global breeding program spanning Peru, Uganda, Mozambique and Ghana, to develop a low-density highly informative single nucleotide polymorphism (SNP) marker set to be deployed for routine QA/QC. Segregation of the selected 30 SNPs (two SNPs per base chromosome) in a recombined breeding population was evaluated using 282 progeny from some of the parents above. The progeny were replicated from in-vitro, screenhouse and field, and the selected SNP-set was confirmed to identify relatively similar mislabeling error rates as a high density SNP-set of 10,159 markers. Six additional trait-specific markers were added to the selected SNP set from previous quantitative trait loci mapping studies. The 36-SNP set will be deployed for QA/QC in breeding pipelines and in fingerprinting of advanced clones or released varieties to monitor genetic gains in famers’ fields. The study also enabled evaluation of CIP’s global breeding population structure and the effect of some of the most devastating stresses like sweetpotato virus disease on genetic variation management. These results will inform future deployment of genomic selection in sweetpotato.

Introduction

Development of user-friendly, low-cost, high-throughput markers for quality assurance and control (QA/QC) in a genomic-assisted breeding (GAB) era is a critically important aspect in crop improvement and germplasm conservation [1,2]. This is because genetic fidelity and trueness-to-type are often not phenotypically obvious. The use of molecular markers for QA/QC has been implemented in several plants and animals and single nucleotide polymorphism (SNP) markers have become the markers of choice in germplasm characterization and QA/QC [3]. For example, [4] developed transcriptome-derived SNP markers for cost-efficient forest seed stock identification, [5] developed ‘near-minimal’ sets of SNPs to differentiate operational taxonomic units in fruit flies, while [6] developed species-diagnostic SNP markers to analyze admixture structure of varieties and rootstocks in citrus. Here we define QA and QC according to [7] who defined QA as the process or set of processes used to measure the quality of a product and QC as the process of ensuring products and services meet consumer expectations. In plant breeding, QA would refer to all measures put in place to prevent errors and create a high-quality variety, while QC is the process of identifying the defects or errors in the quality of the breeding line, germplasm accession, variety, or any other product from the breeding pipeline. Solid QA/QC procedures are critical in plant breeding, as errors in the process of developing new varieties can lead to wasted time and resources, and also reduce and/or cancel out genetic gains achieved because of genotype mix-ups along the breeding pipeline.

For most crops, the same SNP set can be used in a QA program to characterize germplasm, and study genetic diversity, genetic relationships and population structure, and in a QC program to evaluate genetic identity, genetic purity, parent-offspring identity, validation of crosses in nurseries and trait-specific testing [3, 7]. Genotype misclassification is a common problem in most crops as has been reported in Oryza spp [8] and Brassica spp [9], that has consequences in breeding and variety development. QA/QC have become even more important with the advent of molecular markers for decision support in breeding programs. Whereas the importance of QC filtration methods for SNP markers are well established [10,11] and methods put in place, QA/QC of the phenotypes that are combined with genotypes to predict performance are generally not very well established. The general lack of phenotype/genotype concordance has led to molecular decision support tools for increasing genetic gains in plant breeding such as genomic selection not achieving their full potential. Although well acknowledged in human and animal genetic fields [12,13], reports on the effects of poor QA/QC and genotype misclassification in plant breeding are generally lacking. With next- and third-generation sequencing methods enhancing rapid and cost-efficient development of large amounts of genomic data [11, 14], one of the biggest challenges of plant breeding programs is putting in place highly precise mechanisms for QA/QC of the phenotyping/genotyping processes.

Sweetpotato is a crop of increasing importance in sub-Saharan Africa (SSA) contributing to both food and nutritional security, from both adapted, starchy, white-fleshed varieties and new improved high β-carotene, orange-fleshed varieties [1517]. The International Potato Center (CIP), one of the centers of the Consultative Group on International Agricultural Research (CGIAR), runs a global sweetpotato improvement program. CIP is headquartered in Lima, Peru and has established three additional breeding support platforms in SSA. The support platform at Lima offers global technical support, while the east and central Africa platform focusses on end-user preferred varieties within this region including resistance to sweetpotato virus disease (SPVD), a major production constraint within the region. The southern Africa breeding support platform focusses on end-user preferred varieties in addition to drought tolerance, which is the major production constraint in this region, while the west African support platform focusses on culinary aspects, especially the ‘less sweet’ sweetpotato which is preferred in this region [17]. Being mainly an auto-hexaploid, GAB tools are just starting to be mainstreamed into the breeding program due to genome complexity. Several genomic tools have been developed, in partnership with several development partners and a molecular breeding team is currently stationed at CIP’s regional office for SSA in Nairobi, Kenya to facilitate this mainstreaming. Putting in place QA/QC measures is prerequisite to enhance the likelihood of success applying GAB in sweetpotato breeding. The main objectives of the current study were: i) to characterize breeding population parents from global support platforms for population structure, ii) to estimate allele diversity among the breeding population parents from the four global support platforms iii) to estimate levels of misclassification error through different steps of a multi-stage, multi-country breeding pipeline, iv) to develop a low-cost diagnostic SNP set for rapid QA/QC of sweetpotato breeding populations.

Materials and methods

Genetic materials

We collected parents from all four global breeding support platforms of CIP: Peru, being the global support platform; Uganda, being the support platform for east and central Africa; Mozambique, being the support platform for southern Africa; and Ghana, being the support platform for west Africa. We had 331 parents from Peru, 126 parents from Uganda, 144 parents from Mozambique and 61 parents from Ghana, totaling 662 parents. The list of the breeding population parents is provided in S1 Table. Since our objective was to mainstream QA/QC in breeding trials, we used progeny from a segregating breeding population to validate that the finally selected SNP set segregates in recombined individuals from parents and can identify relatively the same error rates as a high-density SNP set. These validation materials were derived from a breeding population progeny developed from the east and central African support platform, named the Mwanga Diversity Panel (MDP).

Genotyping and SNP calling

DNA from the breeding population parents and segregating progeny was extracted at the Biosciences east and central Africa (BecA) laboratories based at the International Livestock Research Institute (ILRI), Nairobi. The extraction was done following a modified cetyl trimethylammonium bromide (CTAB) method optimized for sweetpotato. The DNA was treated for contaminating RNA using RNAse A, quantified and normalized using standard protocols. The DNA was then submitted for sequencing using the Diversity Array Technology’s DArTSeq method implemented by BecA’s Integrated Genotyping Service and Support platform (IGSS) as described by [18], as separate genotyping projects. IGSS is a subsidized genotyping platform supported by the Bill and Melinda Gates Foundation to enhance use of genomics in breeding for SSA. Sequencing was done at 96-plex, high density and SNP calling done using DArT’s proprietary software DArTSoft [18], with aligning to the diploid reference genome of Ipomoea trifida, a relative of sweetpotato [19]. Given that most commercial genotyping platforms have allele depth coverage ~25x to 30x, previous studies [20] have shown that this depth of coverage is not adequate to call allele dosage with confidence in genotype quality for hexaploid sweetpotato. The study also showed that in such cases, ‘diploidized’ biallelic loci which are informative enough performed almost as well as data with high confidence dosage information, for simple traits. Therefore, biallelic markers used in this study were called in a diploidized version which means that the five heterozygous classes expected in sweetpotato were collapsed into one heterozygous class and the SNPs were coded as 0 = AA, 1 = BB, 2 = AB and “-”= Missing. All genotype-quality related parameters including call rate, frequency of loci homozygous for the reference allele (FreqHomRef), frequency of loci homozygous for the SNP allele (FreqHomSnp), frequency of heterozygous loci (FreqHets), polymorphic information content of the SNP allele (PICSnp) and average polymorphic information content (AvgPIC) were computed as part of the DArT’s proprietary software DArTSoft workflow. The metadata defining data columns as obtained from the DArTseq platform is provided as S1 File. The raw data from the global parents is provided as S1 Data.

Data analysis and validation

Population structure of International Potato Center’s breeding parents.

Since allele frequencies through genotype calling are biased when allele depth of coverage is relatively low [21] and given that we used diploidized markers for a hexaploid, we used non-parametric methods as described by [22], to estimate allele sharing distance (ASD). These methods do not assume Hardy-Weinberg equilibrium or linkage equilibrium and were implemented using the program AWClust 3.1. The phylogenetic tree was constructed using MEGA X program [23]. For allele diversity, Nei’s coefficients of inbreeding (FIS) [24] and Wright’s inbreeding coefficients (FST or θ) according to [25], were estimated in R. Additionally, linkage disequilibrium (LD) between pairs of markers used for parental population structure was done using the LDheatmap package in R, with the option of estimating r2. The LD between pairs of markers was carried out to evaluate the distribution and representativeness in the genome of the SNPs used in structure and diversity analysis, rather than to estimate the LD decay in sweetpotato, which would require a higher-density of SNPs and other considerations for allele dosage, linkage phases and haplotypes.

Estimating the level of misclassification error during breeding operations and field experiments.

We used the MDP multi-family breeding population to estimate the level of error expected from our standard cross-border breeding operations. This population was developed by crossing 8B by 8A parents without reciprocals, from the east and central Africa breeding platform in CIP-Uganda, thereby resulting in 64 families, with about 30 progeny per family on average. The crossing was done following a B*A pseudo-heterotic grouping based on genetic distance established by simple sequence repeat (SSR) markers [26]. The seed was germinated and maintained in vitro at the Biosciences east and central Africa (BecA) hub by CIP-Kenya, from where the in vitro plantlets were shipped back to CIP-Uganda for screenhouse multiplication and field experiments at Namulonge (0° 31' 17.99" N, 32° 36' 32.39" E). To evaluate the genetic fidelity following movement from in vitro to screenhouse and then to field experiments, we randomly sampled 94 genotypes from 1,886 genotypes of the MDP, which constitutes about 5% of the population, and genotyped using DArTSeq, as described above. The genotyping was done in 96-plex resulting in three 96-well plates (one each for in vitro, screen house and field) and a total of 282 DNA samples (S2 Table). The raw data from the segregating progeny is provided as S2 Data. The SNPs were filtered and used in further analysis. The ASD was computed as for the parents above [22] and a phylogenetic tree was generated using DARwin 6.0.21 [27]. Afterwards the clustering was examined based on positions on the tree and Sankey diagrams developed using the Alluvial package [28] in R. Additionally, Monte-Carlo tests were used to calculate the correlation among the separate distance matrices from in vitro, screenhouse and field using the mantel.rtest function of ade4 package in R.

Selection and validation of a diagnostic SNP set.

For our objective to develop a QA/QC SNP set that would be diagnostic for the global breeding population, we selected SNPs identified from parents as described above but also validated the selected SNPs using segregating progeny from a breeding population. We therefore selected only high-quality SNP markers that were present in both the parents and the MDP breeding population progeny. The selected markers were confirmed if they kept the same population structure of the parents and still identified the same misclassification error rate in the MDP population. Several studies have selected rapid QA/QC sets with as low as 10 SNP markers e.g. in Maize [29]. However, the base chromosome number of sweetpotato is 15 and given that we were diploidizing hexaploid loci, our aim was to have a minimum of two markers per base chromosome. We performed principal component analysis of the intermediate marker set according to [29], but no apparent grouping of the markers was determined. We therefore selected the final 30 SNP markers based on chromosome number and genetic distance, from an intermediate marker set of 85 SNP markers. To establish the utility of the 30 selected SNPs for rapid QA/QC, we compared the ASD of both parents and MDP populations based on the 30 selected SNPs and the ASD based on their respective original filtered marker sets (205 SNPs for parents and 10,159 SNPs for MDP), using DARwin 6.0.21 tree comparison function [27]. The filtered data, including the 205-SNP set, 85-SNP set, and 85-SNP set for MDP are provided in S3 Data.

Results

SNP profile from the parental and segregating populations

A total of 9,670 SNP markers were obtained from genotyping the 662 parents of CIP’s breeding population (S1 Data). The unfiltered dataset had call rates ranging from 0.4–1.0, FreqHomRef ranging from 0.0–1.0, FreqHomSnp ranging from 0.0–1.0, FreqHets ranging from 0.0–0.3, PICSnp and AvgPIC both ranging between 0.0–0.5 (Fig 1). With filtration of ≤ 25% missingness, ≥ 0.25 polymorphic information content (PIC) and ≥ 10% minimum allele frequency (MAF) and an average of 30x allele depth of coverage we recovered 205 SNP markers that were deemed appropriate for analysis of the breeding population structure (S3 Data). Fig 1 shows quality attributes of the unfiltered and filtered SNP data. After filtration, call rates ranged from 0.75–1.0, FreqHomRef ranged from 0.6–0.85, FreqHomSnp ranged from 0.0–0.1, FreqHets ranged from 0.0–0.35, PICSnp ranged from 0.25–0.45, while AvgPIC ranged from 0.1–0.3 (Fig 1). The number of filtered SNPs ranged from six to 18 per base chromosome.

thumbnail
Fig 1.

Quality attributes of unfiltered (9,670; Top) and filtered (205; Bottom) SNPs from DArTSeq indicating call rate (A), frequency of homozygotes for the reference allele (B), frequency of homozygotes for the alternative allele (SNP; C), frequency of heterozygotes in the data (D), polymorphic information content of the SNP (E), and average polymorphic information content between reference and SNP alleles (F).

https://doi.org/10.1371/journal.pone.0232173.g001

The high-density genotyping of the MDP segregating progeny resulted in about 41,194 SNPs (S2 Data). Stringent filtering of these based on polymorphic information content (PIC) ≥ 0.25, minimum allele frequency ≥ 20% and call rate ≥ 90%, removing all SNPs with no chromosome position and those on chromosome zero (Chr00) resulted in 10,159 SNPs that were used in further analyses.

Population structure of CIP’s breeding population

We examined population structure of the parents using 205 SNP markers. As expected from a global breeding program, population structuring indicated evidence of germplasm transfer among the breeding support platforms, although there was also evident local adaptation to each support platform (Fig 2). The global support platform in Peru had the highest number of parents in the current study. Clustering showed that there is a group of parents from Peru that are closely related to African breeding parents especially those from Ghana and Mozambique. However, an additional group was only unique to Peru (Fig 2). This group can also be seen between the first and second dimensions of a 2D multidimensional scale (S1 Fig). The east and central Africa support platform in Uganda had a distinct group of parents but also a small admixed group with Mozambique (Fig 2). The Uganda platform did not have a lot of admixtures from Peru. Given that the west African support platform was recently established (ca. 2010), and is the smallest in terms of size, the clustering indicates intake of breeding materials from other breeding support platforms especially from Mozambique and Peru, with minimal transfer to Ghana from Uganda. However, on a higher level, the structure can be generalized into two, with one cluster made up of materials from Peru and Ghana, and the other made up of materials from Uganda, Mozambique and Peru.

thumbnail
Fig 2. Phylogenetic tree (Neighbor-Joining) showing the population structure of the International Potato Center (CIP)’s global breeding parents.

Genotypes in Black represent parents from the global support platform in Peru, genotypes in Blue represent parents from the southern Africa support platform in Mozambique, genotypes in Green represent parents from the east and central Africa support platform in Uganda, while genotypes in Red represent parents from the west African support platform in Ghana. The tree was developed using MEGA X.

https://doi.org/10.1371/journal.pone.0232173.g002

Allele diversity and chromosome-wise linkage disequilibrium between markers

Nei’s coefficients of inbreeding using 205 SNPs indicated an average of FIS = 0.14 across all populations and that parents from Uganda had the highest inbreeding coefficient FIS = 0.33, followed by Mozambique with FIS = 0.24. Ghana, followed by Peru had the lowest coefficients of inbreeding at FIS = 0.008 and FIS = 0.07, respectively. The estimated variance components and fixation indices showed that the correlation of genes within individuals or inbreeding was F = 0.18, the correlation of genes in different individuals within the same population was θ = 0.07, and the correlation of genes within individuals within populations f = 0.12. Comparing θ values (FST) between pairs of populations (support platforms in this case) showed that Uganda was the most distinct group with θ = 0.08, θ = 0.09, and θ = 0.1 with Ghana, Mozambique and Peru, respectively. The paired θ values among Ghana, Mozambique, and Peru were fairly consistent ranging from θ = 0.041 to θ = 0.049. Data is summarized in Table 1. We further analyzed for LD between pairs of markers as proxy to check for the distribution of markers genome-wide and per chromosome. Analysis of LD between markers indicated minimal LD among the SNP markers used with the genome-wide LD having an average r2 ≤ 0.1. LD per chromosome is presented in Fig 3. The results show that very few loci were in LD at r2 ≥ 0.1, as majority of loci within a chromosome also had r2 ≤ 0.1. These results indicated that the data set was adequately distributed for use in analyzing population structure and diversity. To evaluate the utility of the selected markers for QA/QC, we calculated Nei’s coefficients of inbreeding using the 30 selected SNPs. The results showed the same trend as when using 205 SNPs, where, average FIS = 0.21 across all populations. Parents from Uganda still had the highest inbreeding coefficient FIS = 0.35, followed by Mozambique with FIS = 0.27. Ghana, followed by Peru had the lowest coefficients of inbreeding at FIS = 0.08 and FIS = 0.13, respectively (data not shown).

thumbnail
Fig 3. Linkage disequilibrium among 205 markers used in analysis of population structure of 662 breeding parents of the International Potato Center’s global breeding program.

The linkage disequilibrium was analyzed per chromosomes for the 15 base chromosomes of hexaploid sweetpotato.

https://doi.org/10.1371/journal.pone.0232173.g003

thumbnail
Table 1. Allelic diversity parameters among parents of the International Potato Center (CIP)’s breeding parents from Ghana, Mozambique, Peru and Uganda.

https://doi.org/10.1371/journal.pone.0232173.t001

Estimated misclassification errors within the breeding process

Misclassification is used here to indicate mislabeling errors leading to distorted clustering of the same genotype in vitro, in the screen house and in the field. Cluster analysis indicated that 26 out of 94 genotypes evaluated did not cluster as expected among in vitro, screen house and field samples thereby indicating about 27.7% misclassification errors as the germplasm moved from in vitro to screenhouse and then to the field as can be seen from the Sankey diagram (Fig 5, Top). The Monte-Carlo tests comparing correlation among the separate distance matrices for in vitro, screenhouse and field indicated that more misclassification occurred when the genotypes moved from in-vitro (Nairobi, Kenya) to screenhouse when compared to movement from screenhouse to the field (both in Namulonge, Uganda). The correlation between distance matrices of in vitro and screenhouse was r = 0.84, between in vitro and field was r = 0.87 and between screenhouse and field r = 0.93. These indicated misclassification rates of 16%, 13% and 7% respectively (data not shown).

Identifying diagnostic markers for routine quality assurance and control of breeding populations

Using 10,159 SNPs for QC as used above is ‘rich’ for routine QA/QC within most breeding programs. The desired low-cost, low-density QA/QC SNP set was therefore selected in this study based on the described criteria. This resulted in further filtration of the 205 SNP markers used for population structure of the parents above, down to 85 SNP markers (S3 Data), which could be used for ‘general QC’ in the sweetpotato breeding programs as proposed by [29]. However, for routine QC, this marker number is still probably too high for most breeding programs. Principal component analysis of the 85 markers did not show any specific grouping of markers, with PC1 and PC2 only explaining 9.1% of the variation (S2 Fig). Therefore, the final 30 SNP markers were selected based on genetic distance per chromosome. The set of 85 markers were not evenly distributed for all chromosomes and chromosome 15 had only one marker. To achieve the target of two markers per base chromosome, we selected one marker from the original set of 205 SNPs, based on genetic distance relative to the one marker present in the set of 85 SNPs, for chromosome 15. Comparing the population structure of the parents using 205, 85 and 30 SNP markers for parents indicated that the 30 markers kept the general structure of the populations, though the clustering was considerably different compared with the use of 205 SNPs (Fig 4). Comparing clustering trees indicated that the tree developed with 205 SNPs was 17.1% different from the tree with 30 SNPs when strict conditions were used. For validation of the selected marker set, we also compared the level of misclassification error identified in the breeding population progeny (MDP) using 10,159 SNPs, 85 SNPs and 30 SNPs (Fig 5). The misclassification error was calculated as the percentage of genotypes that did not cluster as expected in in vitro, screenhouse and field. Results show that 10,159 SNPs identified 27.7% misclassification, 85 SNPs identified 29.8% misclassification and 30 SNPs identified 31.9% misclassification. Tree comparison between 10,159 SNPs and 30 SNPs for MDP progeny indicated that they were 24.6% dissimilar when strict conditions were applied. Combined, these results suggest that the selected 30 SNPs could be used as a cost-effective rapid QA/QC set for sweetpotato in CIP’s breeding populations, while trading-off the 4% difference in error detection between 10,159 SNPs and 30 SNPs, for a significantly reduced cost of QA/QC. The selected SNPs are listed in Table 2.

thumbnail
Fig 4. Phylogenetic trees (Neighbor-Joining) comparing the clustering of the 662 International Potato Center (CIP)’s breeding parents using 205 highly informative SNP markers (left), 85-SNP intermediate marker set (center) and the 30-SNP selected QA/QC set markers (right).

Genotypes in Violet represent parents from the global support platform in Peru, genotypes in Green represent parents from the southern Africa support platform in Mozambique, genotypes in Orange represent parents from the east and central Africa support platform in Uganda, while genotypes in Blue represent parents from the west African support platform in Ghana. Trees were developed using DARwin 6.0.21.

https://doi.org/10.1371/journal.pone.0232173.g004

thumbnail
Fig 5. Sankey diagrams showing mislabeling error as Mwanga diversity Panel (MDP) population moved from in-vitro to screen house to field, based on 10,159 SNPs, 85-SNP intermediate marker set and 30-SNP selected quality control (QC)-set.

The Pink color indicates those that did not cluster (with mislabeling errors) while the grey color indicates those that clustered as expected, implying no mislabeling errors.

https://doi.org/10.1371/journal.pone.0232173.g005

thumbnail
Table 2. Details of the 36 SNPs selected for ‘rapid QC’ in sweetpotato.

AlleleID refers to the identity of the specific allele on the DArT platform, AlleleSeq refers to the flanking sequence of the SNP, Chr indicates the chromosome number, Pos indicates the position of a SNP on the specific chromosome, SNP is single nucleotide polymorphism.

https://doi.org/10.1371/journal.pone.0232173.t002

Addition of trait specific markers to the selected quality control set

Previous studies had mapped quantitative trait loci (QTL) for yield and component traits [30, 31] as well as quality-related traits [32]. From these QTL mapping results, we selected six SNP markers that were associated with dry matter, starch, β-carotene, flesh color and total root yield. The markers labeled ‘trait specific’ are shown in Table 2. The first four traits were selected because they are correlated and important contributors to culinary traits that affect adoption of new varieties in sweetpotato. Dry matter and starch are positively correlated but are negatively correlated to both β-carotene and flesh color, and this negative correlation affects ‘culinary quality’. Additionally, these traits are oligogenic hence results are repeatable within the QTL. Total storage root yield was selected as a primary trait and the selected marker of a QTL was found to be a constitutive marker for this trait across several environments based on multi-environment testing of a full-sib population [30].

Discussion

We obtained less than 10,000 bi-allelic SNP markers from genotyping our parental population. Stringent filtering resulted in an even smaller data set of 205 SNP markers. Similarly, only about 41,194 SNP markers were obtained from genotyping a segregating progeny, which were filtered down to 10,159 SNPs. This is in contrast with the high number of markers normally reported from GBS-based genotyping in diploid crops. The considerable reduction in highly informative markers can be associated with the difficulty in genotyping polyploids. With a mostly auto-hexaploid genome [19], sweetpotato presents allele dosage uncertainty due to ambiguous copy numbers of each allele. Additionally, the assumption of random inheritance of alleles may not hold true in this case due to uncharacterized consequences of whole genome duplication [33,34]. DArTSeq implements new protocols of sequencing complexity reduced representations [35] in combination with the next-generation sequencing methods [36,37]. Implementing genotyping-by-sequencing-like procedures, DArTSeq involves a two-restriction enzyme system composed of a ‘rare-cutter’ and a ‘common-cutter’, mainly Pst1-Mse1, to enhance uniform complexity reduction within the genome [38,39]. In such next-generation sequencing methods, depth of sequencing determines genotyping quality as low depth of coverage results in genotyping errors, misalignments and a lot of missing data which eventually cause biases in downstream population-genetic analyses [21, 40,41]. For instance, [42] showed that low sequencing depth resulted in SNPs that underestimated genomic heritability due to overestimation of inbreeding and underestimation of heterozygosity in ryegrass. We chose to use ‘diploidized’ data in the current analysis because the depth of coverage from most genotyping platforms is not adequate to reliably characterize heterozygous loci, such as those likely to be found in polyploids, for which deeper sequencing is required [43]. Furthermore, analyses comparing genotypic data from DArTSeq and those from a deep sequencing optimization platform for sweetpotato called GBSpoly [44] have confirmed that highly informative ‘diploidized’ DArTseq data performed just as well as high confidence data with dosage in genomic predictions of sweetpotato for traits with simple trait architecture [20]. This therefore implies that such markers are adequate to carry out routine activities such as population structure and diversity that do not necessarily depend on details of allele dosage and haplotyping.

Population structure as well as allele diversity analyses in the current study indicated that parental genotypes from Uganda were the more distinct and inbred. This observation can be associated with the high sweetpotato virus disease (SPVD) pressure around the lake region of eastern Africa and a general lack of germplasm with high levels of resistance to SPVD necessitating the use of the same lines frequently as parents in the Ugandan breeding program [4547]. SPVD is the most important virus complex in SSA and its effects are most pronounced in east Africa, causing yield loses of about 56–98% in farmers’ fields [48,49]. [50] noted that SPVD prevalence in east Africa resulted in the failure of nearly all orange-fleshed varieties introduced into this region. SPVD is caused by a synergistic and complex infection by sweetpotato feathery mottle virus and sweetpotato chlorotic stunt virus, transmitted by aphids and white flies, respectively [51]. [52] indicated that different regions have different strains of the individual virus, and that east Africa has distinct strains. Studies have also showed that sweetpotato chlorotic stunt virus strains are more related in east and southern Africa and are distinct from those in the other regions of the world [53]. These results are supported by our current population structuring which shows that Uganda has some admixing with Mozambique, but very little admixing with either Peru or Ghana. These results have implications extending to other breeding decisions such as determining the effective population sizes especially for the Uganda breeding platform where migration of germplasm into the platform is restricted due to SPVD.

Different alleles are represented in different genetic backgrounds and our results show allele diversity between other support platforms with especially the Uganda population. Therefore, understanding population diversity especially of a global breeding program is important for breeding decisioning. Breeding programs are currently moving towards GAB. Repeatability of quantitative trait loci in different genetic backgrounds is one prerequisite for the success of GAB methods such as QTL mapping, genome-wide association mapping, and genomic selection [54,55]. In genome-wide association mapping, accounting for population structure avoids false positives and allows selection of causative variants, while accurate prediction of untested future genotypes in genomic selection is only possible when familial relatedness is accounted for, allowing for a reliable association between markers and QTL [56]. In the case of our global breeding population, the current information will be important when designing a genomic selection scheme to facilitate decisions such as prediction within or across sub-populations. Similarities and differences in genetic architecture of complex traits between populations can also be understood by studying the genetic correlation between the populations [55]. Our results indicate that the Ugandan sub-population was also the most distinct from the three others when θ values (FST) between pairs of populations was examined. This would imply that predictions may be carried out separately for the Ugandan populations in future GAB activities, while the predictions may be tested across the platforms in Peru, Mozambique and Ghana, given similar environmental conditions. Since GAB requires that markers be in LD with QTL, our results indicating very minimal LD among markers confirm that this marker density is not enough for making selection decisions [57,58]. Although the number of markers used in the current study are adequate for the purposes of the current objectives of population structuring, more dense markers along the genome will be required to reliably study the LD decay in sweetpotato. However, ‘high density’ has cost implications and hence the optimum number of markers required for routine GAB use will need to be reliably estimated through reducing within-haplotype density by selecting the minimum number of markers that can define common haplotypes [59].

In the current study, we used filtration and validation methods of DArTSeq developed markers to select a marker set of 30 SNPs that can be used for QA/QC purposes in sweetpotato. Our selection of informative markers included considerations for depth of coverage, missingness, chromosome position, genetic distances, validation for repeatability in progeny and inclusion of trait-specific markers to result in a total of 36 SNPs. Development of SNP sets for QA/QC has been done in several crops. Extensive tests were carried out to develop a SNP set for ‘broad’ and ‘rapid’ QC in maize [29]. In their study, they showed that marker coverage between 2 and 15, markers with less than 20% missing values, including markers with known chromosome positions, markers with less than 6% heterogeneity, inclusion of trait specific markers, and selection of markers from groups based on average group distance gave the best marker set towards developing a ‘rapid’ QC set, using DArTSeq markers. Prior to this, [1], used about 1,597 SNP markers from the KASPar and GoldenGate platforms to select highly informative markers for low-cost QC genotyping in maize. They recommended a set of 50–100 SNPs for routine QC after finding about 29% heterogeneity in inbred lines. In rice, [2] recommended a subset of 24–36 SNP markers filtered from DArTSeq developed markers for genetic purity analyses. In sweetpotato, QA/QC problems in breeding operations have been shown in the current manuscript by monitoring the rate of misclassification as materials moved through different stages of breeding trialing. Our results show about 30% misclassification issues in one breeding population. Routine application of QA/QC in sweetpotato breeding trials will therefore improve precision and breeding efficiency through use of new methods like forward breeding and genomic selection currently being adopted by CGIAR programs. These new breeding strategies are aimed towards increasing the rate of genetic gains from breeding to address issues related with population increase and climate change. Therefore, QA/QC of breeding processes will improve the likelihood of success.

Since the real impact from breeding can only be measured by the improvements observed in farmers’ fields, controlling and assuring the quality of finished varieties is also important to breeding programs. Issues with QA/QC of released varieties have been reported in sweetpotato and this is exacerbated because the extent of adoption of new varieties cannot be determined accurately especially with informal seed systems where genetic integrity is seldom considered [60]. In Ethiopia, [18] used 17,220 DArTSeq developed markers to establish that about 20% of farmers confused local varieties for improved varieties and vice versa, and that farmers assigned different local names to the same variety or vice versa. Their study confirmed that data from survey studies [61,62] were mostly unreliable. Despite this important revelation, high density genotyping at 17,220 markers is not amenable for widespread routine use, therefore leaving household surveys as the predominant way of carrying out adoption studies. The currently developed marker set will therefore be useful in also addressing adoption-related needs in sweetpotato.

Towards deployment in breeding programs, the identified markers will be converted into competitive allele-specific PCR (KASP) system, for routine use of the target SNPs for QA/QC. However, to achieve increased genetic gains in the sweetpotato breeding programs, QA/QC will need to be combined with other approaches of optimizing breeding schemes.

Supporting information

S1 Data. The original DArTSeq data generated from 662 parents of the global breeding program of the international potato center.

https://doi.org/10.1371/journal.pone.0232173.s001

(CSV)

S2 Data. DArTseq single nucleotide polymorphism (SNP) markers (raw data) from the segregating progeny used in quality control of breeding operations.

https://doi.org/10.1371/journal.pone.0232173.s002

(XLSX)

S3 Data. Separate excel sheets showing the 205 stringently filtered and highly polymorphic SNPs (parents-205), 85-SNP intermediate with highlighted 30-SNP selected QA/QC set based on parents (Parents-85&Selected QC Set), and 85-SNP intermediate marker set with highlighted selected QA/QC set based on the progeny of the Mwanga Diversity Panel (MDP-85&Selected QC Set).

https://doi.org/10.1371/journal.pone.0232173.s003

(XLSX)

S1 Fig. Two-dimensional figure from multidimensional scaling of the 662 International Potato center (CIP)’s global sweetpotato breeding parents as observed using 205 highly informative SNP markers.

https://doi.org/10.1371/journal.pone.0232173.s004

(PDF)

S2 Fig. Principle component analysis (PCA) carried out on 85-SNP intermediate marker set to check for possible groupings that could aid the selection of a 30-SNP quality control marker set.

https://doi.org/10.1371/journal.pone.0232173.s005

(PDF)

S1 Table. List of 662 parental genotypes from the International Potato Center (CIP)’s global breeding program including breeding support platform of origin.

https://doi.org/10.1371/journal.pone.0232173.s006

(XLSX)

S2 Table. Genotype list and family assignments of the segregating progeny used in quality control of breeding operations.

https://doi.org/10.1371/journal.pone.0232173.s007

(XLSX)

S1 File. The metadata that is associated with columns related to genotype quality parameters as provided by the DArTSeq platform.

https://doi.org/10.1371/journal.pone.0232173.s008

(XLSX)

Acknowledgments

The work was carried out as part of the Consultative Group on International Agricultural Research (CGIAR)-Research Program on Roots, Tubers and Bananas (RTB). Sincere thanks to the Integrated Genotyping Service and Support (IGSS) platform for genotyping the populations. Additionally, the authors acknowledge all technical breeding teams in Ghana, Mozambique, Peru and Uganda. This manuscript has been released as a pre-print at doi: http://biorxiv.org/cgi/content/short/826792v1, (Gemenet et al.).

References

  1. 1. Semagn K, Beyene Y, Makumbi D, Mugo S, Prasanna BM, Magorokosho C et al. Quality control genotyping for assessment of genetic identity and purity in diverse tropical maize inbred lines. Theor Appl Genet. 2012; 125:1487–1501. pmid:22801872
  2. 2. Ndjiondjop MN, Semagn K, Zhang J, Gouda AC, Kpeki SB, Goungoulou A, et al. Development of species diagnostic SNP markers for quality control genotyping in four rice (Oryza L.) species. Mol Breed. 2018; 38:131. pmid:30416368
  3. 3. Ertiro BT, Ogugo V, Worku M, Das B, Olsen M, Labuschagne M, et al. Comparison of Kompetitive Allele Specific PCR (KASP) and genotyping by sequencing (GBS) for quality control analysis in maize. BMC Genomics 2015; 16:908. pmid:26545737
  4. 4. Cullingham CI, Cooke JEK, Dang S, Coltman DW. A species-diagnostic SNP panel for discriminating lodgepole pine, jack pine, and their interspecific hybrids. Tree Genetics and Genomes 2013; 9:1119–1127.
  5. 5. Frey JE, Guillén L, Frey B, Samietz J, Rull J, Aluja M. Developing diagnostic SNP panels for the identification of true fruit flies (Diptera: Tephritidae) within the limits of COI based species delimitation. BMC Evol Biol. 2013; 13:106. pmid:23718854
  6. 6. Curk F, Ancillo G, Ollitrault F, Perrier X, Jacquemoud-Collet J-P, Garcia-Lor A, Navarro L, et al. Nuclear species-diagnostic SNP markers mined from 454 amplicon sequencing reveal admixture genomic structure of modern citrus varieties. PLoS One 2015;10: e0125628. pmid:25973611
  7. 7. Gowda M, Worku M, Nair SK, Palacios-Rojas N, Huestis G, Prasanna BM. Quality Assurance/ Quality Control (QA/QC) in Maize Breeding and Seed Production: Theory and Practice. 2017; CIMMYT: Nairobi.
  8. 8. Orjuela J, Sabot F, Chéron S, Vigouroux Y, Adam H, Chrestin H, et al. An extensive analysis of the African rice genetic diversity through a global genotyping. Theor Appl Genet. 2014; 127:2211–2223. pmid:25119871
  9. 9. Mason AS, Zhang J, Tollenaere R, Vasquez-Teuber P, Dalton-Morgan J, Hu L, et al. High-throughput genotyping for species identification and diversity assessment in germplasm collections. Mol Ecol Resour. 2015; 15:1091–1101. pmid:25641370
  10. 10. Forneris NS, Legarra A, Vitezica ZG, Tsuruta S, Aguilar I, Misztal I, et al. Quality control of genotypes using heritability estimates of gene content at the marker. Genetics 2015; 199:675–681. pmid:25567991
  11. 11. Jarquín D, Howard R, Graef G, Lorenz A. Response surface analysis of genomic prediction accuracy values using quality control covariates in soybean. Evol Bioinformatics 2019; 15:1–7.
  12. 12. Buyske S, Yang G, Matise TC, Gordon D. When a case is not a case: effects of phenotype misclassification on power and sample size requirements for the transmission disequilibrium test with affected child trios. Human Hered. 2009; 67(4):287–92.
  13. 13. Smith S, Hay EH, Farhat N, Rekaya R. Genome wide association studies in presence of misclassified binary responses. BMC Genetics 2013; 14:124. pmid:24369108
  14. 14. van Dijk EL, Jaszczyszyn Y, Naquin D, Thermes C. The Third Revolution in Sequencing Technology. Trends in Genetics 2018; 34(9).
  15. 15. Mwanga ROM, Ghislain M, Kreuze J, Ssemakula GN, Yencho GC. Exploiting the use of biotechnology in sweetpotato for improved nutrition and food security: Progress and future outlook In: Nampala P, Makara MA (eds) Proceedings of the International Conference on Agro-Biotechnology, Biosafety and Seed Systems in Developing Countries. Science Foundation for Livelihoods and Development; 2011. pp25-31, Kampala, Uganda.
  16. 16. Low JW, Lynam J, Lemaga B, Crissman C, Barker I, Thiele G et al. Sweetpotato in Sub-Saharan Africa. In: Loebenstein G, Thottappilly G (eds) The Sweetpotato. Springer Science + Business Media, B.V Dordrecht; 2009. pp359–390.
  17. 17. Low JW, Mwanga ROM, Andrade M, Carey E, Ball A. Tackling vitamin A deficiency with biofortified sweetpotato in sub-Saharan Africa. Global Food Secur 2017; 14:23–30.
  18. 18. Kosmowski F, Aragaw A, Kilian A, Ambel A, Ilukor J, Yigezu B, et al. Varietal Identification in household surveys: Results from three household-based methods against the benchmark of DNA fingerprinting in southern Ethiopia. Expl Agric. 2018;
  19. 19. Wu S, Lau KH, Cao Q, Hamilton JP, Sun H, Zhou C et al. Genome sequences of two diploid wild relatives of cultivated sweetpotato reveal targets for genetic improvement. Nature communications 2018; 9:4580. pmid:30389915
  20. 20. Gemenet DC, Lindqvist-Kreuze H, Olukolu BA, De Boeck B, Pereira GDS, Mollinari M, et al. Sequencing depth and genotype quality: Accuracy and breeding operation considerations for genomic selection application in autopolypoid crops. BioRxiv Preprint 2020; https://doi.org/10.1101/2020.02.23.961383.
  21. 21. Maruki T, Lynch M. Genotype Calling from Population-Genomic Sequencing Data. G3: Genes Genomes Genetics 2017; https://doi.org/10.1534/g3.117.039008.
  22. 22. Gao X, Starmer J. Human population structure detection via multilocus genotype clustering. BMC Genetics 2007; 8:34. pmid:17592628
  23. 23. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Molecular Biology and Evolution 2018; 35:1547–1549. pmid:29722887
  24. 24. Nei MF. Statistics and analysis of gene diversity in subdivided populations. Annals of Human Genetics 1977; 41:225–233. pmid:596830
  25. 25. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution 1984; 38:1358–1370. pmid:28563791
  26. 26. David MC, Diaz FC, Mwanga ROM, Tumwegamire S, Mansilla RC, Grüneberg WJ. Gene pool subdivision of east African sweetpotato parental material. Crop Sci. 2018; 58:2302–2314.
  27. 27. Perrier X, Jacquemoud-Collet JP. DARwin software. 2006; Available from: http://darwin.cirad.fr/darwin.
  28. 28. Bojanowski M, Edwards R. Alluvial R package. 2006; Available from: https://github.com/mbojan/alluvial.
  29. 29. Chen J, Zavala C, Ortega N, Petroli C, Franco J, Burgueño J, et al. The development of quality control genotyping approaches: A case study using elite maize lines. PLoS ONE 2016; 11(6): e0157236. pmid:27280295
  30. 30. Gemenet DC, De Boeck B, Pereira GDS, Kitavi MN, Ssali RT, Utoblo O et al. When a phenotype is not the genotype: Implications of phenotype misclassification and pedigree errors in genomics-assisted breeding of sweetpotato [Ipomoea batatas (L.) Lam.]. BioRxiv [Preprint]. 2019 [cited 2019 November 11]; Available from: https://doi.org/10.1101/747469.
  31. 31. Pereira GDS, Gemenet DC, Mollinari M, Olukolu BA, Diaz F, Mosquera V, et al. Multiple QTL mapping in autopolyploids: a random-effect model approach with application in a hexaploid sweetpotato full-sib population. BioRxiv [Preprint]. 2019 [cited 2019 November 11]; Available from: https://doi.org/10.1101/622951.
  32. 32. Gemenet DC, Pereira GDS, De Boeck B, Wood JC, Mollinari M, Olukolu BA et al. Quantitative trait loci and differential gene expression analyses reveal the genetic basis for negatively-associated β-carotene and starch content in hexaploid sweetpotato [Ipomoea batatas (L.) Lam.]. Theor App Genet. 2019; pmid:31595335
  33. 33. Blischak PD, Kubatko LS, Wolfe AD. Accounting for genotype uncertainty in the estimation of allele frequencies in autopolyploids. Mol Ecol Resources 2016; 16:742–754.
  34. 34. Blischak PD, Kubatko LS, Wolfe AD. SNP genotyping and parameter estimation in polyploids using low-coverage sequencing data. Bioinformatics 2018; 34(3):407–415. pmid:29028881
  35. 35. Altshuler D, Pollara VJ, Cowles CR, Van Etten WJ, Baldwin J, Linton L, et al. An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature 2000; 407:513–516. pmid:11029002
  36. 36. Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA et al. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS One 2008; 3: e3376. pmid:18852878
  37. 37. Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES Mitchell SE. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 2011; 6(5): e19379. pmid:21573248
  38. 38. Poland JA, Brown PJ, Sorrells ME, Jannink J-L. Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by- sequencing approach. PLoS One 2012; 7: e32253. pmid:22389690
  39. 39. Brouard J-S, Boyle B, Ibeagha-Awemu EM, Bissonnette N. Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high-quality genotypes and the accuracy of imputation. BMC Genetics 2017; 18:32. pmid:28381212
  40. 40. Fumagalli M. Assessing the effect of sequencing depth and sample size in population genetics inferences. PLoS One 2013; 8(11): e79667. pmid:24260275
  41. 41. Crawford JE, Lazzaro BP. Assessing the accuracy and power of population genetic inference from low-pass next-generation sequencing data. Front in Genet. 2012; 3:66.
  42. 42. Ashraf BH, Byrne S, Fé D, Czaban A, Asp T, Pedersen MG et al. Estimating genomic heritabilities at the level of family-pool samples of perennial ryegrass using genotyping-by-sequencing. Theor Appl Genet. 2016; 129:45–52. pmid:26407618
  43. 43. Fresnedo-Ramírez J, Yang S, Sun Q, Karn A, Reisch BI, Cadle-Davidson L. Computational analysis of ampSeq data for targeted, high-throughput genotyping of amplicons. Front Plant Sci. 2019; 10:599. pmid:31156670
  44. 44. Wadl PA, Olukolu BA, Branham SE, Jarret RL, Yencho GC, Jackson DM. Genetic diversity and population structure of the USDA sweetpotato (Ipomoea batatas) germplasm collections using GBSpoly. Front Plant Sci. 2018; 9:1–13.
  45. 45. Gibson RW, Kaitisha GC, Randrianaivoarivony JM. Identification of then east African strain of sweetpotato chlorotic stunt virus as a major component of sweetpotato virus disease in southern Africa. Plant Dis. 1998; 132:1063.
  46. 46. Gibson RW, Mpembe I, Alicai T, Carey EE, Mwanga ROM, Seal SE, et al. Symptoms, aetiology and serological analysis of sweet potato virus disease in Uganda. Plant Pathol. 1998; 47:95–102.
  47. 47. Ndunguru J, Kapinga R, Sseruwagi P, Sayi B, Mwanga R, Tumwegamire S, et al. Assessing the sweetpotato virus disease and its associated vectors in northwestern Tanzania and central Uganda. Afr J Agric Res. 2009; 4(4):334–343.
  48. 48. Mukasa SB, Rubaihayo PR, Valkonen, JPT. Incidence of viruses and virus-like diseases of sweetpotato in Uganda. Plant Dis. 2003; 87:329–335. pmid:30831824
  49. 49. Ndunguru J, Kapinga R. Viruses and virus-like diseases affecting sweetpotato subsistence farming in southern Tanzania. Afr J Agric Res. 2007; 5:232–239.
  50. 50. Grüneberg WJ, Ma D, Mwanga ROM, Carey EE, Huamani K, Diaz F et al. Advances in sweetpotato breeding from 1992 to 2012. In: Low JW, Nyongesa M, Quinn S Parker M (eds) Potato and Sweetpotato in Africa: Transforming the Value Chains for Food and Nutrition Security. CABI; 2015. pp3–68.
  51. 51. Mwanga ROM, Moyer J, Zhang D, Carey EE, Yencho GC. Nature of resistance to sweetpotato virus diseases. Acta Horticulturae 2002; 583:113–119.
  52. 52. Clark CA, Davis JA, Abad JA, Cuellar WJ, Fuentes S, Kreuze JF et al. Sweetpotato viruses: 15 years of progress on understanding and managing complex diseases. Plant Disease 2012; 96:168–185. pmid:30731810
  53. 53. Hoyer U, Maiss E, Jekmann W, Lessemann DE, Vette HJ. Identification of coat protein gene of sweetpotato sunken vein cloterovirus from Kenya and evidence for serological relationship among geographically diverse cloterovirus isolates from sweetpotato. Phytopath. 1996; 47:582–587.
  54. 54. Azevedo CF, de Resende MDV, e Silva FF, Nascimento M, Viana JMS, Valente MSF. Population structure correction for genomic selection through eigenvector covariates. Crop Breeding and Applied Biotechnology 2017; 17:350–358.
  55. 55. Wientjes YCJ, Calus MPL, Duenk P, Bijma P. Required properties for markers used to calculate unbiased estimates of the genetic correlation between populations. Genet Sel Evol. 2018; 50:65. pmid:30547748
  56. 56. Daetwyler HD, Kemper KE, van der Werf JHJ, Hayes BJ. Components of the accuracy of genomic prediction in a multi-breed sheep population. J Anim Sci. 2012; 90:3375–3384. pmid:23038744
  57. 57. Flint-Garcia SA, Thornsberry JM, Buckler ES IV. Structure of linkage disequilibrium in plants. Annu Rev Plant Biol. 2003; 54:357–374. pmid:14502995
  58. 58. Vos PG, Paulo MJ, Voorrips RE, Visser RGF, van Eck HJ, van Eeuwijk FA. Evaluation of LD decay and various LD‑decay estimators in simulated and SNP‑array data of tetraploid potato. Theor Appl Genet. 2017; 130:123–135. pmid:27699464
  59. 59. Meng Z, Zaykin DV, Xu C-F, Wagner M, Ehm MG. Selection of genetic markers for association analyses, using linkage disequilibrium and haplotypes. Am J Hum Genet. 2003; 73:115–130. pmid:12796855
  60. 60. Namanda S, Gibson R, Sindi K. Sweetpotato seed systems in Uganda, Tanzania and Rwanda. Journal of Sustainable Agriculture 2011; 35(8):870–884.
  61. 61. Labeyrie V, Deu M, Barnaud A, Calatayud C, Buiron M, Wambugu P, et al. Influence of ethnolinguistic diversity on the sorghum genetic patterns in subsistence farming systems in Eastern Kenya. PLoS One 2014; 9: e92178. pmid:24637745
  62. 62. Wossen T, Tessema G, Abdoulaye T, Rabbi I, Olanrewaju A, Alene A, et al. The cassava monitoring survey in Nigeria. Final report. IITA, Ibadan, Nigeria; 2017. 66p. ISBN 978-978-8444-81-7.