Construction, De-Novo Assembly and Analysis of Transcriptome for Identification of Reproduction-Related Genes and Pathways from Rohu, Labeo rohita (Hamilton)

  • Dinesh Kumar Sahu,

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Soumya Prasad Panda,

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Prem Kumar Meher,

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Paramananda Das,

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Padmanav Routray,

    Affiliation Aquaculture Production and Environment Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Jitendra Kumar Sundaray,

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Pallipuram Jayasankar,

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

  • Samiran Nandi

    eurekhain@yahoo.co.in

    Affiliation Fish Genetics and Biotechnology Division, ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar, Orissa, India

Abstract

Rohu is a leading candidate species for freshwater aquaculture in South-East Asia. Unlike common carp, rohu breeds only during the monsoon, which restricts its seed production beyond the season and indicates strong genetic control over spawning. Genetic information is limited in this regard, and the problem is exacerbated by the lack of genomic resources. We previously identified 182 reproduction-related genes by Sanger sequencing, which were too few to address the seasonal spawning behaviour of this important carp. Therefore, the present work was taken up to generate a transcriptome profile by mRNA-seq. About 16 GB of 72 bp paired-end (PE) data was generated from the pooled RNA of twelve tissues from pre-spawning rohu using the Illumina GA II platform. There were 64.97 million high-quality reads, which produced 62,283 contigs with velvet and 88,612 transcripts with oases. Gene ontology annotation identified 940 reproduction-related genes, consisting of 184 mainly associated with reproduction, 223 related to hormone activity and receptor binding, 178 related to receptor activity and 355 embryonic-development-related proteins. The important reproduction-relevant pathways found in KEGG analysis were GnRH signaling, oocyte meiosis, steroid biosynthesis, steroid hormone biosynthesis, progesterone-mediated oocyte maturation, retinol metabolism, neuroactive ligand-receptor interaction, neurotrophin signaling and phototransduction. Twenty-nine sequences containing simple sequence repeats were also found, of which 12 repeat loci were polymorphic, with mean expected and observed heterozygosity of 0.471 and 0.983, respectively. Quantitative RT-PCR analyses of 13 known and 6 unknown transcripts revealed differences in expression level between the preparatory and post-spawning phases. These transcriptomic sequences have significantly increased the genetic and genomic resources for reproduction research in Labeo rohita.

Introduction

The Indian major carp Labeo rohita (Hamilton), a cyprinid, is a leading candidate species for freshwater aquaculture not only in India but also in the whole sub-continent of South-East Asia, with an annual production of 1.5 million tonnes in 2012 [1]. Owing to its immense economic, ecological and cultural importance, it has received considerable research interest in different areas including culture [2], breeding [3], immunology [4], disease ecology [5], reproductive physiology [6] and nutrition [7]. Major problems in this species are its highly monsoon-dependent breeding habit [8] and its inability to breed in confined pond water without hormonal induction [9]. This restricts seed production beyond the breeding season, leading to suboptimal utilization of cultivable water area. As in other Indian teleosts, the reproductive cycle of L. rohita may be divided into four stages: preparatory period (February–April), pre-spawning period (May–June), spawning period (July–August) and post-spawning period (September–January) [10], and at each stage the gonads show discrete changes. Unprecedented summer temperatures and scanty, irregular and shifted monsoons in recent years have further complicated seed production by affecting one or more reproductive stages. However, common carp (Cyprinus carpio), which belongs to the same cyprinid group, breeds prolifically under similar environmental conditions. Attempts have been made to study the effects of environmental manipulations on gonadal development [11], as well as dietary manipulation [12], multiple induced breeding [13] and off-season breeding [14], in Indian major carps. Although literature on different aspects of reproduction and breeding is available [3], the genetic mechanism underlying gonad maturation and seasonal breeding in a tropical climate is not fully understood. 
Genetic studies performed in the past decade have focused on selective breeding [15], development of genetic markers [16], development of a linkage map [17] and collection of immune-related genes [4]. Reproduction in fish is centrally controlled by the brain, pituitary, gonad and liver (BPGL axis) [18], and multiple genes are involved in this process. Addressing issues such as gonad maturation, spawning and seed production under changing climatic conditions will require comprehensive information on a number of reproduction-related genes. In our earlier attempt, 182 reproduction-related genes were identified from 4,642 high-quality ESTs by Sanger sequencing [6], but this number was too small. High-throughput next-generation sequencing (NGS) technologies provide platforms to generate transcriptome sequences at much lower cost than the traditional Sanger method; thus the transcripts generated by NGS can boost genetic and genomic research in relatively less-studied species [19]. Until a reference genome sequence becomes available, transcriptome sequencing is a fast and efficient means for gene discovery and genetic marker development. Simple sequence repeat (SSR) markers are important resources for determining functional genetic variation; among the various molecular markers, SSRs are highly polymorphic [20] and serve as a rich resource of diversity. SSRs derived from expressed sequence tags (ESTs) have special advantages: they may be linked to known genes, and they offer higher transferability among related species, lower development cost and a higher proportion of high-quality markers [20]. 
Transcriptome resources for reproductive tissues are currently available for other commercially important fishes, including channel catfish (Ictalurus punctatus) [21], common carp (Cyprinus carpio) [19], zebrafish (Danio rerio) [22], rainbow trout (Oncorhynchus mykiss) [23,24], coho salmon (Oncorhynchus kisutch) [25], tilapia (Oreochromis mossambicus) [26], Atlantic halibut (Hippoglossus hippoglossus) [27], Senegalese sole (Solea senegalensis) [28], Atlantic salmon (Salmo salar) [29] and cod (Gadus morhua) [30], but less information is available for L. rohita [5,6].

Hence, this work was taken up with the primary objective of generating transcript sequences using mRNA-seq and providing a well-assembled transcriptome from the pooled RNA samples of brain, liver, intestine, kidney, tongue, nose, eye, gill, muscle, heart, ovary and testis tissues. The aims were to identify reproduction-relevant genes, to verify their association with reproduction by transcript expression pattern, and to develop EST-SSR markers within these transcripts of L. rohita, which may be utilized in future for genetic diversity analysis, gene mapping, marker-assisted breeding and the study of reproductive issues in this species.

Materials and Methods

Ethics Statement

This study was approved by the Animal Ethics Committee of ICAR-CIFA, Bhubaneswar. All the fishes (rohu, Labeo rohita) used in the experiments were handled according to the prescribed guidelines of the Institute.

Maintenance of Animals and tissue collection

The brood stock of rohu as well as common carp was reared in CIFA farm ponds under the carp breeding unit following standard procedures [3,31]. The stocking density was maintained at 1500 kg/ha with a 1:1 ratio of rohu and common carp (Lat.20°1'06"–20°11'45"N, Long.80°50'52"–85°51'35"E). Physico-chemical parameters of water were monitored routinely during the entire course of investigation. The water temperature, pH, D.O., total alkalinity and total hardness were 28–30°C, 7.5–8.5, 100–140 ppm, and 100–130 ppm, respectively. Adult males and females of L. rohita (800–1200 g) were collected during May–June (pre-spawning). The fish were euthanized with MS-222 at 300 mg/L before dissection. Brain, liver, intestine, kidney, tongue, nose, eye, gill, muscle, heart, ovary and testis tissues were collected from a minimum of five fish for each tissue, quickly frozen in liquid nitrogen and stored at –80°C until used for RNA extraction.

Illumina sequencing and quality controls

Total RNA was extracted from 12 different tissues (50–100 mg) following the guanidinium thiocyanate method [32] using TRIzol Reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. RNA samples were then treated with RNase-free DNase I (Qiagen) to remove potential genomic DNA. RNA integrity was initially checked on a 1% denaturing gel. RNA samples showing clear separation of 28S and 18S bands in the gel and a spectrophotometric (Varian Cary 50 Bio) A260/280 absorption ratio greater than 1.9 were taken for further work. RNA size distribution and integrity were further analyzed on a Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA), and samples with an RNA Integrity Number (RIN) above 7.0 were used for library preparation. One paired-end (PE) cDNA library was generated from the pooled total RNA (4 μg) of rohu tissues, combined in equal quantity, using the mRNA-Seq assay for transcriptome sequencing on the Illumina Genome Analyzer II platform. The library was constructed according to the Illumina TruSeq RNA library protocol outlined in the “TruSeq RNA Sample Preparation Guide” (Part # 15008136). The prepared library was quantified using Qubit (Invitrogen) and validated for quality by running an aliquot on a High Sensitivity chip (Agilent, Cat # 5067–4626) on the Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA). Sequencing was done in one lane to generate 72 bp PE reads. The raw sequences generated by the Illumina Genome Analyzer were processed with SeqQC-V2.1 (in-house tool kit of Genotypic Technology) for various quality controls, including filtering of high-quality reads based on the score value, removal of reads containing primer/adaptor sequences and trimming of read length.
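
As an illustration of score-based read filtering, the sketch below applies the criterion reported in the Results (more than 70% of the bases in a read with a Phred score above 20). It is not the SeqQC implementation, which is an in-house tool; the function names and the Phred+33 quality encoding are assumptions made for this example.

```python
def phred_scores(quality_string, offset=33):
    """Decode a FASTQ quality string into Phred scores.

    The ASCII offset of 33 is an assumption; older Illumina pipelines
    often wrote Phred+64, in which case pass offset=64.
    """
    return [ord(ch) - offset for ch in quality_string]


def is_high_quality(quality_string, min_phred=20, min_fraction=0.70, offset=33):
    """Return True if more than min_fraction of bases score above min_phred."""
    scores = phred_scores(quality_string, offset)
    if not scores:
        return False  # an empty read cannot pass the filter
    good = sum(1 for score in scores if score > min_phred)
    return good / len(scores) > min_fraction


if __name__ == "__main__":
    # 'I' encodes Phred 40 and '#' encodes Phred 2 under Phred+33.
    print(is_high_quality("IIIIIIII##"))  # 80% of bases above Q20
    print(is_high_quality("IIIII#####"))  # 50% of bases above Q20
```

In a paired-end setting one would typically keep a pair only when both mates pass, but the source does not state how pairs were handled.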

De novo assembly

All assemblies were performed on a server with a 48-core processor and 256 GB of random access memory. Two programs were used for de novo assembly of the PE sequence reads of rohu. The publicly available program velvet (version 0.7.62; http://www.ebi.ac.uk/~zerbino/velvet/), which was developed for assembly of short reads using a de Bruijn graph algorithm, was used to generate a non-redundant set of transcripts. Various assembly parameters were optimized for the best result. The trimmed high-quality sequence reads were assembled with velvet at different k-mer (hash) lengths, namely 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61 and 65, and output parameters such as the number of used reads, nodes, total number of contigs, contigs longer than 100 bp, N50 length, longest contig length and average contig length were compared as a function of k-mer. Assembly by velvet was followed by oases (version 0.1.8; http://www.ebi.ac.uk/~zerbino/oases/), which was also developed for de novo assembly of short reads; it takes the assembly generated by velvet as input and exploits the read sequence and pairing information to obtain better contigs/transcripts, particularly isoforms.
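
The comparison of assemblies across k-mer values can be sketched as follows. N50 is computed in the usual way: contigs are sorted from longest to shortest, and N50 is the length of the contig at which the running total first reaches half of the assembled bases. The helper names and the toy contig lengths are hypothetical; choosing the k-mer by highest N50 alone is a simplification of the multi-criteria comparison described above.

```python
def n50(lengths):
    """Length of the contig at which the cumulative sum reaches half the total."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return 0  # empty assembly


def summarize(lengths, min_len=100):
    """Summary statistics for contigs of at least min_len bases."""
    kept = [n for n in lengths if n >= min_len]
    return {
        "contigs": len(kept),
        "n50": n50(kept),
        "longest": max(kept, default=0),
        "mean": sum(kept) / len(kept) if kept else 0.0,
    }


def best_kmer(assemblies):
    """Pick the k-mer whose assembly has the highest N50 (ties go to the smaller k)."""
    return max(sorted(assemblies), key=lambda k: summarize(assemblies[k])["n50"])


if __name__ == "__main__":
    # Toy contig lengths per k-mer value, invented for illustration only.
    toy = {31: [120, 300, 450, 900], 37: [150, 800, 1300, 2000], 41: [100, 200, 1000]}
    for k in sorted(toy):
        print(k, summarize(toy[k]))
    print("best k:", best_kmer(toy))
```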

Similarity search and functional annotation

Putative function of the velvet assembled contigs was deduced by using them as queries against the UniProtKB/SwissProt database and the non-redundant (nr) protein database in the BLASTX program. The cut off E-value was set at 1e-6 and only the top gene-ID and name were initially assigned to each contig. Gene ontology (GO) annotation analysis was performed in Blast2GO (http://www.blast2go.org/) version 2.5.0 for the assignment of gene ontology terms. The nr BLAST result was imported to Blast2GO. The final annotation file was produced after gene-ID mapping, GO term assignment, annotation augmentation and generic GO-slim process. The annotation result was categorized with respect to Biological Process, Molecular Function, and Cellular Component at level 2. Pathway analyses of unique sequences were carried out based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) database using the online (http://www.genome.jp/tools/kaas/) KEGG Automatic Annotation Server (KAAS) by using Bi-directional Best Hit (BBH) method. Enzyme commission (EC) numbers were obtained and used to putatively map protein sequences to a specific biochemical pathway.
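
The top-hit assignment step can be illustrated with a small parser for BLAST tabular output. The sketch assumes the standard 12-column tabular layout (query, subject, percent identity, alignment length, mismatches, gap opens, query start/end, subject start/end, E-value, bit score), treats the 1e-6 cut-off as inclusive and breaks ties by bit score; none of these details is specified in the text, and the function name is hypothetical.

```python
def top_hits(tabular_lines, max_evalue=1e-6):
    """Return {query: (subject, evalue, bitscore)} keeping the best hit per query.

    Assumes 12 tab-separated columns with the E-value in column 11 and
    the bit score in column 12. Hits above max_evalue are discarded.
    """
    best = {}
    for line in tabular_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.rstrip("\n").split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if evalue > max_evalue:
            continue
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, evalue, bitscore)
    return best


if __name__ == "__main__":
    # Invented example rows; identifiers are placeholders.
    rows = [
        "contig1\tP1\t90.0\t100\t10\t0\t1\t300\t1\t100\t1e-30\t200",
        "contig1\tP2\t95.0\t100\t5\t0\t1\t300\t1\t100\t1e-40\t250",
        "contig2\tP3\t40.0\t50\t30\t0\t1\t150\t1\t50\t0.01\t30",
    ]
    print(top_hits(rows))
```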

Mapping and assessment of sequence reads onto rohu transcripts

To assess transcript and exon distribution in the genome, all transcripts of the oases assembly were mapped to the complete genome of zebrafish (zv9), and all transcripts with significant hits were plotted by zebrafish chromosome number. For this, putative non-redundant rohu transcripts were aligned to the zebrafish reference genome (zv9) using the TopHat alignment program, then compared and mapped in a digital gene expression file using the Cufflinks program. In addition, the coverage of each transcript was determined as the number of fragments per kilobase of exon per million fragments mapped (FPKM). Further, to assess transcript distribution in the genomes of other species, the non-redundant transcript data set of the oases assembly was subjected to BLAST against the mRNA databases of zebrafish (Danio rerio), salmon (Salmo salar) and catfish (Ictalurus punctatus) separately at NCBI. Transcripts with more than 70% identity and 50% query coverage (mRNA) were analysed further to obtain the mRNA-annotated transcripts, the mRNA-overlapping transcripts and the list of matching mRNAs in each species.
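
For reference, the FPKM measure named above follows directly from its definition: the fragment count of a transcript is divided by the transcript length in kilobases and by the total number of mapped fragments in millions. The sketch below shows only this arithmetic; Cufflinks estimates fragment counts with a statistical model that is not reproduced here, and the function name and example numbers are hypothetical.

```python
def fpkm(fragments, length_bp, total_mapped_fragments):
    """Fragments per kilobase of exon per million fragments mapped."""
    if length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and total mapped fragments must be positive")
    # fragments / (length_bp / 1e3) / (total_mapped_fragments / 1e6)
    return fragments * 1e9 / (length_bp * total_mapped_fragments)


if __name__ == "__main__":
    # Invented example: 500 fragments on a 2 kb transcript, 10 million mapped fragments.
    print(fpkm(500, 2000, 10_000_000))
```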

Identification of orthologous genes involved in reproduction

To identify the genes and transcripts that play important role in reproduction process, the transcripts associated with GO terms under reproduction (GO: 0000003), hormone activity (GO: 0005179), receptor binding (GO: 0005102), receptor activity (GO: 0004872) and embryonic development (GO: 0009792) were selected.

Identification of EST-SSRs, SNPs and validation of microsatellite containing transcripts

To identify all simple sequence repeats in the assembled transcriptome of L. rohita, the Perl script MISA (http://pgrc.ipk-gatersleben.de/misa/) was used. Mono-nucleotide repeats (more than 10 times), di-nucleotide repeats (more than 6 times) and tri-, tetra- and penta-nucleotide repeats (more than 5 times) were used as search criteria in the MISA script. The maximum number of bases interrupting two SSRs in a compound microsatellite was set to 100. For the analysis of microsatellite polymorphism, DNA from 3 parents belonging to two linkage mapping panels of rohu [33] and 2 individuals (one each) from lines of rohu resistant and susceptible to aeromoniasis [5] was used to test the gene loci for diversity. Number of alleles, observed heterozygosity (HO), expected heterozygosity (HE) and polymorphic information content (PIC) were estimated using CERVUS software ver 3.0 (http://www.fieldgenetics.com/pages/home.jsp). The difference between observed and expected heterozygosity was tested for significant deviation with a chi-square test in SAS version 9.2. Further, to study the presence of SSRs in other species, the corresponding gene sequences from zebrafish and common carp were downloaded from the databases and searched for SSRs as described for rohu. To identify SNPs in rohu contigs, reads were mapped to the complete genome of zebrafish (zv9) using Bowtie 0.12.7, and variations were detected with SAMtools 0.1.7 with maximum variant count ≥ 10.
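
The perfect-repeat search can be illustrated with regular expressions. This is a simplified stand-in for MISA, not a reimplementation: it ignores compound microsatellites, and it reads the thresholds above as minimum repeat counts (10 for mono-, 6 for di-, 5 for tri- to penta-nucleotide units), which is an assumption about how "more than" was applied. All names are hypothetical.

```python
import re

# Minimum number of repeats per unit size (assumed reading of the search criteria).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5}


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. 'AA', 'ACAC')."""
    for size in range(1, len(motif)):
        if len(motif) % size == 0 and motif[:size] * (len(motif) // size) == motif:
            return False
    return True


def find_ssrs(sequence, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    sequence = sequence.upper()
    hits = []
    for unit, minimum in min_repeats.items():
        # A unit of the given size followed by at least (minimum - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit, minimum - 1))
        for match in pattern.finditer(sequence):
            motif = match.group(1)
            if is_primitive(motif):
                hits.append((match.start(), motif, len(match.group(0)) // unit))
    return sorted(hits)


if __name__ == "__main__":
    # Invented sequence with a poly-A run and an (AC)7 repeat.
    demo = "GGC" + "A" * 12 + "TTG" + "AC" * 7 + "GT"
    print(find_ssrs(demo))
```

Filtering out non-primitive motifs keeps a run such as AAAAAAAAAAAA from being reported a second time as an (AA)6 di-nucleotide repeat.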

Putative ORF sequences searching

Unidentified sequences (no match found in BLASTX and BLASTN) among unique sequences were analyzed by star-orf software (http://web.mit.edu/star/orf/runapp.html) using parameter of 80bp minimal ORF length to search for putative ORF proteins, which could be used to distinguish between coding and non-coding sequences. Once the start codon, coding sequences, stop codon and poly (A) tail were identified, the cDNA sequences were considered a full-length cDNA, all the possible frames were searched against the protein database using BLASTP tool of NCBI.
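
A minimal open reading frame scan with the 80 bp length threshold might look like the sketch below. It is not the star-orf algorithm: it simply reports ATG-to-stop stretches in the three forward frames and the three frames of the reverse complement, using the standard stop codons. Coordinates on the minus strand refer to the reverse-complemented sequence, and all names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def orfs_in_strand(seq, min_len=80):
    """Return (start, end) of ATG..stop ORFs of at least min_len bases, end exclusive."""
    found = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if i + 3 - start >= min_len:
                    found.append((start, i + 3))
                start = None  # look for the next ORF in this frame
    return found


def find_orfs(seq, min_len=80):
    """Scan both strands; returns (strand, start, end) tuples."""
    seq = seq.upper()
    forward = [("+", s, e) for s, e in orfs_in_strand(seq, min_len)]
    reverse = [("-", s, e) for s, e in orfs_in_strand(reverse_complement(seq), min_len)]
    return forward + reverse


if __name__ == "__main__":
    # Invented 100 bp sequence containing one 96 bp ORF on the forward strand.
    demo = "CC" + "ATG" + "GCT" * 30 + "TAA" + "GG"
    print(find_orfs(demo))
```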

Expression analysis of selected orthologous gene using quantitative real-time PCR during preparatory and post-spawning phases

For the quantitative real-time study, common carp and rohu were collected from the same earthen pond with the same aquatic environment, as described in the animal maintenance section. Relative expression levels of 19 transcripts (13 known and 6 unknown genes with putative ORFs) were measured by real-time PCR in brain, liver, pituitary, ovary and testis tissues collected from 15 individuals each during initiation of gonad maturation (preparatory phase) and the resting phase (post-spawning), using β-actin from rohu as a reference gene (GenBank accession no. EU184877). Similarly, brain, liver, pituitary and ovary from common carp (Cyprinus carpio) were collected from 15 individuals during the preparatory phase for comparison with rohu. Equal quantities of tissue from each individual were pooled into three sets (5 fish in each set), and RNA was extracted from each pooled tissue following the guanidinium thiocyanate method [32] using TRIzol Reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. First-strand cDNA was synthesized using M-MLV reverse transcriptase (Finnzymes, Vantaa, Finland). Transcript-specific primers (S1 Table) were validated by conventional PCR, and band intensities for different tissues were observed by agarose gel electrophoresis with β-actin as control (data not shown). Real-time PCR amplifications were carried out on a LightCycler 480 (Roche, Germany) with a no-template negative control. Real-time PCR was repeated twice for each tissue of each sample. The crossing point (Cp) values were acquired for both the target and reference genes using LightCycler 480 software version LCS480 1.5.0.39 (Roche, Germany). The relative level of each transcript in each tissue was calculated by normalizing its value to the corresponding reference and compared among tissues using the Cp values for brain cDNA as positive calibrator [34]. 
Comparison of the relative expression level of each transcript between the two reproductive phases in individual tissues, as well as between the two species, was performed in REST 2009 software, and the whisker-box plots were generated with 2000 iterations (http://www.REST.de.com).
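
The normalization of a target transcript against the reference gene can be sketched with the efficiency-corrected ratio commonly used for Cp data. The sketch assumes an amplification efficiency of 2.0 (perfect doubling per cycle) for both genes because primer efficiencies are not reported here, and it does not reproduce the randomisation test that REST uses to derive p-values. Function and parameter names are hypothetical.

```python
def relative_expression(cp_target_calibrator, cp_target_sample,
                        cp_ref_calibrator, cp_ref_sample,
                        eff_target=2.0, eff_ref=2.0):
    """Efficiency-corrected expression ratio of sample versus calibrator.

    ratio = E_target ** (Cp_calibrator - Cp_sample for the target)
            / E_ref ** (Cp_calibrator - Cp_sample for the reference)

    Efficiencies of 2.0 are an assumption; measured values should be
    substituted when available.
    """
    target_change = eff_target ** (cp_target_calibrator - cp_target_sample)
    reference_change = eff_ref ** (cp_ref_calibrator - cp_ref_sample)
    return target_change / reference_change


if __name__ == "__main__":
    # Invented Cp values: the target crosses 2 cycles earlier in the sample
    # while the reference gene is unchanged.
    print(relative_expression(25.0, 23.0, 18.0, 18.0))
```

A ratio above 1 indicates higher expression in the sample than in the calibrator, and a ratio below 1 indicates lower expression.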

Results

De novo assembly

A total of 74,725,656 PE (37,362,828 from each end) raw sequence reads, each 72 bp long, were generated using the Illumina Genome Analyzer II, encompassing about 16 GB of sequence data in fastq format. The raw reads have been deposited in the NCBI SRA database (accession number: SRA051586). Filtering out low-quality reads at higher stringency and reads containing primer/adaptor sequence left a total of 64,971,614 (87%) high-quality sequence reads (more than 70% of the bases in a read with a Phred score above 20). The final data set of 64.97 million high-quality reads was used for optimization of de novo assembly and analysis of the rohu transcriptome (Table 1). The N50 length of the contigs generated using the velvet assembly program varied from 454 to 1309 bp across the k-mer values (31–65) (Fig 1). Of these, the best k-mer value was 37, as it gave the highest N50 length of 1309 bp, the longest contig of 16,961 bp and the largest average contig length of 709 bp. The assembly resulted in a total of 62,283 contigs, including 8 contigs of at least 10 kb, 13,707 of at least 1 kb and 26,145 of at least 500 bp, with a minimum contig length of 100 bp (Table 2). The proportion of reads used for the assembly was also highest (69.61%) at a k-mer value of 37. The contigs generated by velvet at k-mer 37 were then used as input for oases with default parameters. This resulted in a total of 88,612 transcripts, compared with the 62,283 contigs from velvet.

Fig 1. Comparison of number of contigs, N50 length, average contigs length, at different k-mer value.

https://doi.org/10.1371/journal.pone.0132450.g001

Functional annotation and identification of reproduction-related genes

A total of 31,637 contigs had significant BLASTX hit corresponding to 17,925 unique protein accessions in the nr protein database. Gene ontology (GO) analysis of these 17,925 unique proteins resulted in a total of 78,317 annotations/GO terms including 33,343 (42.57%) biological process terms, 23,479 (29.98%) molecular function terms and 21,495 (27.44%) cellular component terms (Fig 2). Among the biological process category 7,841 and 7,375 genes were related to metabolic (GO: 0008152) and cellular processes (GO: 0009987) respectively; a significant numbers of genes were also identified from development process (2,534) and growth (248) sub-categories. Similarly under molecular function category, 11,798 genes were involved in the binding process (GO: 0005488) and 6,784 genes in the catalytic activity (GO: 0003824); whereas under the cellular component category, 11,533 genes from cell (GO: 0005623), and 6,500 genes corresponded to organelle (GO: 0043226) were the most represented categories. A total of 940 reproduction relevant genes were identified (Fig 3), among which 184 were mainly associated with reproduction related proteins (GO: 0000003), 223 related to hormone activity (GO: 0005179) and receptor binding related proteins (GO: 0005102), 178 receptor activity related proteins (GO: 0004872) and 355 embryonic development related proteins (GO: 0009792). Details of 940 reproduction related gene orthologues are given in S2 Table.

Fig 2. Percentages of annotated Labeo rohita sequences assigned with GO terms according to level 2 categories.

GO-terms were processed by Blast2Go and categorized at level 2 under three main categories. Each of the three GO categories is presented including (left to right): biological process, molecular function and cellular component.

https://doi.org/10.1371/journal.pone.0132450.g002

Fig 3. Distribution of reproduction-relevant transcripts identified in Labeo rohita.

A total of 940 reproduction relevant genes were distributed among 184 reproduction related proteins (GO: 0000003), 223 related to hormone activity (GO: 0005179) and receptor binding related proteins (GO: 0005102), 178 receptor activity related proteins (GO: 0004872) and 355 embryonic development related proteins (GO: 0009792).

https://doi.org/10.1371/journal.pone.0132450.g003

Mapping of rohu transcripts with zebra fish, salmon, and catfish

A total of 81,622 exons were found among 31,091 rohu transcripts, and these exons were distributed across all 25 chromosomes of zebrafish. Chromosomes 5 and 23 showed the maximum number of matches with rohu transcripts (1998 and 1487, respectively) and exons (5166 and 4267, respectively) (Fig 4). When the 88,612 oases-based rohu transcripts were compared with zebrafish, salmon and catfish mRNA, 31,091, 4,894 and 1,071 of them, respectively, matched annotated transcripts (Table 3). Interestingly, 794 transcripts were common to all three species (S1 Fig).

Fig 4. Distribution of rohu transcripts and exons on zebra fish chromosome.

Rohu transcripts were found distributed in all 25 chromosomes of zebrafish.

https://doi.org/10.1371/journal.pone.0132450.g004

Table 3. The oases contigs/transcripts blasted against the Salmon, Zebrafish, and Catfish mRNA sequences (from NCBI).

Contigs with more than 70% identity and 50% query coverage (mRNA) as cut-offs were taken for analysis.

https://doi.org/10.1371/journal.pone.0132450.t003

KEGG-pathway analysis for identification of reproduction related pathways

KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analysis was performed on all assembled contigs as alternative approach for functional categorization and annotation. Several KEGG pathways were represented by more than 200 unique transcripts. Enzyme commission (EC) numbers were assigned with 5,938 enzyme codes for 8,683 unique sequences (Table 4). Briefly, 2,269 (26.13%) were classified into the metabolism, 1,365 (15.72%) sequences grouped into the Genetic information processing (GIP), 1,188 (13.68%) unique sequences come under Environmental information processing (EIP), 1,283 (14.77%) unique sequences under cellular processes and finally 2,578 (29.69%) sequences under organismal systems had match with KEGG annotation (Table 4). Among reproduction-relevant pathways, GnRH signalling pathway (Fig 5) (66 unique transcript sequences coding for 40 genes of rohu out of 121 genes), oocyte meiosis (out of 137 genes 86 unique transcript sequences coding for 53 genes), steroid biosynthesis (16 unique transcript sequences coding for 14 enzymes of rohu out of 28 genes), steroid hormone biosynthesis (18 unique transcript sequences coding for 15 enzymes of rohu out of 38 genes), progesterone-mediated oocyte maturation (72 unique transcript sequences coding for 46 genes of rohu out of 107 genes), retinol metabolism (21 unique transcript sequences coding for 17 enzymes of rohu out of 38 genes), neuroactive ligand-receptor interaction (70 unique transcript sequences coding for 63 genes), neurotrophin signaling pathway (98 unique transcript sequences coding for 67 genes) and phototransduction (19 unique transcript sequences coding for 13 enzymes) were the major pathways found.

Fig 5. GnRH Pathway in rohu: 66 transcripts coding for 40 genes were mapped in rohu, out of 121 genes of GnRH pathway (KEGG databases).

https://doi.org/10.1371/journal.pone.0132450.g005

Identification of EST-SSRs and SNPs

A total of 22,383 EST-SSRs were identified in 17,244 transcripts of rohu, with a frequency of one SSR per 3.41 kb of sequence (Table 5). Mono-nucleotide repeats represented the largest fraction (60.33%) of the SSRs identified, followed by di-nucleotide (19.21%) and tri-nucleotide (9.83%) repeats. Only a small number of tetra- (390), penta- (70) and hexa-nucleotide (23) repeats were identified in rohu transcripts (Table 5 and S2 Fig). In total, there were 1893 compound repeats. Screening of microsatellites in known reproduction-relevant genes revealed 29 different microsatellite-containing sequences with different repeat motifs (Table 6). From these 29 sequences, 51 primers were designed from the flanking regions, of which 38 were selected for testing and 32 (84%) showed PCR amplification. When these 32 loci were checked further in the mapping panel parents, twenty loci were found to be monomorphic and twelve polymorphic. The genetic diversity measures from CERVUS indicated mean expected and observed heterozygosity of 0.471 and 0.983, respectively. Mean polymorphic information content was 0.35 (Table 7). The observed heterozygosity was lower than expected for the gene loci SP02, SP04 and SP12, while it was higher than expected for the gene loci SP08, SP09, SP14, SP15, SP20 and SP21. The difference between mean expected and mean observed heterozygosity was not significant (P>0.05), although observed and expected heterozygosity differed at the individual locus level (Table 7). No linkage was detected among the loci. Corresponding to the 29 microsatellite-containing sequences in rohu, 26 gene sequences were found for zebrafish, with higher identity (74%–98%) at the nucleotide level, compared with only 3 sequences found for common carp with lower identity (10%–91%). 
Microsatellites could be observed in six zebrafish sequences but not in the common carp sequences (S3 Table). A total of 52925 SNPs were identified in 19402 transcripts of rohu (Table 8), out of which 1827 were homozygous and 51098 heterozygous (S4 Table).
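
The per-locus diversity measures reported above can be illustrated with their textbook definitions: observed heterozygosity is the fraction of heterozygous genotypes, expected heterozygosity is one minus the sum of squared allele frequencies, and PIC subtracts a further term for each pair of alleles. CERVUS may apply a small-sample correction to expected heterozygosity that is not included here, so this is a sketch rather than a reproduction of its output. The genotypes and names are invented for illustration.

```python
from collections import Counter
from itertools import combinations


def locus_stats(genotypes):
    """Diversity measures for one locus from diploid genotypes (allele pairs).

    He  = 1 - sum(p_i^2)                      (uncorrected form; an assumption)
    PIC = He - sum over allele pairs of 2 * p_i^2 * p_j^2
    """
    n = len(genotypes)
    if n == 0:
        raise ValueError("at least one genotype is required")
    counts = Counter(allele for pair in genotypes for allele in pair)
    freqs = [count / (2 * n) for count in counts.values()]
    ho = sum(1 for a, b in genotypes if a != b) / n
    he = 1.0 - sum(p * p for p in freqs)
    pic = he - sum(2 * (p * q) ** 2 for p, q in combinations(freqs, 2))
    return {"alleles": len(counts), "Ho": ho, "He": he, "PIC": pic}


if __name__ == "__main__":
    # Invented panel: four individuals, all heterozygous for alleles 150 and 154.
    print(locus_stats([(150, 154)] * 4))
```

In this toy panel every individual is heterozygous, so observed heterozygosity (1.0) exceeds the expected value (0.5), which shows how the two measures can diverge when only a few individuals are genotyped.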

Table 5. Statistics of microsatellite search in Labeo rohita.

https://doi.org/10.1371/journal.pone.0132450.t005

Table 6. List of microsatellite-containing reproduction-relevant transcripts identified in Labeo rohita.

https://doi.org/10.1371/journal.pone.0132450.t006

Table 7. Microsatellite locus, repeat motif type, PCR product size, amplification temperature, number of alleles, observed heterozygosity (HO), expected heterozygosity (HE) and polymorphic information content (PIC) of 12 rohu microsatellite loci.

https://doi.org/10.1371/journal.pone.0132450.t007

Expression analysis of reproduction-relevant transcripts during preparatory and post-spawning phases

Real-time RT-PCR was performed for 19 unigenes, including 13 known function categories genes (estrogen receptor binding site associated antigen 9 variant 1, vitellogenin receptor, insulin receptor b, fibrinogen gamma chain, green sensitive cone opsin, steroid receptor homolog SVP-46, spermatogenic glyceraldehydes 3-phosphate dehydrogenase, semaphorin3fa, follistatin like-2, cathepsin-Z, 11-beta-hydroxysteroid dehydrogenase, prolactin and activin receptor) (Fig 6) and 6 unknown transcripts (node-19676, node-20067, node-20271, node-6976, node-7314 and node-19294) with putative ORFs (Fig 7). Results showed clear differences in the level of expression in the same tissue between the two phases of reproduction (i.e. preparatory and post-spawning phase) in rohu. Relative expression analysis showed that expression ratio of estrogen receptor binding site associated antigen 9 variant 1 and follistatin like-2 were significantly (p<0.001 and p<0.038, respectively) up regulated only in ovary (Fig 6C), while estrogen receptor binding site associated antigen 9 variant 1 and vitellogenin receptor were up-regulated (p<0.001 and P<0.012, respectively) in testis (Fig 6D), during preparatory phase as compared to respective tissue levels in post-spawning phase. 
On the other hand vitellogenin receptor, fibrinogen gamma chain, green sensitive cone opsin, steroid receptor homolog SVP-46, spermatogenic glyceraldehydes 3-phosphate dehydrogenase, semaphorin3fa, follistatin like-2, cathepsin-Z, 11-beta-hydroxysteroid dehydrogenase, prolactin and activin receptor in brain (p<0.036 each) (Fig 6A), estrogen receptor binding site associated antigen 9 variant 1, vitellogenin receptor, insulin receptor b, green sensitive cone opsin, steroid receptor homolog SVP-46, spermatogenic glyceraldehydes 3-phosphate dehydrogenase, semaphorin3fa, follistatin like-2, cathepsin-Z, 11-beta-hydroxysteroid dehydrogenase, prolactin and activin receptor in liver (p<0.001 each) (Fig 6B), steroid receptor homolog SVP-46 in ovary (p<0.038) (Fig 6C), fibrinogen gamma chain, green sensitive cone opsin, steroid receptor homolog SVP-46 and prolactin in testis (p<0.048, p<0.001, p<0.048 and p<0.001, respectively) (Fig 6D), and estrogen receptor binding site associated antigen 9 variant 1, spermatogenic glyceraldehydes 3-phosphate dehydrogenase, semaphorin3fa, cathepsin-Z and activin receptor in pituitary (p<0.001, p<0.001, p<0.040, p<0.001 and p<0.042, respectively) (Fig 6E) were statistically down-regulated in preparatory phase in comparison to post-spawning phase.

Fig 6. A,B,C,D,E.

Comparison of relative expression ratio of 13 known transcripts in different tissues between post-spawning and preparatory phase using beta actin as reference (whisker-box plots), where the 13 transcripts are, 1 = Estrogen receptor binding site associated antigen 9 variant 1, 2 = Vitellogenin receptor, 3 = Insulin receptor b, 4 = Fibrinogen gamma chain, 5 = Green sensitive cone opsin, 6 = Steroid receptor homolog svp 46, 7 = Spermatogenic glyceraldehyde-3-phosphate dehydrogenase, 8 = Semaphorin 3fa, 9 = Follistatin-like 2, 10 = Cathepsin-Z, 11 = 11-beta-hydroxysteroid dehydrogenase, 12 = Prolactin and 13 = Activin receptor.

https://doi.org/10.1371/journal.pone.0132450.g006

Fig 7. A,B,C,D,E.

Comparison of relative expression ratio of 6 unknown transcripts in different tissues between post-spawning and preparatory phase using beta actin as reference (whisker-box plots), where the 6 unknown transcripts are 1 = Node 19676, 2 = Node 20067, 3 = Node 20271, 4 = Node 6976, 5 = Node 7314 and 6 = Node 19294.

https://doi.org/10.1371/journal.pone.0132450.g007

Expression ratio analysis of the 6 unknown putative transcripts (Fig 7) showed that node-19676, node-20067 and node-20271 were significantly up-regulated in testes (p<0.001, p<0.001 and p<0.030, respectively) (Fig 7A). Almost all the unknown transcripts were down-regulated in brain (p<0.001) and in liver (p<0.001, 0.034, 0.031, 0.034, 0.043 and 0.001, respectively) (Fig 7B), while node-19676, node-20067, node-20271, node-7314 and node-19294 in ovary (p<0.001, p<0.001, p<0.001, p<0.001 and p<0.039, respectively) (Fig 7C), and node-20067 and node-20271 in pituitary (p<0.034 and p<0.001, respectively) (Fig 7E), were down-regulated during the preparatory phase, with levels statistically different from the post-spawning phase.

An expression study of these unknown transcripts in similar tissues of a prolific breeder, common carp, confirmed that this species also possesses similar sequences. Expression levels were compared between common carp and rohu (S3 Fig) in the preparatory phase. The results showed that transcript levels of node-19676, node-20067, node-20271, node-6976, node-7314 and node-19294 in brain (p<0.001, p<0.001, p<0.001, p<0.001, p<0.001 and p<0.050, respectively), node-19676, node-20067, node-6976, node-7314 and node-19294 in liver (p<0.001, p<0.001, p<0.001, p<0.001 and p<0.023, respectively), node-19676, node-20067 and node-6976 in ovary (p<0.001, p<0.001 and p<0.034, respectively) and node-20067, node-20271 and node-6976 in pituitary (p<0.001, p<0.001 and p<0.001, respectively) were up-regulated in rohu as compared to the respective tissues in common carp. On the other hand, the node-7314 level was significantly higher (p<0.001) in common carp pituitary than in rohu pituitary.

Discussion

Sequencing, assembly and mapping of Labeo rohita transcriptome

Sequencing and characterization of the transcriptome of non-model species using RNA-seq is one of the most important applications of NGS technologies [35]. The de novo assembly of short reads without a known reference is considered difficult [36]. In the present study, assemblies at different k-mer values suggested that the number of contigs decreases as k-mer length increases. The best k-mer value was 37, as it gave the highest N50 length of 1309 bp, with a largest contig length of 16961 bp and an average contig length of 709 bp (Fig 1). Similar findings were reported for the de novo assembly of the chickpea transcriptome [36]. The 62,283 contigs with an average length of 709 bp, as compared to 40,596 contigs with an average length of 308 bp reported earlier from rohu [5], suggest better read quality in the present study. Out of the 62,283 contigs, 50.79% (31,637) gave a significant BLASTX hit, corresponding to 17,925 unique protein accessions in the nr protein database, whereas 47.4% of contigs were annotated in the previous rohu study [5]. About 50% of the contigs did not match any database entry; these are less likely to cover coding regions, or may represent novel sequences [37]. The BLASTX top-hit species distribution of the gene annotations showed that L. rohita had the highest homology with Danio rerio, followed by Oreochromis niloticus and Cyprinus carpio, and lesser homology with Salmo salar, Ctenopharyngodon idella, Carassius auratus and Ictalurus punctatus (data not shown). These results probably indicate that the gene content of L. rohita is more similar to, and more conserved with, that of Danio rerio and Cyprinus carpio than that of Salmo salar and Ictalurus punctatus. 
However, the higher homology of L. rohita (Cyprinidae) sequences with Oreochromis niloticus (Cichlidae) than with Ctenopharyngodon idella and Carassius auratus (Cyprinidae) may be explained by the smaller number of genes currently available in the NCBI database for the latter species, as was also reported in rainbow trout [37]. BLAST analysis of rohu transcripts against the mRNA databases showed the largest number of matches with zebrafish (31,091), followed by salmon (4,894) and catfish (1,071), which further indicated a closer relation of rohu with zebrafish (S1 Fig). In common carp, 27,693 contigs were found to match significantly with RefSeq proteins of zebrafish [22]. Based on these results, rohu transcripts were mapped chromosome-wise to the complete genome of zebrafish (zv9). Rohu transcripts were distributed across all 25 chromosomes of zebrafish (Fig 4), which is consistent with the presence of 25 chromosomes in rohu as well [38]. Among them, chromosomes 5 and 23 of zebrafish (zv9) showed maximum similarity with rohu transcripts, whereas chromosomes 7 and 5 showed maximum similarity in the case of common carp transcripts [19].
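The N50 statistic used above to compare assemblies at different k-mer values can be illustrated with a short sketch. It assumes the common definition (the length of the contig at which the running total, taken from the longest contig downwards, first reaches half of the total assembled length); the contig lengths are made-up toy values, not data from this study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length
    of the contig at which the cumulative length first reaches half of
    the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no contig lengths given")


# Toy contig lengths in bp (illustrative only).
toy_contigs = [100, 200, 300, 400, 500]
print(n50(toy_contigs))  # 500 + 400 = 900 >= 750, so this prints 400
```

Because N50 weights long contigs more heavily than the plain average does, it is a useful companion to the average contig length when choosing among k-mer values.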

Out of the 17,925 gene orthologues found, a total of 940 important reproduction-relevant genes, under the categories reproduction, hormone activity, receptor binding, receptor activity and embryonic development, were analyzed and are reported for the first time in L. rohita from the pre-spawning phase. These transcripts are important because ovarian recrudescence, and the responsiveness of the gonad to stimulation of both its steroidogenic and gametogenic functions, normally occur during the pre-spawning period [10]. Comparable gene sets have been reported in other species: 2852 genes involved in maturation and development of the ovary in rainbow trout [21], 474 genes from the ovary of tilapia [23], 2341 genes related to egg quality parameters in Atlantic halibut [25], 1200 genes from rainbow trout and clawed toad during late oogenesis [21], and 275 genes from coho salmon ovary during primary and early secondary oocyte growth [23].

Analysis of reproduction relevant pathways in Labeo rohita

Transcriptome studies help in gene discovery and provide novel insight into species-specific biological processes and pathways [36]. The 8,683 sequences (Table 4), along with 328 different enzymes/orthologues representing nine important reproduction-relevant pathways obtained in the KEGG analysis, may serve as valuable resources for future gene identification and functional analysis, as well as for the development of microarrays for reproduction research in this species. Among the pathways mapped, GnRH signaling is the major reproduction-relevant pathway identified. Reproduction in fishes depends on the coordinated actions of various hormones, among which gonadotropin-releasing hormone (GnRH) acts as the master regulator via the hypothalamic-pituitary-gonadal (HPG) axis [39]. Study of the GnRH signaling pathway in this species will therefore be of key interest in the context of its seasonal (monsoon) breeding under a tropical climate, in comparison to a prolific breeder like zebrafish [40]. Out of the 121 genes listed for this pathway in zebrafish, about 40 (roughly one third) were captured in L. rohita, which has not been reported earlier.

Generally, immature oocytes develop into fertilizable eggs through meiotic maturation induced by specific hormones [41]. While progesterone-mediated oocyte maturation is the most studied pathway in Xenopus, maturation can also be induced by other steroid hormones, and the pathway may differ from animal to animal [42]. In the present study, 46 out of 107 genes of the progesterone-mediated oocyte maturation pathway and 53 out of 137 genes of the oocyte meiosis pathway were mapped in rohu.

Steroid biosynthesis and steroid hormone biosynthesis pathways are important for fish reproduction. The steroid hormones are all derived from cholesterol [43]. Numerous organs are known to have the capacity to synthesize biologically active steroids, including the adrenal gland, testis, ovary, brain, placenta, and adipose tissue. Of the zebrafish pathway genes, 14 out of 28 from the steroid biosynthesis pathway and 15 out of 38 from the steroid hormone biosynthesis pathway are reported here for the first time in rohu, and should be valuable for future studies of reproductive biology.

In the retinol metabolism pathway, retinal is the predominant retinoid in eggs and oocytes of marine fish as well as of some freshwater fish species, and may constitute almost the entire pool of retinoids in these eggs [44]. The cellular pathways for retinoid metabolism are mainly known from studies in mammalian species. These pathways are highly conserved and principally identical among all classes of vertebrates, probably also in fish [44]. Out of 38 genes from the retinol metabolism pathway of zebrafish, 17 genes were observed in L. rohita.

Similarly neurotrophins are growth factors implicated in the development and maintenance of different neuronal populations in the nervous system [45]. No neurotrophin signaling pathways are reported for zebrafish in KEGG database; however, in the present study, 67 genes involved in neurotrophin signaling pathways for rohu were found.

Photoperiodic manipulation has emerged as an effective tool of reproductive management in culture fisheries, and understanding the physiology of photoperiodic regulation of fish reproduction has become a priority research topic in different countries [11]. Study of the photo-transduction pathway in this species is therefore of considerable value. Recently, significant advancement of gonad maturation in Indian major carp species (rohu, catla and mrigal) was achieved through photothermal manipulation [14]. For the photo-transduction pathway, 13 out of 38 genes in the KEGG pathway database were mapped in rohu.
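The pathway-wise gene counts given in this section can be put on a common scale by expressing each as a percentage of the reference pathway size. The sketch below simply restates the counts quoted above (the GnRH figure is the approximate "about 40" given in the text); the neurotrophin pathway is omitted because no zebrafish reference count is available for it. The dictionary and function names are illustrative.

```python
# (genes mapped in rohu, genes in the reference KEGG pathway), as quoted in the text.
pathway_counts = {
    "GnRH signaling": (40, 121),  # "about 40" in the text
    "Progesterone-mediated oocyte maturation": (46, 107),
    "Oocyte meiosis": (53, 137),
    "Steroid biosynthesis": (14, 28),
    "Steroid hormone biosynthesis": (15, 38),
    "Retinol metabolism": (17, 38),
    "Photo-transduction": (13, 38),
}


def coverage_percent(mapped, total):
    """Percentage of reference pathway genes that were mapped."""
    return 100.0 * mapped / total


# Print pathways from highest to lowest coverage.
for name, (mapped, total) in sorted(
    pathway_counts.items(),
    key=lambda item: coverage_percent(*item[1]),
    reverse=True,
):
    print(f"{name}: {mapped}/{total} = {coverage_percent(mapped, total):.1f}%")
```

On these counts, coverage ranges from about one third (GnRH signaling, photo-transduction) to one half (steroid biosynthesis) of the reference pathway genes.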

Expression analysis of reproduction relevant genes by real time PCR

Some of the important gene orthologues, i.e. vitellogenin receptor, insulin receptor b, fgg protein, green sensitive cone opsin, steroid receptor homolog svp 46, prolactin, activin receptor, spermatogenic glyceraldehyde-3-phosphate dehydrogenase, semaphorin 3fa, follistatin-like 2, cathepsin-Z and estrogen receptor binding site associated antigen 9 variant 1, were analyzed by qPCR in the preparatory and post-spawning periods. All these transcripts were identified from samples collected during the pre-spawning phase, but the preparatory and post-spawning phases are equally important for understanding the events that initiate gonad maturation in Indian carps [14], and accumulation of a transcript in these stages indicates a role in maturation.
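As an illustration of how a relative expression ratio of this kind can be obtained, the sketch below assumes the efficiency-corrected model of Pfaffl [34], with beta actin as the reference gene and the post-spawning phase treated as the control condition. The function name, argument names, efficiencies and Ct values are all hypothetical and are not taken from this study.

```python
def pfaffl_ratio(e_target, ct_target_control, ct_target_sample,
                 e_ref, ct_ref_control, ct_ref_sample):
    """Efficiency-corrected relative expression ratio (Pfaffl model).

    ratio = E_target ** (Ct_target_control - Ct_target_sample)
            / E_ref ** (Ct_ref_control - Ct_ref_sample)

    E is the amplification efficiency expressed as fold increase per
    cycle (2.0 means perfect doubling).
    """
    delta_target = ct_target_control - ct_target_sample
    delta_ref = ct_ref_control - ct_ref_sample
    return (e_target ** delta_target) / (e_ref ** delta_ref)


# Hypothetical Ct values: control = post-spawning, sample = preparatory.
ratio = pfaffl_ratio(
    e_target=2.0, ct_target_control=25.0, ct_target_sample=23.0,
    e_ref=2.0, ct_ref_control=18.0, ct_ref_sample=18.0,
)
print(ratio)  # the target crosses threshold 2 cycles earlier, reference unchanged: 4.0
```

A ratio above 1 corresponds to up-regulation in the preparatory phase relative to post-spawning, and a ratio below 1 to down-regulation.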

Vitellogenin, synthesized in and released from the liver, is carried through the blood and taken up into oocytes by the vitellogenin receptor, an essential process for successful reproduction in oviparous animals [46]. Although different isoforms of vitellogenin [47] and their role in gonad maturation [48] are described in the literature, little is reported about the vitellogenin receptor and its role. In L. rohita, expression of vitellogenin receptor was down-regulated in the preparatory phase as compared to post-spawning in most of the tissues studied (except testis). In Atlantic bluefin tuna (Thunnus thynnus L.), amounts of vitellogenin receptor were lower in individuals with yolked oocytes (ripening stage, May-June) and increased after spawning in July [49]. Insulin is generally implicated in growth, development and reproduction in teleosts [50], and the expression of different insulin receptor genes has been studied [49]. In L. rohita, expression of insulin receptor b was down-regulated in liver, with almost no change in other tissues, during the preparatory phase as compared to post-spawning, whereas a high level of insulin receptor b mRNA was reported in rainbow trout ovary [51]. A role for fibrinogen-γ (FGG) in uterine epithelial cells during normal pregnancy, pseudo-pregnancy and in hormone-treated rats has been suggested [52]. In rohu, fibrinogen gamma chain expression during the preparatory phase was significantly lower in brain and testis than in the post-spawning phase, and no variation was observed in other tissues. In zebrafish, however, higher expression was observed in the embryonic yolk syncytial layer than in the early cells of the developing liver [53]. Photoreceptors mainly consist of rods and cones. In the retina of diurnal primates, cones are further subdivided into three subtypes, the red-, green-, and blue-sensitive cones, whose visual pigments are maximally sensitive to long, middle, and short wavelengths, respectively [54]. 
Significantly lower expression of green sensitive cone opsin was observed in brain, liver and testis during the preparatory phase in L. rohita, whereas expression of the Pi-green1 and Pi-green2 cone opsins was observed in skin and lateral eyes in Paracheirodon innesi [55]. Steroid receptor homolog svp 46 showed significantly reduced expression in brain, liver and testis, while spermatogenic glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and semaphorin 3fa were both down-regulated in brain, liver and pituitary during the preparatory phase in L. rohita. In human, GAPDH protein was expressed in both Sertoli cells and elongated sperm [56], and high expression levels of semaphorin 3fa domains were observed in human oocytes from the earliest follicle stages [57], but no report was available on their expression level in any fish species. The expression level of follistatin-like 2 was significantly lower in ovary during the preparatory phase, whereas in Xenopus it is reported as a gene expressed in the early gastrula [58]. The cathepsin-Z level was significantly higher in ovary in post-spawning rohu, while abundant cathepsin-Z expression was observed in low-quality unfertilized eggs of rainbow trout [59]. In most vertebrates, 11-beta-hydroxysteroid dehydrogenase b2 is essential for conferring aldosterone-specific actions in mineralocorticoid target tissues and for protecting glucocorticoid-sensitive tissues during stress [44]. The expression level of 11-beta-hydroxysteroid dehydrogenase was significantly lower in brain and liver during the preparatory phase in rohu, whereas expression of this transcript was observed in nearly all peripheral tissues in zebrafish [44]. Prolactin plays important roles in freshwater fish reproduction [60] and seasonal acclimatization [61]. The expression level of prolactin was significantly lower in brain, liver and testis during the preparatory phase compared to post-spawning in L. rohita. 
However, in Cyprinus carpio a higher level of mRNA expression was noticed in the pituitary of summer carp as compared to winter carp [61]. Activins are critical components of the signaling network that controls female reproduction, and different receptors control their roles and functions in the hypothalamus [62]. The activin receptor level was significantly lower in brain, liver and pituitary during the preparatory phase in L. rohita, whereas in Ctenopharyngodon idella activin receptor transcripts show high expression levels in extra-gonadal tissues, including pituitary, brain, and liver [63]. mRNA encoding estrogen receptor binding site associated antigen 9 variant 1 has been reported in mammalian oocytes [64], but nothing is known about either its expression pattern or its function in oocytes during maturation, fertilization, and subsequent embryonic development. In the present study, a significantly higher expression level of this transcript was found in ovary and testis of rohu in the preparatory phase.

Almost all the unknown transcripts were down-regulated in brain, liver and ovary (Fig 7), and node-19676, node-20067 and node-20271 were up-regulated in testes during the preparatory phase. Many unknown transcripts or novel sequences (34.7%) were also reported in transcriptomic analyses of zebrafish gonad and brain [65]. Searches for sequences similar to the unknown transcripts in the currently available common carp database were not successful: BLASTN, tBLASTN, BLASTX and BLASTP always matched only a few bases, as was reported in our previous study [6]. Nevertheless, expression of all these transcripts was also observed in this prolific breeder, and significantly higher expression (p<0.001) of node-7314 was noticed in common carp pituitary. Expression levels of almost all 6 unknown transcripts were significantly higher in the seasonal breeder rohu than in common carp, which suggests a possible role of these transcripts in the reproduction of rohu, as was also reported in our previous study [6].

Development of EST-SSR markers and SNPs and validation of EST-SSR markers

The identification of a large number of SSRs in the L. rohita transcripts, at a frequency of one SSR per 3.41 kb of sequence, will enrich the existing marker resources of rohu and facilitate genetic improvement of this species.
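To make the notion of an SSR concrete, the following minimal sketch scans a nucleotide sequence for perfect di- to hexa-nucleotide repeats using regular expressions. It is not the SSR detection tool or the parameter set used in this study; the minimum repeat counts in `MIN_REPEATS`, the function names and the example sequence are hypothetical choices made only for illustration.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from this study).
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(
        n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n)
    )


def find_ssrs(sequence):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    sequence = sequence.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # One motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(sequence):
            motif = match.group(1)
            if is_primitive(motif):
                repeats = len(match.group(0)) // unit_len
                hits.append((match.start(), motif, repeats))
    return sorted(hits)


# Made-up sequence containing an (AT)7 and a (CAG)5 repeat.
example = "GGC" + "AT" * 7 + "CCGT" + "CAG" * 5 + "TT"
print(find_ssrs(example))  # [(3, 'AT', 7), (21, 'CAG', 5)]
```

Dividing the total length of the scanned sequences (in kb) by the number of hits gives an SSR frequency of the "one SSR per N kb" kind quoted above.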

Twelve of the 29 microsatellite loci identified in genes involved in reproductive processes (about 41%) were polymorphic in the present study, whereas 20 of 128 loci (about 16%) were reported to be polymorphic in our previous study [6]. The higher percentage of polymorphic loci in this study could be due to high polymorphism between the mapping parents. However, a high attrition rate of potential microsatellite markers is generally observed in the case of EST-SSRs, from the primer design step following identification of a simple sequence repeat, through agarose gel analysis, to polyacrylamide gel optimization and final analysis. The relatively low success rate of primers showing good amplification may be due, at least in part, to high intra-specific polymorphism in the L. rohita genome, as was seen in the analysis of flanking region sequences in this study. Similarly, Cameron et al. [66] attributed variation in amplification efficiency of sea urchin microsatellite loci to a high level of genomic polymorphism.

The presence of repeats in some of the corresponding zebrafish sequences indicated that SSRs may be present in these sequences across species, although this could not be confirmed in common carp due to the lack of sequences in databases. A total of 52,925 SNPs, comprising 1,827 homozygous and 51,098 heterozygous SNPs, were identified in rohu by comparison with the zebrafish (zv9) genome, as no reference sequences are reported for rohu in databases. SNPs have been reported for disease-resistant and susceptible lines of rohu [17], but the SNPs detected in the present study need further validation.

Identification of Isoforms

It has been suggested that assembly with velvet followed by oases yields better contigs/transcripts and resolves transcript isoforms [36]. The production of 88,612 transcripts by oases, in comparison to the 62,283 contigs produced by velvet, suggests that rohu transcripts include isoforms. Gene isoforms were observed among the reproduction-related transcripts in L. rohita. Seven isoforms of activin receptors were found in rohu, while six variants were observed in grass carp [63]. TATA box binding protein has two isoforms in rohu; a similar splice variant was observed in human, encoding the polyglutamine-containing N-terminal domain that accumulates in Alzheimer's disease [67]. However, no isoforms of this protein have been reported in any other fish species. Transferrin receptor showed two isoforms in rohu, which is quite similar to other vertebrates [68]. Thyroid hormone receptor showed seven isoforms in rohu, while two thyroid hormone receptor-α genes (thraa, thrab) were found in zebrafish [69]. Seven isoforms of progesterone receptor membrane component were observed in rohu, in contrast to three forms reported in channel catfish [70]. Six, four and three isoforms were observed for prostaglandin synthase, beta-galactosyltransferase and semaphorin, respectively, in rohu, but no isoforms are reported in other fish species. Three isoforms were found for retinoic acid receptor in rohu, as compared to seven isoforms in zebrafish [71]. Six isoforms of estrogen-related receptor and two isoforms of cadherin were found in rohu; their number varies in mammals [72] and zebrafish [73]. The present study revealed six isoforms of insulin receptor in rohu, whereas zebrafish expresses two isoforms (insra and insrb) [74]. Nuclear receptors are a class of proteins found within cells that are responsible for sensing steroid and thyroid hormones and certain other molecules; in rohu, twelve isoforms of nuclear receptor were observed. 
Talin showed two isoforms in rohu, whereas talin-1 and talin-2 in model vertebrates produce two talins through alternative mRNA splicing [75]. DNA methyltransferase showed two isoforms in both rohu and zebrafish [76]. It is necessary to mention here that these isoforms in rohu are the product of next generation sequencing, in which the sequences were generated as short reads of 75 bp and then assembled by software. Validation of all these isoforms is therefore essential before final conclusions are drawn.
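Isoform counts of this kind can be tallied from the assembler output by grouping transcripts that share a locus. The sketch below assumes FASTA headers of the form `Locus_<n>_Transcript_<i>/<total>_Confidence_<c>_Length_<len>`, the naming convention commonly produced by oases; the function name and the example headers are made up, and the actual headers of this assembly may differ.

```python
import re
from collections import defaultdict

# Assumed header layout: Locus_<n>_Transcript_<i>/<total>_...
HEADER = re.compile(r"Locus_(\d+)_Transcript_(\d+)/(\d+)")


def isoforms_per_locus(headers):
    """Map each locus number to the number of distinct transcripts seen for it."""
    transcripts = defaultdict(set)
    for header in headers:
        match = HEADER.search(header)
        if match:  # headers that do not follow the assumed layout are skipped
            locus, transcript_id = int(match.group(1)), int(match.group(2))
            transcripts[locus].add(transcript_id)
    return {locus: len(ids) for locus, ids in transcripts.items()}


# Made-up headers for illustration.
example_headers = [
    ">Locus_1_Transcript_1/2_Confidence_0.750_Length_1200",
    ">Locus_1_Transcript_2/2_Confidence_0.500_Length_1100",
    ">Locus_2_Transcript_1/1_Confidence_1.000_Length_640",
]
counts = isoforms_per_locus(example_headers)
print(counts)  # {1: 2, 2: 1}
print(sum(1 for n in counts.values() if n > 1))  # loci with more than one transcript: 1
```

Transcripts grouped under one locus by an assembler are only candidate isoforms; as noted above, they still require experimental validation.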

Conclusion

The 62,283 high-quality L. rohita contigs derived from brain, pituitary, liver, intestine, kidney, tongue, nose, eye, gill, muscle, heart, ovary and testis tissues from the pre-spawning phase contribute a significant non-redundant set of EST resources. Out of the 17,925 gene orthologues found, a total of 940 reproduction-relevant genes were analyzed and are reported for the first time in L. rohita. In the KEGG analysis, 8,683 well-categorized, annotated sequences along with 328 different enzymes/orthologues representing nine important reproduction-relevant pathways were obtained. A total of 22,383 SSRs were identified in 17,244 transcripts of rohu, and 12 polymorphic loci were identified from 29 reproduction-related genes. Differences in tissue expression levels of 13 known genes and 6 unknown putative genes indicate that these transcripts vary between the preparatory and post-spawning phases in the monsoon-breeding carp L. rohita. Isoforms were also found for several reproduction-related gene transcripts in L. rohita. These data may serve as important and valuable resources for L. rohita genetics and genomics, both as a reference set for future large-scale transcriptome studies and for gene identification and functional analysis of rohu reproduction.

Supporting Information

S1 Fig. Mapping of rohu transcripts with zebrafish, salmon, and catfish mRNA.

https://doi.org/10.1371/journal.pone.0132450.s001

(TIF)

S3 Fig. Comparison of tissue expression ratio of 6 unknown transcripts between rohu and common carp in brain, pituitary, ovary and liver tissues during preparatory phase using beta actin as reference (whisker-box plots).

https://doi.org/10.1371/journal.pone.0132450.s003

(TIF)

S1 Table. Transcript specific Forward (F) and reverse (R) primers used in real-time PCR.

https://doi.org/10.1371/journal.pone.0132450.s004

(DOCX)

S2 Table. List of reproduction-relevant transcripts identified in Labeo rohita.

https://doi.org/10.1371/journal.pone.0132450.s005

(DOC)

S3 Table. Comparison of repeat types identified in L. rohita with corresponding gene sequences of Cyprinus carpio and Danio rerio.

https://doi.org/10.1371/journal.pone.0132450.s006

(DOCX)

S4 Table. List of SNPs identified from rohu contigs.

https://doi.org/10.1371/journal.pone.0132450.s007

(XLSX)

Acknowledgments

This work was supported by Indian Council of Agricultural Research (ICAR) and Central Institute of Freshwater Aquaculture (CIFA), Government of India. We would like to thank DG, ICAR for his encouragement.

Author Contributions

Conceived and designed the experiments: SN. Performed the experiments: DKS SN SPP PKM PD PR. Analyzed the data: DKS SN SPP PKM PD. Contributed reagents/materials/analysis tools: PR. Wrote the paper: DKS SN SPP PKM PD PR JKS PJ. Overall monitoring, manuscript overview and editing: JKS PJ.

References

  1. 1. World Review of Fisheries and Aquaculture: The state of world fisheries and aquaculture. Food and Agriculture Organisation of the United Nations, Rome; 2012.
  2. 2. Khan HA, Jhingran VG (1995) Synopsis of biological data on Rohu. Rome: Food and Agriculture Organization of the United Nations. FAO Fisheries Synopsis No. 111.
  3. 3. Routray P, Verma DK, Sarkar SK, Sarangi N (2007) Recent advances in carp seed production and milt cryopreservation. Fish physiology biochemistry 33: 413–427.
  4. 4. Das S, Chhottaray C, Das-Mahapatra K, Saha JN, Baranski M, Robinson N, et al. (2014) Analysis of immune-related ESTs and differential expression analysis of few important genes in lines of rohu (Labeo rohita) selected for resistance and susceptibility to Aeromonas hydrophila infection. Molecular biology report 41(11):7361–7371.
  5. 5. Robinson N, Sahoo PK, Baranski M, Mahapatra KD, Saha JN, Das S, et al. (2011) Expressed Sequences and Polymorphisms in Rohu Carp (Labeo rohita, Hamilton) Revealed by mRNA-seq. Marine Biotechnology 10126(12): 9433–9438.
  6. 6. Sahu DK, Panda SP, Panda S, Das P, Meher PK, Hazra RK et al. (2013) Identification of reproduction-related genes and SSR-markers through expressed sequence tags analysis of a monsoon breeding carp rohu, Labeo rohita (Hamilton). Gene 524:1–14. pmid:23583682
  7. 7. Patra BC (2011) Nutritional energetics of an indian major carp Labeo rohita (ham.), family cyprinidae. International journal current research 3(11): 259–263.
  8. 8. Quasim SZ, Qayyum A (1962) Spawning frequencies and breeding season of some freshwater fishes with special reference to those occurring in the plains of northern India. Indian journal of Fisheries 8:24–43.
  9. 9. Bhattacharya S (1999) Recent advances in the hormonal regulation of gonadal maturation and spawning in fish. Current science 3:342–349.
  10. 10. Sundararaj BI, Vasal S (1976) Photoperiod and Temperature Control in the Regulation of Reproduction in the Female Catfish Heteropneustes fossilis. Journal of the fisheries research board of Canada 33(4):959–973.
  11. 11. Maitra SK, Seth M, Chattoraj A (2006) Photoperiod, pineal photoreceptors and melatonin as the signal of photoperiod in the regulation of reproduction in fish. Reproductive biology and endocrinology 10(2):73–87.
  12. 12. Nandi S, Chattopadhyay DN, Verma JP, Sarkar SK, Mukhopadhyay PK (2001) Effect of dietary supplementation of fatty acids and vitamins on the breeding performance of the carp Catla catla. Reprod. Nutr. Dev. 41 (4):365–375. pmid:11789892
  13. 13. Gupta SD, Rath SC, Dasgupta S, Tripathi SD (1995) A first report on quadruple spawning of Catla catla (Ham.). Veterinary archives 65 (5):143–148.
  14. 14. Sarkar SK, Saha A, Dasgupta S, Nandi S, Verma DK, Routray P, et al. (2010) Photothermal manipulation of reproduction in Indian major carp: a step forward for offseason breeding and seed production. Current science 7:960–965.
  15. 15. Das-Mahapatra K, Jana RK, Saha JN, Gjerde B, Sarangi N (2006) Lessons from the breeding program of rohu: In Development of aquatic animal genetic improvement and dissemination programs: current status and action plans. Malaysia: World Fish Centre 34–40.
  16. 16. Das P, Barat A, Meher PK, Ray PP, Majumdar D (2005) Isolation and characterization of polymorphic microsatellites in Labeo rohita and their cross species amplification in related species. Molecular ecology notes 5:231–233.
  17. 17. Robinson N, Baranski M, Mahapatra KD, Saha JN, Das S, Mishra J, et al. (2014) A linkage map of transcribed single nucleotide polymorphisms in rohu (Labeo rohita) and QTL associated with resistance to Aeromonas hydrophila. BMC genomics 30:15–541.
  18. 18. Weltzien FA, Andersson E, Andersen Ø, Shalchian-Tabrizi K, Norberg B (2004) The brain-pituitary-gonad axis in male teleosts, with special emphasis on flatfish (Pleuronectiformes). Comp Biochem Physiol A Mol Integr Physiol. 137(3):447–477. pmid:15123185
  19. 19. Ji P, Liu G, Xu J, Wang X, Li J, Zhao Z, et al. (2012) Characterization of common carp transcriptome: sequencing, de novo assembly, annotation and comparative genomics. PLoS one 7(4): 351–352.
  20. 20. Li AQ, Zhao CZ, Wang XJ, Liu ZJ, Zhang LF, Song GQ, et al. (2010) Identification of SSR markers using soybean (Glycine max) ESTs from globular stage embryos. Electron J Biotechnol 13(5):1–11.
  21. 21. Li C, Zhang Y, Wang R, Lu J, Nandi S, Mohanty S, et al. (2012) RNA-seq analysis of mucosal immune responses reveals signatures of intestinal barrier disruption and pathogen entry following Edwardsiella ictaluri infection in channel catfish, Ictalurus punctatus. Fish shellfish immunology 32(5): 816–827. pmid:22366064
  22. 22. Vesterlund L, Jiao H, Unneberg P, Hovatta O, Kere J (2011). The zebrafish transcriptome during early development. BMC developmental biology 24: 11–30.
  23. 23. Von-Schalburg KR, Rise ML, Brown GD, Davidson WS, Koop BF (2005) A comprehensive survey of the genes involved in maturation and development of the rainbow trout ovary. Biology of Reproduction 72: 687–699. pmid:15496514
  24. 24. Gohin M, Bobe J, Chesnel F (2010) Comparative transcriptomic analysis of follicle enclosed oocyte maturational and developmental competence acquisition in two non-mammalian vertebrates. BMC genomics 11:18. pmid:20059772
  25. 25. Luckenbach JA, Iliev DB, Goetz FW, Swanson P (2008) Identification of differentially expressed genes during primary and early secondary oocyte growth in coho salmon Oncorhynchus kisutch. Reproductive biology and endocrinology 6: 2. pmid:18205936
  26. 26. Chu SL, Weng CF, Hsiao CD, Hwang PP, Chen YC, Ho JM, et al. (2006) Profile analysis of expressed sequence tags derived from ovary of tilapia, Oreochromis mossambicus. Aquaculture 251: 537–548.
  27. 27. Mommens M, Fernandes JMO, Bizuayehu TT, Bolla SL, Johnston IA, Babiak I (2010) Maternal gene expression in Atlantic halibut (Hippoglossus hippoglossus L.) and its relation to egg quality. BMC research notes 3: 138. pmid:20497529
  28. 28. Cerdà J, Bobe J, Babin PJ, Admon A, Lubzens E (2008) Functional genomics and proteomic approaches for the study of gamete formation and viability in farmed finfish. Reviews of Fish Science 16: 54–70.
  29. 29. Leong JS, Jantzen SG, Von-schalburg KR, Cooper GA, Messmer AM, Liao NY, et al. (2010) Salmo salar and Esox lucius full-length cDNA sequences reveal changes in evolutionary pressures on a post-tetraploidization genome. BMC genomics 11: 279. pmid:20433749
  30. 30. Goetz FW, McCauley L, Goetz GW, Norberg B (2006) Using global genome approaches to address problems in cod mariculture, ICES Journal of Marine Science 63: 393–399.
  31. 31. Gupta SD, Reddy PVGK, Pani KC (1990) Advancing maturity and spawning in Asiatic carps through brood stock management. In: Keshavanath P, Radhakrishnan KV (Ed.) Carp seed production technology proceedings of the workshop on carp seed production technology.
  32. 32. Chomczynski P, Sacchi N (1987) Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Analytical biochemistry 162(1): 156–159. pmid:2440339
  33. 33. Sahoo L, Patel A, Sahu BP, Mitra S, Meher PK, Mahapatra KD, et al. (2014) Preliminary genetic linkage map of Indian major carp, Labeo rohita (Hamilton 1822) based on microsatellite markers. Journal of Genetics (Article in press).
  34. 34. Pfaffl MW (2001) A new mathematical model for relative quantification in real-time RT—PCR. Nucleic acids research 29: 2002–2007.
  35. 35. Morozova O, Hirst M, Marra MA (2009) Applications of new sequencing technologies for transcriptome analysis. Annual review of genomics and human genetics 10: 135–51. pmid:19715439
  36. 36. Garg R, Patel RK, Tyagi AK, Jain M (2011) De-novo assembly of chickpea transcriptome using short reads for gene discovery and marker identification. DNA research 18: 53–63. pmid:21217129
  37. 37. Salem M, Rexroad CE, Wang J, Thorgaard GH, Yao J (2010) Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches. BMC genomics 11: 564.
  38. 38. Patel A, Das P, Barat A, Sarangi N (2009) Estimation of genome size in Indian major carps Labeo rohita (Hamilton), Catla catla (Hamilton), Cirrhinus mrigala (Hamilton) and Labeo calbasu (Hamilton) by Feulgen microdensitometry method. Indian journal of fisheries 56(1): 65–67.
  39. 39. Chakraborti P, Bhattacharya S (1984) Plasma thyroxine levels in freshwater perch: influence of season, gonadotropins, and gonadal hormones. General and comparative endocrinology 53(2): 179–186. pmid:6421653
  40. 40. Millar RP, Lu ZL, Pawson AJ, Flanagan CA, Morgan K, Maudsley SR, et al. (2004) Gonadotropin-releasing hormone receptors. Endocrine reviews 25: 235–275.
  41. 41. Schmitt A, Nebreda AR (2002) Signalling pathways in oocyte meiotic maturation. Journal of cell science 115 (12): 2457–2459.
  42. 42. Haccard O, Dupré A, Liere P, Pianos A, Eychenne B, Jessus C, et al. (2012) Naturally occurring steroids in Xenopus oocyte during meiotic maturation. Unexpected presence and role of steroid sulfates. Molecular and cellular endocrinology 362(1–2): 110–119.
  43. 43. Alderman SL, Vijayan MM (2012) 11β-Hydroxysteroid dehydrogenase type 2 in zebrafish brain: a functional role in hypothalamus-pituitary-interrenal axis regulation. Journal of endocrinology 215(3): 393–402. pmid:23042946
  44. 44. Levi L, Ziv T, Admon A, Levavi-Sivan B, Lubzens E (2012) Insight into molecular pathways of retinal metabolism, associated with vitellogenesis in zebrafish. American journal of physiology: endocrinology and metabolism 302 (6): 626–644.
  45. 45. Teng KK, Hempstead BL (2004) Neurotrophins and their receptors: signaling trios in complex biological systems. Cellular and molecular life sciences 61(1): 35–48. pmid:14704852
  46. 46. Dominguez GA, Quattro JM, Denslow ND, Kroll KJ, Prucha MS, Porak WF, et al. (2012) Identification and transcriptional modulation of the largemouth bass, Micropterus salmoides, vitellogenin receptor during oocyte development by insulin and sex steroids. Biology of reproduction 87(3): 67.
  47. 47. Nath P, Maitra S (2001) Role of two plasma vitellogenins from Indian major carp (Cirrhinus mrigala) in catfish (Clarias batrachus) vitellogenesis. General and comparative endocrinology 124(1): 30–44. pmid:11703069
  48. 48. Nath P, Bhakta M, Mitra K (1992) Demonstration of two forms of vitellogenin in serum of estradiol-17 beta-treated Indian major carp, Labeo rohita. Indian journal of experimental biology 30(6): 464–469. pmid:1506024
  49. 49. Pousis C, Santamaria N, Zupa R, De Giorgi C, Mylonas CC, Bridges CR, et al. (2012) Expression of vitellogenin receptor gene in the ovary of wild and captive Atlantic bluefin tuna (Thunnus thynnus). Animal reproduction science 132 (1–2): 101–110.
  50. 50. Johns SM, Kane MD, Denslow ND, Watanabe KH, Orlando EF, Villeneuve DL, et al. (2009) Characterization of ontogenetic changes in gene expression in the fathead minnow (Pimephales promelas). Environmental toxicology and chemistry 28(4): 873–880. pmid:19391683
  51. 51. Greene MW, Chen TT (1999) Characterization of teleost insulin receptor family members. General and comparative endocrinology 115 (2): 254–269. pmid:10417239
  52. 52. Lecce L, Kaneko Y, Madawala RJ, Murphy CR (2011) ICAM1 and fibrinogen-γ are increased in uterine epithelial cells at the time of implantation in rats. Molecular reproduction and development 78(5): 318–327. pmid:21448983
  53. 53. Fish RJ, Vorjohann S, Béna F, Fort A, Neerman-Arbez M (2012) Developmental expression and organisation of fibrinogen genes in the zebrafish. Thrombosis and haemostasis 107(1): 158–166.
  54. 54. Wikler KC, Rakic P (1994) An array of early differentiating cones precedes the emergence of the photoreceptor mosaic in the fetal monkey retina. Proceedings of the National Academy of Sciences 91: 6534–6538.
  55. 55. Kasai A, Oshima N (2006) Light-sensitive motile iridophores and visual pigments in the neon tetra, Paracheirodon innesi. Zoological Science 23(9): 815–819. pmid:17043404
  56. 56. Liu J, Sun CM, Zhang CL, Wang X, Li JY (2013) Location and characterization of GAPDS in male reproduction. Urologia Internationalis 90(4): 449–454. pmid:23306140
  57. 57. Callander DC, Lamont RE, Childs SJ, McFarlane S (2007) Expression of multiple class three semaphorins in the retina and along the path of zebrafish retinal axons. Developmental dynamics 236(10): 2918–2924. pmid:17879313
  58. 58. Dal-Pra S, Fürthauer M, Van-Celst J, Thisse B, Thisse C (2006) Noggin1 and Follistatin-like2 function redundantly to Chordin to antagonize BMP activity. Developmental biology 298(2): 514–526. pmid:16890217
  59. 59. Aegerter S, Jalabert B, Bobe J (2005) Large scale real-time PCR analysis of mRNA abundance in rainbow trout eggs in relationship with egg quality and post-ovulatory ageing. Molecular reproduction and development 72(3): 377–385. pmid:16075464
  60. 60. Weber GM, Grau EG (1999) Changes in serum concentrations and pituitary content of the two prolactins and growth hormone during the reproductive cycle in female tilapia, Oreochromis mossambicus, compared with changes during fasting. Comparative Biochemistry and Physiology Part C: Pharmacology, Toxicology and Endocrinology 124(3): 323–335.
  61. 61. Figueroa J, Molina A, Alvarez M, Villanueva J, Reyes A, Leon G, et al. (1994) Prolactin gene expression and changes of prolactin pituitary level during the seasonal acclimatization of the carp. Comparative Biochemistry and Physiology Part B: Biochemistry and Molecular Biology 108(4): 551–560.
  62. 62. Sandoval-Guzmán T, Göngrich C, Moliner A, Guo T, Wu H, Broberger C, et al. (2012) Neuroendocrine control of female reproductive function by the activin receptor ALK7. FASEB J 26(12): 4966–4976. pmid:22954591
  63. 63. Song C, Wang X, Zhou H (2010) Molecular cloning of activin type I and type II receptors and differential regulation of their expression by activin in grass carp pituitary cells. General and comparative endocrinology 166(1): 211–216.
  64. 64. Nagler JJ, Cavileer TD, Verducci JS, Schultz IR, Hook SE, Hayton WL (2012) Estrogen receptor mRNA expression patterns in the liver and ovary of female rainbow trout over a complete reproductive cycle. General and comparative endocrinology 178(3): 556–561.
  65. 65. Sreenivasan R, Cai M, Bartfai R, Wang X, Christoffels A, Orban L (2008) Transcriptomic analyses reveal novel genes with sexually dimorphic expression in the zebrafish gonad and brain. PloS one 3(3): e1791.
  66. 66. Cameron RA, Leahy PS, Britten RJ, Davidson EH (1999) Microsatellite loci in wild-type and inbred Strongylocentrotus purpuratus. Developmental biology 208: 255–264. pmid:10191043
  67. 67. Reid SJ, Whittaker DJ, Greenwood D, Snell RG (2009) A splice variant of the TATA-box binding protein encoding the polyglutamine-containing N-terminal domain that accumulates in Alzheimer's disease. Brain research 1268: 190–199. pmid:19285969
  68. 68. Ikuta K, Yersin A, Ikai A, Aisen P, Kohgo Y (2010) Characterization of the interaction between diferric transferrin and transferrin receptor 2 by functional assays and atomic force microscopy. Journal of molecular biology 397(2): 375–384. pmid:20096706
  69. 69. Takayama S, Hostick U, Haendel M, Eisen J, Darimont B (2008) An F-domain introduced by alternative splicing regulates activity of the zebrafish thyroid hormone receptor alpha. General and comparative endocrinology 155(1): 176–189. pmid:17583703
  70. 70. Kazeto Y, Goto-Kazeto R, Thomas P, Trant JM (2005) Molecular characterization of three forms of putative membrane-bound progestin receptors and their tissue-distribution in channel catfish, Ictalurus punctatus. Journal of molecular endocrinology 34(3): 781–791. pmid:15956347
  71. 71. Alsop D, Matsumoto J, Brown S, Van-Der-Kraak G (2008) Retinoid requirements in the reproduction of zebrafish. General and comparative endocrinology 156(1): 51–62. pmid:18158153
  72. 72. Pinto PI, Teodósio R, Socorro S, Power DM, Canário AV (2012) Structure, tissue distribution and estrogen regulation of splice variants of the sea bream estrogen receptor α gene. Gene 503(1): 18–24. pmid:22579469
  73. 73. Tada MN, Senzaki K, Tai Y, Morishita H, Tanaka YZ, Murata Y, et al. (2004) Genomic organization and transcripts of the zebrafish Protocadherin genes. Gene 340(2):197–211. pmid:15475161
  74. 74. Toyoshima Y, Monson C, Duan C, Wu Y, Gao C, Yakar S, et al. (2008) The role of insulin receptor signaling in zebrafish embryogenesis. Endocrinology 149(12): 5996–6005. pmid:18687786
  75. 75. Senetar MA, McCann RO (2005) Gene duplication and functional divergence during evolution of the cytoskeletal linker protein talin. Gene 362: 141–152. pmid:16216449
  76. 76. Smith TH, Dueck CC, Mhanni AA, McGowan RA (2005) Novel splice variants associated with one of the zebrafish dnmt3 genes. BMC developmental biology 5: 23. pmid:16236173