Novel transcriptome assembly and improved annotation of the whiteleg shrimp (Litopenaeus vannamei), a dominant crustacean in global seafood mariculture

Ghaffari, Noushin; Sanchez-Flores, Alejandro; Doan, Ryan; Garcia-Orozco, Karina D.; Chen, Patricia L.; Ochoa-Leyva, Adrian; Lopez-Zavala, Alonso A.; Carrasco, J. Salvador; Hong, Chris; Brieba, Luis G.; Rudiño-Piñera, Enrique; Blood, Philip D.; Sawyer, Jason E.; Johnson, Charles D.; Dindot, Scott V.; Sotelo-Mundo, Rogerio R.; Criscitiello, Michael F.

doi:10.1038/srep07081

Download PDF

Article
Open access
Published: 25 November 2014

Novel transcriptome assembly and improved annotation of the whiteleg shrimp (Litopenaeus vannamei), a dominant crustacean in global seafood mariculture

Noushin Ghaffari¹,
Alejandro Sanchez-Flores²,
Ryan Doan³,
Karina D. Garcia-Orozco⁴,
Patricia L. Chen⁵,
Adrian Ochoa-Leyva⁶,
Alonso A. Lopez-Zavala⁴,
J. Salvador Carrasco⁴,
Chris Hong⁵,
Luis G. Brieba⁷,
Enrique Rudiño-Piñera⁸,
Philip D. Blood⁹,
Jason E. Sawyer¹⁰,
Charles D. Johnson¹,
Scott V. Dindot³,
Rogerio R. Sotelo-Mundo⁴ &
…
Michael F. Criscitiello^3,5,11

Scientific Reports volume 4, Article number: 7081 (2014) Cite this article

11k Accesses
81 Citations
5 Altmetric
Metrics details

Subjects

Abstract

We present a new transcriptome assembly of the Pacific whiteleg shrimp (Litopenaeus vannamei), the species most farmed for human consumption. Its functional annotation, a substantial improvement over previous ones, is provided freely. RNA-Seq with Illumina HiSeq technology was used to analyze samples extracted from shrimp abdominal muscle, hepatopancreas, gills and pleopods. We used the Trinity and Trinotate software suites for transcriptome assembly and annotation, respectively. The quality of this assembly and the affiliated targeted homology searches greatly enrich the curated transcripts currently available in public databases for this species. Comparison with the model arthropod Daphnia allows some insights into defining characteristics of decapod crustaceans. This large-scale gene discovery gives the broadest depth yet to the annotated transcriptome of this important species and should be of value to ongoing genomics and immunogenetic resistance studies in this shrimp of paramount global economic importance.

Integrated transcriptome catalog of Tenualosa ilisha as a resource for gene discovery and expression profiling

Article Open access 17 April 2023

De novo transcriptomes of six calanoid copepods (Crustacea): a resource for the discovery of novel genes

Article Open access 27 April 2023

Transcriptome data for an ancient ‘living-fossil’ mollusc, Entemnotrochus rumphii

Article Open access 24 August 2024

Introduction

Litopenaeus vannamei is a prawn native to the eastern Pacific Ocean from Sonoran Mexico to northern Peru where it has long been caught by inshore fisherman and offshore trawlers. Commonly called both the whiteleg shrimp and Pacific white shrimp, this species was first aquacultured in Florida in 1973 using a mated individual from Panama. By the early 1980's it was being cultured in Hawaii, the contiguous United States, Central and South America and its mariculture spread to Asia in the 2000's¹. It is now the dominant crustacean species grown in aquaculture worldwide, its use having surpassed that of the giant tiger prawn Penaeus monodon². The United Nations Food and Agriculture Organization lists global production of this species growing from 146,382 tons in 2000 to 2,721,929 tons (>$11 billion US) in 2010, making the production of this species one of the fastest growing global food crops.

L. vannamei's dominant role in shellfish aquaculture has generated demand for genetic tools in this species that can be applied to nutrition, reproduction, development and resistance to infectious disease. There have been several RNA-Seq and transcriptome reports for L. vannamei, including next-generation sequencing datasets³, studies of transcriptomic response to pollutant exposure⁴, different tissues⁵ and infection^{6,7,8,9,10,11}. However, this report is the most extensive gene discovery work in the species to date and the first to make extensive functional annotation publicly available to the scientific community.

In this study, we used Illumina Hiseq for RNA-Seq experiments in hepatopancreas, gill, pleopod and abdominal muscle of one male shrimp to produce a robust pooled transcriptome assembly for L. vannamei using the Trinity software package. The output contigs were then functionally annotated employing Trinotate. These RNA-Seq reads have been availed to the public via these BioSample accessions: SAMN02918336, SAMN02918337, SAMN02918338, SAMN02918339 at NCBI and our results are available at: http://repository.tamu.edu/handle/1969.1/152151. We demonstrate the utility of this resource with verification of immune transcripts, identification of novel shrimp genes and metabolic pathway analysis.

Results

High quality de novo assembly of L. vannamei transcriptome

A total of 400,228,040 Illuimina HiSeq reads from hepatopancreas, gill, pleopod and abdominal muscle tissues were generated. After trimming the adapters, 399,056,712 (Table 1) reads were assembled with Trinity resulting in 110,474 contigs (Supplemental Data 1) with an N50 of 2,701 bases (Table 2). The HiSeq platform, the diversity of tissues sampled and the highly efficient Trinity pipeline allowed this significant improvement over recent transcriptomics efforts yielding only N50 values < 1000^3,4,6.

Table 1 RNA-Seq reads before and after adapter filtering

Full size table

Table 2 Summary statistics of L. vannamei transcriptome assembly

Full size table

Functional annotation of shrimp transcriptome

Following the transcriptome assembly, we annotated the contigs by the Trinotate pipeline. The Trinotate software suite automates the functional annotation of the transcriptome.

Our annotation report from the Trinotate pipeline is presented as a document with the following column headers: 1) gene_id, 2) transcript_id, 3) Top_BLASTX_hit, 4) RNAMMER, 5) prot_id, 6) prot_coords, 7) Top_BLASTP_hit, 8) Pfam, 9) SignalP, 10) TmHMM, 11) eggNOG, 12) gene_ontology and 13) prot_seq. The first two columns are: “gene_id” and “transcript_id”, representing predicted genes and their corresponding transcripts, respectively. The columns “Top_BLASTX_hit” and “Top_BLASTP_hit” show the top BlastX and BlastP hit results of homology searches against the NCBI database. BlastX is one of the latest additions to the Trinotate annotation pipeline and compares all six open reading frames (ORF) of the query sequences against the protein database. The RNAMMER column shows information about predicted ribosomal RNA genes discovered in the transcriptome assembly that were predicted by hidden Markov models (HMM). The prot_id, prot_coords and prot_seq columns provide the ID, location and translation of the longest ORFs, respectively. The Pfam column represents the HMMER/PFAM protein domain identification search results. HMMER is used to search databases for homologs of proteins, employing hidden Markov models. The SignalP column shows the presence and location of predicted signal peptides. Similarly, the TmHMM column presents the predicted transmembrane regions. The eggNOG (Evolutionary genealogy of genes: Non-supervised Orthologous Groups) column has the search result of the database of orthologous groups of genes, which are further annotated with functional description lines. Lastly, the gene_ontology column shows the relationship of these shrimp data to the Gene Ontology (GO) terms that aim to unify the representation of genes and gene products across all species.

Using the Trinotate pipeline, a total of 165,922 annotations were determined for our Trinity assembled contigs. We have designated the last eleven columns of Trinotate output detailed above (Top_BlastX_hit through prot-seq) as Annotation Holding Output Columns (AHOC). Considering that more than 165 K annotations for our contigs are too numerous for careful examination, we have created the following three filters: Filter A selects rows that have at least 10 AHOC, Filter B selects rows that have at least 9 AHOC and Filter C selects rows that have at least 8 AHOC. Applying these filters to our data, Filter A results in selecting 590 gene IDs, Filter B 4,843 gene IDs and finally Filter C 21,323 gene IDs. This method ensures that predicted genes/transcripts with maximum annotation information can be selected for targeted manual curating. Supplemental Data 2 represents the results of these filtrations graphically and the filtered datasets are available in Supplemental Data 3.

In comparing these shrimp translated transcriptome contigs to those of other animals we were also curious as to know what proteins in NCBI's non-redundant database and from which species were most highly represented in BlastX hits of these assembled contigs (Supplemental Data 4). Drosophila myosin had the most (107) hits and Drosophila proteins dominated these BlastX results. Many of the proteins with high hits could be considered “housekeeping” gene products, but some tissue-specific proteins were in the top twenty, such as Downs syndrome cell adhesion molecule (Dscam)¹². Dscam receptors are diversified through mutually exclusive alternative exon splicing¹³, have roles in self-recognition in immunity and neural development¹⁴ and have been characterized in this species of shrimp¹⁵ as well as other crustaceans¹⁶.

Immune gene survey

To test the depth and accuracy of this annotated transcriptome, we searched in our dataset for the immune-related transcripts discovered by Sookruksawong et al.⁹. These transcripts were particularly interesting to us because they were representing differentially expressed immune-related genes between shrimp lines resistant and susceptible to Taura syndrome virus (TSV) and may also represent genes important in shrimp resistance to current scourges such as early mortality syndrome (EMS), also known as acute hepatopancreatic necrosis syndrome (AHPNS)¹⁷.

We used BlastX to find similarity between our 110,474 contigs as queries and the proteins identified in the TSV transcriptomic study⁹. There were 4,493 hits with an e-value < 1E-4 among our 110,474 contigs and 3,088 hits with an e-value < 1E-10. In our subsequent analysis, we used the 3,088 hits resulting from this more restrictive (<1E-10) e-value filtering. Supplemental Data 5 shows the complete list of these proteins with additional information for each protein. In the first post-processing, we selected the top 50 BlastX hits with the more stringent e-value filtering and completed their annotation information by adding corresponding NCBI information to their records. Additionally, we selected the most frequently appearing (>40 hits) of these previously identified immune related proteins among our top BlastX hits and showed their representation as a pie chart. Figure 1 shows these proportions among our 3,088 BlastX results for all contigs. The highest three hits are zinc-finger 658b-like and serine/threonine phosphatase ankyrin repeat-like both from the purple sea urchin Strongylocentrus purpatus and zinc-finger BTB domain from the bee Bombus impatiens. This initial comparison survey will springboard studies of other immune gene families^18,19.

Arthropod conservation and new decapod crustacean genes

We also categorized the BlastX results to understand the distribution and frequency of the species that appear in the homology search between our contigs and the NCBI database. Figure 2 depicts all of the hits that had at least twenty BlastX hits and their frequency of appearing. Drosophila melanogaster is greatly represented (20 of the 59 top BlastX hits) as the model arthropod that has received most intensive study for decades and we suspect has the best coverage in the bioinformatic databases of tissue and developmental stage specific expression of any arthropod. Well-studied mammals are interspersed with invertebrate hits (13 human, 8 mouse, 3 rat, 3 Xenopus and 1 zebrafish in the top 59 that hit 20 or more times).

Furthermore, we used BLAST to search similarities between our Trinity assembled contigs and the Daphnia pulex genome. The Joint Genome Institute annotation of v1.0 “Frozen Gene Catalog” was employed, which has all manual curations, as well as automatically annotated models chosen from the FilteredModels v1.1 set. In the first search, we used the BlastN tool to search for sequence similarities between our contigs and the D. pulex transcripts and CDs. We found 5,668 contigs with blast hits that had e-values < 1E-4 and 2,610 with hits scores as e-value < 1E-10 (Table 3). Table 4 shows the BlastN search results of the contigs against the D. pulex transcripts and CDs, for the latter case. The results are sorted by the most abundant hits, representing the top twenty genes, which had annotation information available at wFleaDatabase. It provides the gene IDs assigned by wFleaDatabase, as well as NCBI IDs. The complete 960 hits are available as Supplemental Data 6.

Table 3 Blast of L. vannamei contigs against D. pulex

Full size table

Table 4 Top BlastN search results for Trinity assembled contigs against D. pulex transcripts and CDS database

Full size table

In order to find the proteins corresponding to our contigs, we employed BlastX to examine all six frames. The D. pulex protein database was used. The BlastX search resulted in 30,534 hits with e-value < 1E-4 and 26,224 hits with e-value < 1E-10. The results are represented in Table 5 for the twenty most frequents BlastX hits, with e-value < 1E-10. For each entry, the corresponding UniProt link is provided for inquiring further information.

Table 5 Top BlastX search results for Trinity assembled contigs against D. pulex protein database

Full size table

KEGG and GO pathway analyses

We also analyzed our contigs with the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database²⁰. Figure 3 shows an example of this analysis denoting the transcripts identified by our study involved in DNA replication. The protein products of several of these transcripts are required for White Spot Syndrome Virus (WSSV) replication²¹ and offer potential targets for interference with the pathogen's replication cycle²². An additional 337 pathways with shrimp orthologs represented in our transcriptome highlighted are available from the authors.

BLAST2GO (Figure 4) identified D. pulex as the top-hit species to this L. vannamei transcriptome by Blast analysis, followed by Tribolium castaneum and in third place Pediculus humanus. Chen et al⁶ identified the latter species as the one with most Blast hits of the White Spot Syndrome response transcriptome. Our comparison seems more consistent with the evolutionary relationship between Daphnia and Penaeid crustaceans.

At the level of molecular function, binding molecules were found to be most abundant among the transcriptome (53%) followed by proteins with catalytic activity (28%). Antioxidant responses are invoked as a key component of the shrimp immune response and in this reference transcriptome appeared in less than one percent of the transcripts. However, it is known that they are more abundant upon an immune challenge⁶.

Discussion

Invertebrate shrimp do not have a classical adaptive immune system based upon antibodies, T cells and specific memory of antigen^23,24. Although our definitions of immunity beyond the jawed vertebrates are shifting²⁵, there is so far only a consensus of evidence of the more ancient innate system in shrimp²⁶. Thus, we cannot vaccinate shrimp for protection against WSSV or EMS²⁷, making understanding of the total expressed transcriptome, genetic variability in the innate immune genes and regulatory regions of this invertebrate species all the more important. This improved transcriptome for L. vannamei is a step towards that goal.

L. vannamei is the most farmed of many important decapod crustacean food species including other shrimp and prawns, crayfish, lobster and crabs. Intensive aquaculture methods continue to improve as the global demand for shellfish increases. Thus, the application of this work in the whiteleg shrimp transcriptome will be applied back into the production organism in the aquaculture facility and will pave the way for similar immunogenomic enhancement in other species²⁸. For example the congener Litopenaeus setiferus of the northwest Atlantic and Gulf of Mexico is sometimes still cultured and is also susceptible to white spot syndrome virus, findings in L. vannamei will likely be readily adaptable to this sister species^29,30. On even broader aquaculture industry supply and biosphere “One Health” levels, the improvements to this decapod crustacean transcriptome (and eventually other –omics) could have relevance for studies of krill. These small shrimp-like crustaceans are increasingly harvested for fish-meal and are often cited as the keystone species of the polar ecosystems³¹. They are threatened by climate change and krill decline due to diatom losses in 1998 were linked to plummeting shearwater populations and the lack of that year's salmon spawn³².

Also of note is the potential ability this robust transcriptome affords in addressing the question of what it is that makes decapod crustaceans unique. Our comparison of the L. vannamei transcriptome to D. pulex also reveals the decapod shrimp transcripts that do not match any transcript from the branchiopod Daphnia. Besides the importance of these animals to fisheries, the order Decapoda provides crucial scavenging of aquatic ecosystems. Comparative transcriptomics can now address the developmental genetics that have allowed these crustaceans to diversify and dominate this ecological niche. In addition, this shrimp transcriptome can in turn serve as a point of genetic contrast for other more derived groups such as Brachyuran crabs that may have undergone carcinisation several independent times during their natural history by tagmosis of segments, anterial-posterial compression and ventral fusion of the abdomen³³.

Several lines of inquiry have suggested that arthropods are capable of mounting specific immune responses subsequent to immune priming^{34,35,36,37,38}. The gene that has drawn the most attention for possibly conferring adaptive immune capabilities in these invertebrates is DSCAM. Shrimp DSCAM employs alternative splicing to greatly diversify the expressed DSCAM repertoire, using many more possible introns for the diverse portion of the third immunoglobulin domain (79 in L. vannamei and P. monodon, versus 48 in D. melanogaster, 24 in Daphnia magna and 39 in Eriocheir sinensis)³⁹. DSCAM transcripts with diverse immunoglobulin domains, transmembrane regions and cytoplasmic tails are represented in the transcriptome described here (e.g. comp873 and comp34255, Supplemental Data 1).

Germane to analyses of taxa specific genes and particularly immune related genes are the assembly of transcripts encoded by multigene families employing only short NGS data de novo without a genome. The lack of introns is problematic for the more difficult to resolve multigene families. For example, a newly birthed retrogene lacking an intron could be incorrectly assembled with its progenitor gene using our NGS dataset, a confusion that would be easily avoided with genomic data. This is both a cautionary warning against over-extrapolating these data and also mandates genomic tools in this species.

With a genome of 2.5 gigabases spread over 44 pairs of chromosomes⁴⁰, genomic resources for L. vannamei are still lacking. This annotated transcriptome should help to accelerate functional genomics in this species and perhaps other commercially and environmentally important decapod crustaceans while full genome sequencing projects are in progress. As food produced from global aquaculture now exceeds beef globally, shrimp is primed to play a major role in the “blue revolution” if the genetics of nutrition, growth and immunity can quickly catch up to the decades of study species such as cattle and poultry have enjoyed⁴¹.

Methods

Shrimp

Total RNA was prepared from one male Pacific white leg shrimp (L. vannamei) collected near Bahia Kino, Sonora, Mexico. Four tissues were targeted for distinct RNA-Seq experiments: hepatopancreas, gills, abdominal (tail) muscle and pleopods. These four tissues allow transcriptomic profiling of cells engaged in many physiological processes in this arthropod including production of digestive enzymes, absorption of food, respiration, long-sarcomere muscle contraction (unique to invertebrates), mucosal immunity and reproduction.

RNA isolation and Illumina sequencing

Total RNA was prepared from the four tissues using Trizol Reagent (Life Technologies) per the manufacturer's instructions. From total RNA, mRNA was purified using magnetic oligo(dT) beads, then fragmented using divalent cations under elevated temperature. cDNA was synthesized from the fragmented mRNA using Superscript II (Invitrogen), followed by second strand synthesis. cDNA fragment ends were repaired and phosphorylated using Klenow, T4 DNA Polymerase and T4 Polynucleotide Kinase. Next, an adenosine was added to the 3′ end of the blunted fragments, followed by ligation of Illumina adapters via T-A mediated ligation. The ligated products were size selected by AMPure XP Beads and then PCR amplified using Illumina primers. The library creation, quality determination using an Agilent Bioanalyzer and analysis were performed by Ambry Genomics. The Illumina HiSeq2000 was employed for sequence analysis. The libraries were seeded onto the flowcell at 9pM per lane yielding approximately 745 K pass-filter clusters per mm² tile area. The libraries were sequenced using 116 + 10 + 101 cycles of chemistry and imaging.

Data preprocessing

Initial data processing and base calling, including extraction of cluster intensities, was done using RTA 1.13.48 (HiSeq Control Software 1.5.15). Sequence quality filtering script was executed in the Illumina CASAVA software (version 1.8.2, Illumina, Hayward, CA). Data yield (in Mbases), percent pass-filter (%PF), number of reads, percent raw clusters per lane and quality (%Q30) was examined in Demultiplex_Stats.htm files. Other quality metrics were assessed in All.htm and IVC.htm including IVC plots and visualizations of cluster intensity over the duration of the sequencing run. The percentage of Q30 bases that passed filtering was above 77 for all lanes of the Illumina HiSeq flowcell.

Transcriptome assembly

De novo assembly of the L. vannamei transcriptome was performed from the RNA-Seq data from the four pooled tissues with the trimming off adaptor sequences. The February 25, 2013 release of Trinity was used for assembly, Trinity Trans-decoder for coding sequence prediction and Trinotate for functional annotation. The overall workflow is summarized graphically in Figure 5 and the transcriptome assembly in more detail in the next paragraph.

After initial quality control steps, any reads that had adapter sequences attached to them were discarded. For adapter trimming the Cutadapt software was used⁴². The adapter-free reads were used for transcriptome assembly using the Trinity software⁴³. Trinity is a de novo algorithm developed specifically for reconstructing the transcriptome, using de Bruijn graphs. Transcriptome assembly is challenging mainly because RNA-Seq data coverage levels are not evenly distributed. Furthermore, alternative splicing complicates assembly from individual genes. The goal of the Trinity package is to deliver one graph per expressed gene. Trinity consists of three parts: 1) Inchworm, 2) Chrysalis and 3) Butterfly. During these three steps, Trinity makes linear contigs from RNA-Seq reads, generates and expands de Bruijn graphs and finally outputs the transcripts and isoforms. The process starts by decomposing the reads into small overlapping pieces called Kmers and extending them by coverage. Finding the common sections of the intermediate transcripts determines alternative splicing and those transcripts are re-grouped. The de Bruijn graphs are generated by integration of isoforms that are similar except one base. And finally, by finding and expanding the common section of transcripts and representing the most compact path on the graph, Trinity delivers the fully assembled transcriptome. The outputs are saved in a FASTA format, which includes all the transcripts. These can be accessed here: http://repository.tamu.edu/handle/1969.1/152151.

KEGG pathway and gene ontology analysis

The KEGG Automatic Annotation Server (KAAS, http://www.genome.jp/kaas-bin/kaas_main?mode=est_b) was employed to map KEGG pathways of assigned shrimp orthologs²⁰. KEGG orthology (KO) assignments were performed based on the bi-directional best hit (BHH) of BLAST⁴⁴. Functional annotation of the gene ontology (GO) terms was done using the BLAST2GO program (http://www.BLAST2GO.org/)⁴⁵.

References

Cock, J., Gitterle, T., Salazar, M. & Rye, M. Breeding for disease resistance of Penaeid shrimps. Aquaculture 286, 1–11, DOI 10.1016/j.aquaculture.2008.09.011 (2009).
Article Google Scholar
Tran, L. et al. Determination of the infectious nature of the agent of acute hepatopancreatic necrosis syndrome affecting penaeid shrimp. Dis Aquat Organ 105, 45–55, Doi 10.3354/Dao02621 (2013).
Article PubMed Google Scholar
Li, C. et al. Analysis of Litopenaeus vannamei transcriptome using the next-generation DNA sequencing technique. PLoS One 7, e47442, 10.1371/journal.pone.0047442 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Guo, H. et al. Trascriptome analysis of the Paci fi c white shrimp Litopenaeus vannamei exposed to nitrite by RNA-seq. Fish & shellfish immunology 35, 2008–2016, 10.1016/j.fsi.2013.09.019 (2013).
Article CAS Google Scholar
O'Leary, N. A. et al. Analysis of multiple tissue-specific cDNA libraries from the Pacific whiteleg shrimp, Litopenaeus vannamei. Integrative and comparative biology 46, 931–939, 10.1093/icb/icl006 (2006).
Article CAS PubMed Google Scholar
Chen, X. et al. Transcriptome analysis of Litopenaeus vannamei in response to white spot syndrome virus infection. PLoS One 8, e73218, 10.1371/journal.pone.0073218 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Clavero-Salas, A. et al. Transcriptome analysis of gills from the white shrimp Litopenaeus vannamei infected with White Spot Syndrome Virus. Fish & shellfish immunology 23, 459–472, 10.1016/j.fsi.2007.01.010 (2007).
Article CAS Google Scholar
Robalino, J. et al. Insights into the immune transcriptome of the shrimp Litopenaeus vannamei: tissue-specific expression profiles and transcriptomic responses to immune challenge. Physiological genomics 29, 44–56, 10.1152/physiolgenomics.00165.2006 (2007).
Article CAS PubMed Google Scholar
Sookruksawong, S., Sun, F., Liu, Z. & Tassanakajon, A. RNA-Seq analysis reveals genes associated with resistance to Taura syndrome virus (TSV) in the Pacific white shrimp Litopenaeus vannamei. Dev Comp Immunol 41, 523–533, 10.1016/j.dci.2013.07.020 (2013).
Article CAS PubMed Google Scholar
Veloso, A., Warr, G. W., Browdy, C. L. & Chapman, R. W. The transcriptomic response to viral infection of two strains of shrimp (Litopenaeus vannamei). Dev Comp Immunol 35, 241–246, 10.1016/j.dci.2010.10.001 (2011).
Article CAS PubMed Google Scholar
Zeng, D. et al. Transcriptome analysis of Pacific white shrimp (Litopenaeus vannamei) hepatopancreas in response to Taura syndrome Virus (TSV) experimental infection. PLoS One 8, e57515, 10.1371/journal.pone.0057515 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Cerenius, L. & Soderhall, K. Variable immune molecules in invertebrates. The Journal of experimental biology 216, 4313–4319, 10.1242/jeb.085191 (2013).
Article CAS PubMed Google Scholar
Meijers, R. et al. Structural basis of Dscam isoform specificity. Nature 449, 487–491, nature06147 [pii] 10.1038/nature06147 (2007).
Article CAS ADS PubMed Google Scholar
Shi, L. & Lee, T. Molecular diversity of Dscam and self-recognition. Advances in experimental medicine and biology 739, 262–275, 10.1007/978-1-4614-1704-0_17 (2012).
Article CAS PubMed Google Scholar
Hung, H. Y. et al. Properties of Litopenaeus vannamei Dscam (LvDscam) isoforms related to specific pathogen recognition. Fish & shellfish immunology 35, 1272–1281, 10.1016/j.fsi.2013.07.045 (2013).
Article CAS Google Scholar
Watthanasurorot, A., Jiravanichpaisal, P., Liu, H., Soderhall, I. & Soderhall, K. Bacteria-Induced Dscam Isoforms of the Crustacean, Pacifastacus leniusculus. PLoS pathogens 7, e1002062, 10.1371/journal.ppat.1002062 (2011).
Article CAS PubMed PubMed Central Google Scholar
Bondad-Reantaso, M. G., Subasinghe, R. P., Josupeit, H., Cai, J. N. & Zhou, X. W. The role of crustacean fisheries and aquaculture in global food security: Past, present and future. Journal of invertebrate pathology 110, 158–165, DOI 10.1016/j.jip.2012.03.010 (2012).
Article PubMed Google Scholar
Buckley, K. M. & Rast, J. P. Characterizing immune receptors from new genome sequences. Methods Mol Biol 748, 273–298, 10.1007/978-1-61779-139-0_19 (2011).
Article CAS PubMed Google Scholar
Tassanakajon, A., Somboonwiwat, K., Supungul, P. & Tang, S. Discovery of immune molecules and their crucial functions in shrimp immunity. Fish & shellfish immunology 34, 954–967, 10.1016/j.fsi.2012.09.021 (2013).
Article CAS Google Scholar
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A. C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic acids research 35, W182–185, 10.1093/nar/gkm321 (2007).
Article PubMed PubMed Central Google Scholar
Carrasco-Miranda, J. S. et al. Crystal structure of the shrimp proliferating cell nuclear antigen: structural complementarity with WSSV DNA polymerase PIP-box. PLoS One 9, e94369, 10.1371/journal.pone.0094369 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Attasart, P. et al. Inhibition of white spot syndrome virus replication in Penaeus monodon by combined silencing of viral rr2 and shrimp PmRab7. Virus research 145, 127–133, 10.1016/j.virusres.2009.06.018 (2009).
Article CAS PubMed Google Scholar
Hoffmann, J. A., Kafatos, F. C., Janeway, C. A. & Ezekowitz, R. A. B. Phylogenetic perspectives in innate immunity. Science 284, 1313–1318, DOI 10.1126/science.284.5418.1313 (1999).
Article CAS ADS PubMed Google Scholar
Hauton, C. & Smith, V. J. Adaptive immunity in invertebrates: a straw house without a mechanistic foundation. Bioessays 29, 1138–1146, Doi 10.1002/Bies.20650 (2007).
Article CAS PubMed Google Scholar
Criscitiello, M. F. & de Figueiredo, P. Fifty shades of immune defense. PLoS pathogens 9, e1003110, 10.1371/journal.ppat.1003110 (2013).
Article CAS PubMed PubMed Central Google Scholar
Soderhall, K. Special issue: Invertebrate immunity. Developmental and Comparative Immunology 23, 263–266 (1999).
Article CAS Google Scholar
Lee, S. Y. & Soderhall, K. Early events in crustacean innate immunity. Fish & shellfish immunology 12, 421–437, DOI 10.1006/fsim.2002.0420 (2002).
Article CAS Google Scholar
Hauton, C. & Smith, V. Studying immunity at the whole organism level - Reply. Bioessays 30, 406–406, Doi 10.1002/Bies.20737 (2008).
Article Google Scholar
Lightner, D. V. Epizootiology, distribution and the impact on international trade of two penaeid shrimp viruses in the Americas. Rev Sci Tech Oie 15, 579–601 (1996).
Article CAS Google Scholar
Lightner, D. V., Hasson, K. W., White, B. L. & Redman, R. M. Experimental infection of western hemisphere penaeid shrimp with Asian white spot syndrome virus and Asian yellow head virus. J Aquat Anim Health 10, 271–281, Doi 10.1577/1548-8667(1998)0100271:Eiowhp>2.0.Co;2 (1998).
Article Google Scholar
Simard, Y. & Harvey, M. Predation on Northern krill (Meganyctiphanes norvegica Sars). Advances in marine biology 57, 277–306, 10.1016/B978-0-12-381308-4.00010-8 (2010).
Article PubMed Google Scholar
Macklin, S. A. Report on the FOCI International Workshop on Recent Conditions in the Bering Sea. 22–26 (National Oceanographic and Atmospheric Administration Seattle, 1998).
Tsang, L. M., Chan, T. Y., Ahyong, S. T. & Chu, K. H. Hermit to King, or Hermit to All: Multiple Transitions to Crab-like Forms from Hermit Crab Ancestors. Systematic biology 60, 616–629, DOI 10.1093/sysbio/syr063 (2011).
Article PubMed Google Scholar
Moret, Y. & Siva-Jothy, M. T. Adaptive innate immunity? Responsive-mode prophylaxis in the mealworm beetle, Tenebrio molitor. Proceedings. Biological sciences/The Royal Society 270, 2475–2480, 10.1098/rspb.2003.2511 (2003).
Article CAS Google Scholar
Rahman, M. M., Roberts, H. L., Sarjan, M., Asgari, S. & Schmidt, O. Induction and transmission of Bacillus thuringiensis tolerance in the flour moth Ephestia kuehniella. Proc Natl Acad Sci U S A 101, 2696–2699, 10.1073/pnas.0306669101 (2004).
Article CAS ADS PubMed PubMed Central Google Scholar
Sadd, B. M. & Schmid-Hempel, P. Insect immunity shows specificity in protection upon secondary pathogen exposure. Current biology: CB 16, 1206–1210, 10.1016/j.cub.2006.04.047 (2006).
Article CAS PubMed Google Scholar
Pham, L. N., Dionne, M. S., Shirasu-Hiza, M. & Schneider, D. S. A specific primed immune response in Drosophila is dependent on phagocytes. PLoS pathogens 3, e26, 10.1371/journal.ppat.0030026 (2007).
Article CAS PubMed PubMed Central Google Scholar
Roth, O. et al. Paternally derived immune priming for offspring in the red flour beetle, Tribolium castaneum. The Journal of animal ecology 79, 403–413, 10.1111/j.1365-2656.2009.01617.x (2010).
Article PubMed Google Scholar
Ng, T. H., Chiang, Y. A., Yeh, Y. C. & Wang, H. C. Review of Dscam-mediated immunity in shrimp and other arthropods. Dev Comp Immunol 46, 129–138, 10.1016/j.dci.2014.04.002 (2014).
Article CAS PubMed Google Scholar
Chow, S., Dougherty, W. J. & Sandifer, P. A. Meiotic Chromosome Complements and Nuclear-DNA Contents of 4 Species of Shrimps of the Genus Penaeus. J Crustacean Biol 10, 29–36, Doi 10.2307/1548667 (1990).
Article Google Scholar
Simpson, S. The Blue Food Revolution. Sci Am 304, 54–61, DOI 10.1038/scientificamerican0211-54 (2011).
Article Google Scholar
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10–12 (2011).
Article Google Scholar
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nature protocols 8, 1494–1512, 10.1038/nprot.2013.084 (2013).
Article CAS PubMed Google Scholar
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic acids research 28, 27–30 (2000).
Article CAS Google Scholar
Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676, 10.1093/bioinformatics/bti610 (2005).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by grants from Texas A&M Agrilife and Texas Veterinary Medical Diagnostic Laboratory #124151 to CDJ, SVD and MFC, Texas A&M and Mexican CONACyT to RRS, SVD and MFC, USDA Formula Animal Health to MFC and SVD and Mexican National Institute of Fisheries (INAPESCA) to RRS, LGB, ERP, SVD and MFC. The generation of RNA-Seq raw data was outsourced to Ambry Genetics, Aliso Viejo CA. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number OCI-1053575, to provide compute time on the Blacklight system at the Pittsburgh Supercomputing Center (PSC).

Author information

Authors and Affiliations

Genomics and Bioinformatic Services, Texas A&M AgriLife Research, College Station, TX 77845, USA
Noushin Ghaffari & Charles D. Johnson
Unidad Universitaria de Apoyo Bioiformático, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
Alejandro Sanchez-Flores
Department of Veterinary Pathobiology, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX 77843, USA
Ryan Doan, Scott V. Dindot & Michael F. Criscitiello
Centro de Investigación en Alimentación y Desarrollo (CIAD), Carretera a Ejido La Victoria, Km 0.6, Sonora, 83304, Hermosillo, Mexico
Karina D. Garcia-Orozco, Alonso A. Lopez-Zavala, J. Salvador Carrasco & Rogerio R. Sotelo-Mundo
Department of Veterinary Pathobiology, Comparative Immunogenetics Laboratory, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX 77843, USA
Patricia L. Chen, Chris Hong & Michael F. Criscitiello
Unidad de Genómica de Poblaciones Aplicada la Salud, Facultad de Química, UNAM, Instituto Nacional de Medicina Genómica (INMEGEN), México, 14610, D.F., Mexico
Adrian Ochoa-Leyva
Laboratorio Nacional de Genómica para la Biodiversidad, Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato , Mexico
Luis G. Brieba
Departamento de Medicina Molecular y Bioprocesos, Instituto de Biotecnologia, Universidad Nacional Autónoma de Mexico, Cuernavaca, Morelos, Mexico
Enrique Rudiño-Piñera
Pittsburgh Supercomputing Center, Pittsburgh, PA 15213, USA
Philip D. Blood
Department of Animal Sciences, Texas Agrilife Research, Texas A&M University, College Station, TX 77843, USA
Jason E. Sawyer
Department of Microbial Pathogenesis and Immunology, College of Medicine, Texas A&M Health Sciences Center, Texas A&M University, College Station, TX 77843, USA
Michael F. Criscitiello

Authors

Noushin Ghaffari
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Sanchez-Flores
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Doan
View author publications
You can also search for this author in PubMed Google Scholar
Karina D. Garcia-Orozco
View author publications
You can also search for this author in PubMed Google Scholar
Patricia L. Chen
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Ochoa-Leyva
View author publications
You can also search for this author in PubMed Google Scholar
Alonso A. Lopez-Zavala
View author publications
You can also search for this author in PubMed Google Scholar
J. Salvador Carrasco
View author publications
You can also search for this author in PubMed Google Scholar
Chris Hong
View author publications
You can also search for this author in PubMed Google Scholar
Luis G. Brieba
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Rudiño-Piñera
View author publications
You can also search for this author in PubMed Google Scholar
Philip D. Blood
View author publications
You can also search for this author in PubMed Google Scholar
Jason E. Sawyer
View author publications
You can also search for this author in PubMed Google Scholar
Charles D. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Scott V. Dindot
View author publications
You can also search for this author in PubMed Google Scholar
Rogerio R. Sotelo-Mundo
View author publications
You can also search for this author in PubMed Google Scholar
Michael F. Criscitiello
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.G., L.B., E.R., J.A.S., C.D.J., S.V.D., R.R.S. and M.F.C. designed the project. A.A.L., J.S.C., R.R.S. and M.F.C. collected specimens and harvested tissues. R.D., K.D.G., P.C., A.A.L., J.S.C., C.H., S.V.D., R.R.S. and M.F.C. prepared nucleic acids and sequencing libraries. N.G., A.S., A.O., C.H., P.D.B., S.V.D. and R.R.S. performed bioinformatics analyses. N.G. and M.F.C. drafted the text. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplemental Information (Supplemental Data 2)

Supplementary Information

Supplemental Data 1

Supplementary Information

Supplemental Data 3

Supplementary Information

Supplemental Data 4

Supplementary Information

Supplemental Data 5

Supplementary Information

Supplemental Data 6

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/

Reprints and permissions

About this article

Cite this article

Ghaffari, N., Sanchez-Flores, A., Doan, R. et al. Novel transcriptome assembly and improved annotation of the whiteleg shrimp (Litopenaeus vannamei), a dominant crustacean in global seafood mariculture. Sci Rep 4, 7081 (2014). https://doi.org/10.1038/srep07081

Download citation

Received: 04 June 2014
Accepted: 29 August 2014
Published: 25 November 2014
DOI: https://doi.org/10.1038/srep07081

This article is cited by

Shrimp Glucose-6-phosphatase 2 (G6Pase 2): a second isoform of G6Pase in the Pacific white shrimp and regulation of G6Pase 1 and 2 isoforms via HIF-1 during hypoxia and reoxygenation in juveniles
- Laura E. Hernández-Aguirre
- Alma B. Peregrino-Uriarte
- Gloria Yepiz-Plascencia
Journal of Bioenergetics and Biomembranes (2023)
Characterization of the Gene Repertoire and Environmentally Driven Expression Patterns in Tanner Crab (Chionoecetes bairdi)
- Grace Crandall
- Pamela C. Jensen
- Steven Roberts
Marine Biotechnology (2022)
Adaptive mitochondrial response of the whiteleg shrimp Litopenaeus vannamei to environmental challenges and pathogens
- Chrystian Rodriguez-Armenta
- Orlando Reyes-Zamora
- Adriana Muhlia-Almazan
Journal of Comparative Physiology B (2021)
The gluconeogenic glucose-6-phosphatase gene is expressed during oxygen-limited conditions in the white shrimp Penaeus (Litopenaeus) vannamei: Molecular cloning, membrane protein modeling and transcript modulation in gills and hepatopancreas
- Laura E. Hernández-Aguirre
- Keni Cota-Ruiz
- Gloria Yepiz-Plascencia
Journal of Bioenergetics and Biomembranes (2021)
Identification of Functional SSR Markers in Freshwater Ornamental Shrimps Neocaridina denticulata Using Transcriptome Sequencing
- Chang-Wen Huang
- Pei-Yun Chu
- Yeh-Hao Wang
Marine Biotechnology (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

High quality de novo assembly of L. vannamei transcriptome

Functional annotation of shrimp transcriptome

Immune gene survey

Arthropod conservation and new decapod crustacean genes

KEGG and GO pathway analyses

Discussion

Methods

Shrimp

RNA isolation and Illumina sequencing

Data preprocessing

Transcriptome assembly

KEGG pathway and gene ontology analysis

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links