Abstract
A cDNA library from a pool of all the seven silk glands from a tropical spider species was constructed. More than 1000 expressed sequence tag (EST) clones were created. Almost 65% of the EST clones were identified and around 50% were annotated. The cellular and functional distribution of the EST clones indicated high protein synthesis activity in spider silk glands. Novel clones with repetitive amino acid sequences, which is one of the most important characteristics of spider silk genes, were isolated. One of these clones, namely TuSp2 in current research, contains two almost identical fragments with one short C-terminal domain. Reverse transcription (RT) PCR and expression analysis showed that it is expressed in the tubuliform gland and involved in eggcase silk formation. Furthermore, its single repetitive domain can be induced to form various types of materials, including macroscopic fibers, transparent film and translucent hydrogel. This study implies promising potentials for future identification of novel spidroins and development of new spidroin-based biomaterials.
Similar content being viewed by others
Introduction
Spider silks have been renowned for their extraordinary mechanical properties. The combination of high tensile strength and elasticity makes them comparable to the best man-made materials using modern technology1. In order to understand their structures-functions-property relationships, a number of spider fibroin (spidroin) cDNA/genes have been identified, including major ampullate fibroin MaSp1 and MaSp22,3,4,5,6,7,8,9; minor ampullate silk gland fibroin MiSp1 and MiSp25,10, flagelliform silk gland fibroin Flag4,11, tubuliform silk gland fibroin TuSp112,13,14,15,16,17, ADF-2 (later characterized as MaSp1) and ECP or ECP-like genes5,12,13,18, and aciniform silk gland fibroin AcSp119. Recently, silk gene from piriform (constructing attachment disc) silk gland and aggregate gland (secreting sticky glue) were also reported20,21,22.
There might be more spider silk fibroin genes awaiting isolation and characterization. In addition, exclusive of the major component from each gland, there may be other minor fibroin components which also play roles in silk fiber formation. For example, in addition to the major component of TuSp1, ADF-2 and ECP-1,2 were isolated from the tubuliform gland or egg case5,12,13, which may be involved in the silk formation and/or conformation transition of the silk fiber assembly.
Besides the attempt of identifying novel spider silk cDNA/genes, many efforts have been put on the spinning process, through which the soluble highly concentrated fibroin dope is converted to insoluble fibers23,24,25,26,27. It is indicated that spinning conditions play a role on the mechanical properties of the fibers26. In this spinning process, pH decreases28 and ion content changes26,29 along the spinning duct. At the same time, water content decreases drastically in the concentrated protein solution. Although some models were suggested for the spinning process, molecular information for elucidating this sophisticated process is still incomplete. Therefore, it is necessary to identify the molecular basis which underlies the complex molecular events of silk formation.
Sequencing of randomly selected cDNA clones (EST, expressed sequence tag)30 has been proven to be a relatively inexpensive and efficient means to rapidly access many of the expressed genes of a specific tissue or organism. This method is important for providing the expression profile of the source tissue. It has also been commonly used to identify novel genes involved in specific biological processes, and is especially valuable for organisms where genomic data are limited.
In this study, we used the EST method to identify the molecules involved in spider silk formation. Mainly because of its availability and large body size, a local species of the orb-web spider, Nephila antipodiana (N.a.), was used in this study. The expression profile of spider silk glands was analyzed through our EST data and a number of molecules that have implications in silk fibroin synthesis, silk formation or procession were identified. We also report the identification of a partial novel cDNA molecule, which is specifically expressed in the tubuliform gland of N.a. and involved in eggcase silk formation.
Results and Discussion
Overview of EST clones in spider silk glands
A cDNA library from the 7 types of silk glands of 5 adult female N.a. was constructed. The titer of the primary library was 7.5 × 106 pfu while that of the amplified library reached 9 × 1011 pfu/ml. The empty clone percentage was 1.8% and the average insert length was approximately 900 bp, which ranges from 300 bp to 3000 bp. These data are summarized in Table 1. Following the method of size selection and sequencing, 1076 randomly selected clones from the spider silk gland cDNA library were single-pass sequenced from the 5′ ends. Fasta analysis showed that 698 of the 1076 clones had statistically significant similarity (Poisson P value ≤ 10−5, sequence overlap > 150 bp) to the available sequences in the public DNA database. This high identification rate may be mainly due to the pre-selection of the insert size, leading to sequencing only the coding region, which is more conservative than the 3′ UTR (un-translated region). Besides, more and more DNA sequences are available, providing a larger pool to compare with. Sequences of 505 EST clones (with informative functions) generated from this study have since been submitted to NCBI dbEST (NCBI access no. GT029466 to GT029977). The remaining 378 clones did not share significant sequence homology with any known sequences in the databases; and are under further investigations (Table 2).
Distribution of the identified clones in spider silk glands cDNA library
As cellular localization of a protein is closely related to their biological functions, the identified EST clones from the spider silk glands (698 clones from the 7 types of silk glands) are classified into seven broad categories largely according to the major cellular localization of their encoded proteins: cytosolic proteins (CS), cytoskeletal protein (CK), membrane proteins(MP), nuclear proteins(NP), secreted and extracellular proteins(SP), translational machinery proteins(TM), mitochondria proteins (MT) (including both nucleus coded and mitochondria coded proteins) and the clones encode proteins with unknown location due to the shortage of information (UN). The distribution of cellular localization of the EST clones is listed in Table 3. The cytosolic proteins (30%) and proteins involved in translational machinery (22%) exceeded 20%. As we know, the cellular distribution of the genes expressed in one tissue represents roughly the energy distribution of this source tissue. It thus provides us a profile of the energy flow and the molecular events involved in it. Since the spider silk glands are highly active in silk protein synthesis, it is logical to have a high percentage of translational machinery molecules. In the cytosolic protein group, many of them are involved in protein folding and sorting, indicating the vigorous activities of post translational modifications and strict controls for the fibroin synthesis in the silk glands. The category of unknown EST clones included 31%, which may be due to the fact of limited molecular information from the silk glands (65%). The functional information (unknown category) is not available at current stage. There might be novel cDNA molecules involving specific biological functions in the spider silk synthesis included in this category, which needs to be further studied.
Functional distribution of EST clones
Since it is not practical and not necessary for us to discuss all the genes identified with only statistical data, we try to focus here some interesting phenomena acquired from the EST pool. Based on the occurring frequency of the cDNA clones, we listed the most abundant cDNA clones from the 7 types silk gland library (Table S1, clones appear for at least 3 times, separated in three parts, which include clones of translational machinery; general clones, and clones of heat shock proteins). There include 223 clones, representing 21% of the clones (223 clones/1076 clones) from the library. In this list, some of the clones represent house-keeping genes or the genes involved in mainstream metabolism, such as ADP/ATP translocase, actin, glyceraldehyde-3-phosphate dehydrogenase, cytochrome oxidase subunit I, aldehyde dehydrogenase, carboxylesterase precursor, techylectin 5 A, IgE-dependent histamine release factor, and ubiquitin; and the rest may play more specific roles in the silk protein synthesis. In addition, 3 cDNA clones similar to excretory/secretory mucin MUC-5 gene (muc-5) (Toxocara canis) are also identified from N.a. silk glands. The MUC-5 gene was isolated from the mucin coating the T. canis labile surface31. It is a secreted glycoprotein and the exact function of this gene is not clear. However, the abundance of this cDNA clone suggests that it might play a role in spidroin synthesis and silk formation, for which we try to provide more detailed discussions.
Silk genes
Seven clones of known silk genes, representing major ampullate silk fibroin gene MaSp1 (4 clones), minor ampullate silk gene MiSp1 (1 clone)32, tubliform silk gene TuSp1 14, and aciniform silk gene-like are identified33,34. So far, there are a number of silk genes reported from spiders and our ESTs sequencing results, consistent with the sequences of previously identified silk genes, demonstrated that current strategy is effective in silk gene isolation.
Recently, large scale transcriptome analysis methods have been applied to identify new silk genes18. ECP homologues in an araneid was thus found. Newly developing methods are automatic and time-saving. However, due to the extremely long and highly repetitive nature of silk structural genes, there might be technical limitations to annotate the transcripts efficiently.
Genes involved in spidroin synthesis and storage
As shown in Table S1, a large number of RNA genes and translational elongation factors (12% or 132/1076 clones) are present in our EST data. The most abundant elongation factor is the elongation factor 1 alpha, which binds to aminoacyl-tRNA during translation elongation. This is quite reasonable since silk proteins are synthesized at an extraordinary rate in silk glands. Four copies of sec. 61 alpha subunit and three copies of ubiquitin cDNA clones are present in the EST data. Sec. 61 alpha subunit is a major component of the Sec. 61 complex, which forms the protein-conducting aqueous channels in the ER membranes35. This result may indicate that silk protein synthesis in silk glands is under the ER quality control, to ensure that the silk proteins are correctly folded for silk spinning. In addition to Sec. 61 and ubiquitin, a number of heat shock proteins, including Hsp70, Hsp60, Hsp20.8 A and small heat shock/alpha-crystallin protein precursor, are found in the silk gland (Table S1). They are the major chaperone molecules and are believed to prevent aggregation of unfolded proteins and to facilitate nascent proteins for their folding and assembly36,37,38. The identification of various heat shock proteins from spider silk glands may therefore provide clues for understanding of the process involved in fiber silk protein folding and assembly and quality control. Ornithine decarboxylase (ODC) is found to be the most abundant gene expressed in spider silk glands. 70 cDNA clones encode for ornithine decarboxylase, accounting for 6.5% of the total 1076 identified cDNA clones. ODC is the rate-limiting enzyme of the polyamine synthesis and involved in diverse biological processes, including cell growth, differentiation, transformation and apoptosis39. In silk gland, however, the functions of ODC are yet to be elucidated.
Novel cDNA clones containing repetitive sequence
In present study, 378 EST clones (38%) showed no significant similarity to those in the public database and 219 contain deduced animo acid sequences with 107 of them contain deduced amino acid sequences without any stop codon in the middle. To find out repetitive motifs, the software Vector NTI was used to translate cDNA sequences manually into different reading frames with online statistical resource (http://www.ebi.ac.uk/Tools/saps/). As a result, more than 10 of the 107 deduced protein sequences were found to contain repetitive sequences at the amino acid sequence level, which is one of the most important features of spidroin genes. Schematic figure representing the comparative organizations of three representative EST clones is showed in Fig. 1 while their sequences will be available upon request (B6 with NCBI accession no. EF044306., while the others are still under investigations). Both clones B6 and A105 contain multiple repetitive domains (RP) and C-terminal non-repetitive domains (CTD), while clone C622 contains one glycine-rich (15%) and one glycine-threonine-rich (57%) domains with a longer C-terminus. Short repetitive motifs, such as GT(S)XTGT(S)T(S), are present in glycine-threonine-rich domain in C622. We now designated B6 as gene TuSp2 (tubuliform spider fibroin 2), following the convention of TuSp1 since they are both specifically expressed in the tubuliform gland and involved in eggcase silk formation (Fig. 2, see below)14.
Sequence and size of gene TuSp2
The EST clone of TuSp2 encodes a partial cDNA of 1,194 bp, with an open reading frame of 270 amino acid residues (Fig. S1). It contains two almost identical fragments: one from amino acid residue 1 to 90 and the other from 144–233, which is rich in serine, alanine, asparagine and leucine residues (Fig. S2). There is a difference of 18 nucleotides in these two fragments, leading to 12 different amino acid residues. The organization of this molecule is shown in Fig. 1. The linker between residue 91 and 142 could also be part of RP domain. In order to validate the existence of this gene, Northern blot analysis was applied and found that the full-length TuSp2 reaches approximately 8 kb in length (Fig. 2A). This result indicates that TuSp2 likely encodes a fibroin protein with high molecular weight of over 200 kDa. Till now, mainly two types of eggcase silk genes: the huge TuSp1 (or CySp1,2) and the much smaller ECP or ECP-like genes have been identified in tubuliform gland14,17,18. The repetitive domain of TuSp2 shows low sequence identity ( < 25%) to those of TuSp1, MiSp1, AcSp1 and MaSp1 (Fig. S3). TuSp2 consists of an unusual short nonrepetitive C-terminal domain of 37 amino acid residues and shows no similarity to those of any known spidroins.
Very recently, the first genome of orb-weaving spider Nephila clavipes (N.c.) was uncovered40. Genome BLAST analysis revealed a partial sequence with ~60% identity to TuSp2 in the amino acid level (Fig. S4). This partial sequence in the N.c. genome involves one full RP domain and a short C-terminal domain of ~55 amino acid residues, suggesting that TuSp2 could also exist in N.c. silk gland. However, other RP and the N-terminal domains of putative TuSp2 from N.c. are missing, most likely due to the incompleteness of the N.c. genome.
Transcriptional distribution of gene TuSp2
A number of RT-PCR primers were designed to apply RT-PCR reactions to identify the expression pattern of TuSp2. As shown in Fig. 2B, TuSp2 mRNA was detected solely in the tubuliform gland and the pooled silk gland. As controls, MaSp1 and MiSp1 mRNAs were detected in the major and minor ampullate glands respectively, as well as in the pooled silk glands, but not in the tubuliform gland. Furthermore, ubiquitously expressed actin mRNA was detected in all four samples (Fig. 2B). Under current experimental condition, the RT-PCR reactions were limited to less than twenty-five cycles, and the results indicated the priority of individual genes in various glands. To further investigate the properties of TuSp2, a miniature construct of single repetitive domain of TuSp2 (TuSp2-RP) was expressed as a soluble recombinant protein in E. coli. Purified TuSp2-RP (Fig. S5) was applied as antigen to raise multi-clonal antibodies. Dragline, eggcase and total silks without egg cases were collected, dissolved and blotted onto the PVDF membrane. As shown in Fig. 2C, only eggcase silk demonstrates clear cross reactions to the antisera against recombinant TuSp2-RP. It was thus confirmed that TuSp2 is specifically assembled into eggcase silk fibers.
Spidroin-based biomaterials from TuSp2
It’s well known that silk fibroins can be processed into many material forms, including fiber, film or hydrogel. We found that single TuSp2-RP in water solution at a concentration of at least 10 mg/ml can be transformed into macroscopic fibers upon physical shearing at room temperature (Fig. 3A). Since the fibers tend to stick to each other, it is unlikely to acquire a single string of fiber for mechanical tests. TuSp2-RP can also be induced to form a transparent film structure upon drying on a petri dish or self-assemble into translucent hydrogel material at a pH close to its pI (Fig. 3B–D) at room temperature. Due to the low molecular weight of TuSp2 single domain (~15 kDa), the transparent film and translucent gel are too brittle to investigate their physical properties. Nevertheless, the ability of TuSp2 domain to form various structures provides promising potentials for the development of high-molecular-weight TuSp2 with multiple RP domains for biomaterial applications.
In summary, we have constructed a cDNA library from a pool of all the seven silk glands and found out a number of EST clones containing novel repetitive amino acid sequences, which possess no significant similarity to those of known silk proteins. On the basis of both sequence and biochemical data, we infer that TuSp2, different from both TuSp1 and ECP genes, is a novel gene involved in eggcase silk formation. We also demonstrated the capability of TuSp2 repetitive domain to form various types of materials. Current study implies promising potentials for identification of novel spidroins and development of new spidroin-based biomaterials in the future.
Methods
Sample collection and construction of a silk gland cDNA library
Five adult female spiders of N. antipodiana were collected from the bush in eastern part of Singapore. In order to collect spiders with active silk biosynthesis, a large portion of the spider web was destroyed one day before collection. Only spiders found to repair their webs were collected. Tissues comprising of the seven types of silk glands were isolated and total RNA was extracted using Trizol reagent (Gibco BRL). Poly-A RNA was purified using an Oligo-dT affinity column (Stratagene). For cDNA library construction, Stratagene’s Lambda Uni-Zap XR cloning system was used according to manufacturer’s protocol. In brief, ~5 µg of Poly-A RNA was extracted and used for cDNA synthesis. Two restriction enzymatic digestion sites of XhoI and EcoRI were introduced. cDNA molecules were directionally cloned into the predigested Zap vector and packaged using the package extract according to manufacturer’s instruction manual (Stratagene). A total of 7.5 × 106 pfu (plaque forming unit) of bacteriophage were obtained in the original library and subsequently were amplified using thirty 150-mm culture dishes. The titer of the amplified library was 9 × 1011 pfu/ml and the amplified spider silk gland DNA library was then aliquoted into 60 Eppendorf tubes and stored at −80 °C for long term storage. The percentage of empty clones determined using the blue-white selection by PCR amplification with T3 and T7 primers.
cDNA clone selection and sequencing
The phage library was converted to the plasmid library by mass in-vivo excision according to the standard protocol (Stratagene). To improve the possibility of accession the coding region, PCRs were performed using T3 and T7 primers for selection of cDNA clones with large inserts ( >1 kb) for sequencing reaction. DNA sequencing was carried out using the ABI BigDye Terminator (Version 3) and the ABI PRISM® 3100 sequencer (Applied Biosytems). Each clone was sequenced from the 5′ end using the vector primer T3 and sequencing readouts > 600 bp were usually obtained.
Sequence homology search and analysis
After removal of vector sequences and correction of obvious reading errors, all EST sequences were submitted to Mail-Fasta via email (fasta@ebi.ac.uk) for sequence homology search. The search was against all the DNA entries in the EMBL database (http://www.ebi.ac.uk/Tools/fasta33/index.html, UniProt). The first 100 most homologous sequences were listed, and usually the best 10–30 sequence alignments were examined. The identification of a spider homologous cDNA clone was based on the smallest Poisson probability, P(N), reported by the mail-Fasta analysis. In general, the highest P(N) value accepted for clone identification was 1 × 10−5. An identified clone generally shares a high sequence identity (usually > 65%) over a relatively long fragment ( >150 bp) with the most similar sequence homolog. We mainly searched through literature to decide the cellular localization of the characterized EST clones. The clones not sharing reliable similarities to known sequences in the database were classified in the category of ‘unknown’. In particular, nucleotide sequence of the EST clone B6 reported in this paper belongs to this ‘unknown’ category and was submitted to the Gene Bank database under the accession no. EF044306.
Northern blot analysis
Total RNA from the spider silk glands (~1–10 μg) was separated on a 1.0% agarose-formaldehyde gel and blotted onto a nylon membrane (Roche) with 10 × SSC (1 × SSC is 15 mM sodium citrate, 150 mM NaCl), crosslinked by an automatic crosslinker (Stratagene). The cDNA fragments from EST clone of were labeled with Dig DNA labeling kit (Roche). Hybridization was performed and was followed by immunological development according to the manufacturer’s instructions by chemiluminescent method using CSPD substrate (DIG Luminescent Detection Kit, Roche, Switzerland).
RT-PCR analysis
Total RNA from major ampullate gland sac, minor ampullate sac, tubliform gland and mixed tissue of all the seven types of silk glands was isolated with Trizol (Invitrogene). First strand cDNA was synthesized from the total RNA with SuperScript RT (Invitrogen). PCRs were carried out using Taq polymerase (Promega) with 25 cycles. The primers used for PCRs are:
B434-RT-F (5′-TTGATCTTGCTGGACGGGACTTGA-3′) and B434-RT-R (5′-TCCACATCTGTTGGAAGGTGGACA-3′) for actin;
D938-RT-F (5′-CTTCACGCTTATCTTCTCCTGAAG-3′) and D938-RT-R (5′-AACAATCTGAGCAGCTTGACCAG-3′) for MaSp2;
145-RT-F (5′-CTGGAAGTGCTGCAGGAAATGCTT-3′) and 145-RT-R (5′-ACTAGAAGCAGCGGAAGATGCAGT-3′) for MiSp1;
B6-RT-F (5′-CAGGACCAAACTTGTCTATTGGAG-3′) and B6-RT-R (5′-GAGGTTTACCATCGCTGAAGAATG-3′) for clone B6.
Construction, expression and purification of TuSp2-RP
The cDNA encoding TuSp2-RP (90 amino acid residues) with a linker at its N-terminus (52 amino acid residues excluding the first proline residue) was amplified from B6 cDNA using one pair of primers:
5′-CGCGGATCCAACCTGAGCATTGGCGATACC-3′
5′-AATCTCGAGTTAGCCAATTTCCACCATTTTTTT-3′
The PCR product was subcloned into a pET-M expression vector between BamH I and Xho I restriction sites. pET-M was derived from pET-32a (+) expression vector (Novagen) by the removal of Thioredoxin tag, S tag, Nco I and EcoR V restriction sites. The recombinant protein with a hexa-His tag (MHHHHHHSSGLVPRGS) at the N-terminus was expressed in E.coli BL21 (DE3) in LB medium. The cells were grown in 1 L medium with 100 µg/mL of ampicillin at 37 °C. When the OD600 reached ~0.5, isopropyl β-D-1-thiogalactopyranoside (IPTG) was added to a final concentration of 0.1 mM to induce silk protein expression for 12 hours at 20 °C. The cells were re-suspended in 20 mM Tris buffer (pH 8.0, 300 NaCl and 10 mM imidazole) and sonicated for 30 min at 4 °C. The recombinant protein was purified with nickel-nitrilotriacetic acid resin (Qiagen) under native condition according to the manufacturer′s instructions. Imidazole in the eluted protein solution was removed by dialysis against a PBS buffer at 4 °C. Finally, the purified protein solution was stored at 4 °C without agitation.
Dot blotting analysis
Five BALB/c mice (~20 g) were used for antibody production at the Animal Holding Unit (AHU), Ningxia Medical University. Briefly, purified recombinant TuSp2-RP protein was mixed with Freund’s complete adjuvant (Sigma) at a 1:1 ratio (v/v) to form an emulsion. Each mouse was injected intraperitoneally (i.p.) with 100 μg of recombinant protein. Two booster injections with Freund’s incomplete adjuvant were similarly administered each two weeks later. The animals were bled seven days after the last injection and the serum prepared routinely. Equal amount of dragline silk, eggcase silk, total silks without egg cases and B6 recombinant protein was dot blotted to PVDF membranes (Bio-Rad). Ponceau (0.2%) staining was applied simultaneously to compare the relative amount of the samples. The membranes were blocked for 1 h at room temperature with 5% milk. The primary antibody was anti-sera against recombinant protein (1:200 dilution). The negative control was serum from uninjected normal rats (1:200 dilution). Secondary antibody was HRP-conjugated anti mouse whole IgG (Proteintech). The signals were recorded by the ChemiDoc Touching Imaging System (BioRad).
Silk materials formation
Recombinant RP was concentrated to ~10 mg/ml at 4 °C by using an Amicon® Ultra Centrifugal Filter (Millipore) according to the manufacturer’s instructions. The concentrated RP solution was stored at 4 °C without agitation. Macroscopic fibers were formed from 250 μL of concentrated RP solution in a 1.5 or 2.0 mL tube upon physical shearing on a bench-top orbital shaker (50–70 rpm) at room temperature. Protein gel can be formed by adjusting pH of as little as 50 μL of concentrated RP solution to ~5.0 at room temperature and film was developed by casting ~100 μL of concentrated RP solution on a petri dish at room temperature.
References
Gosline, J. M., Guerette, P. A., Ortlepp, C. S. & Savage, K. N. The mechanical design of spider silks: from fibroin sequence to mechanical function. J Exp Biol. 202, 3295–3303 (1999).
Beckwitt, R. & Arcidiacono, S. Sequence conservation in the C-terminal region of spider silk proteins (Spidroin) from Nephila clavipes (Tetragnathidae) and Araneus bicentenarius (Araneidae). J Biol Chem. 269, 6661–6663 (1994).
Beckwitt, R., Arcidiacono, S. & Stote, R. Evolution of repetitive proteins: spider silks from Nephila clavipes (Tetragnathidae) and Araneus bicentenarius (Araneidae). Insect Biochem Mol Biol. 28, 121–130, https://doi.org/10.1016/S0965-1748(97)00083-0 (1998).
Gatesy, J., Hayashi, C., Motriuk, D., Woods, J. & Lewis, R. Extreme diversity, conservation, and convergence of spider silk fibroin sequences. Science. 291, 2603–2605, https://doi.org/10.1126/science.1057561 (2001).
Guerette, P. A., Ginzinger, D. G., Weber, B. H. & Gosline, J. M. Silk properties determined by gland-specific expression of a spider fibroin gene family. Science. 272, 112–115, https://doi.org/10.1126/science.272.5258.112 (1996).
Hinman, M. B. & Lewis, R. V. Isolation of a clone encoding a second dragline silk fibroin. Nephila clavipes dragline silk is a two-protein fiber. J Biol Chem. 267, 19320–19324 (1992).
Xu, M. & Lewis, R. V. Structure of a protein superfiber: spider dragline silk. Proc Natl Acad Sci USA. 87, 7120–7124, https://doi.org/10.1073/pnas.87.18.7120 (1990).
Sanggaard, K. W. et al. Spider genomes provide insight into composition and evolution of venom and silk. Nat Commun. 5, 3765, https://doi.org/10.1038/ncomms4765 (2014).
Chen, G. et al. Full-length minor ampullate spidroin gene sequence. PLoS One. 7, e52293, https://doi.org/10.1371/journal.pone.0052293 (2012).
Colgin, M. A. & Lewis, R. V. Spider minor ampullate silk proteins contain new repetitive sequences and highly conserved non-silk-like “spacer regions”. Protein Sci. 7, 667–672, https://doi.org/10.1002/pro.5560070315 (1998).
Hayashi, C. Y. & Lewis, R. V. Evidence from flagelliform silk cDNA for the structural basis of elasticity and modular nature of spider silks. J Mol Biol. 275, 773–784, https://doi.org/10.1006/jmbi.1997.1478 (1998).
Garb, J. E. & Hayashi, C. Y. Modular evolution of egg case silk genes across orb-weaving spider superfamilies. Proc Natl Acad Sci USA. 102, 11379–11384, https://doi.org/10.1073/pnas.0502473102 (2005).
Hu, X. et al. Egg case protein-1. A new class of silk proteins with fibroin-like properties from the spider Latrodectus hesperus. J Biol Chem. 280, 21220–21230, https://doi.org/10.1074/jbc.M412316200 (2005).
Huang, W. et al. Characterization and expression of a cDNA encoding a tubuliform silk protein of the golden web spider Nephila antipodiana. Biochimie. 88, 849–858, https://doi.org/10.1016/j.biochi.2006.02.010 (2006).
Lin, Z., Huang, W., Zhang, J., Fan, J. S. & Yang, D. Solution structure of eggcase silk protein and its implications for silk fiber formation. Proc Natl Acad Sci USA. 106, 8906–8911, https://doi.org/10.1073/pnas.0813255106 (2009).
Tian, M. & Lewis, R. V. Molecular characterization and evolutionary study of spider tubuliform (eggcase) silk protein. Biochemistry. 44, 8006–8012, https://doi.org/10.1021/bi050366u (2005).
Zhao, A. C. et al. Novel molecular and mechanical properties of egg case silk from wasp spider, Argiope bruennichi. Biochemistry. 45, 3348–3356, https://doi.org/10.1021/bi052414g (2006).
Chaw, R. C., Arensburger, P., Clarke, T. H., Ayoub, N. A. & Hayashi, C. Y. Candidate egg case silk genes for the spider Argiope argentata from differential gene expression analyses. Insect Mol Biol. 25, 757–768, https://doi.org/10.1111/imb.12260 (2016).
Hayashi, C. Y., Blackledge, T. A. & Lewis, R. V. Molecular and mechanical characterization of aciniform silk: uniformity of iterated sequence modules in a novel member of the spider silk fibroin gene family. Mol Biol Evol. 21, 1950–1959, https://doi.org/10.1093/molbev/msh204 (2004).
Blasingame, E. et al. Pyriform spidroin 1, a novel member of the silk gene family that anchors dragline silk fibers in attachment discs of the black widow spider, Latrodectus hesperus. J Biol Chem. 284, 29097–29108, https://doi.org/10.1074/jbc.M109.021378 (2009).
Choresh, O., Bayarmagnai, B. & Lewis, R. V. Spider web glue: two proteins expressed from opposite strands of the same DNA sequence. Biomacromolecules. 10, 2852–2856, https://doi.org/10.1021/bm900681w (2009).
Collin, M. A., Clarke, T. H., Ayoub, N. A. & Hayashi, C. Y. Evidence from Multiple Species that Spider Silk Glue Component ASG2 is a Spidroin. Sci Rep. 6, 21589, https://doi.org/10.1038/srep21589 (2016).
Andersson, M. et al. Biomimetic spinning of artificial spider silk from a chimeric minispidroin. Nat Chem Biol. https://doi.org/10.1038/nchembio.2269 (2017).
Rising, A. & Johansson, J. Toward spinning artificial spider silk. Nat Chem Biol. 11, 309–315, https://doi.org/10.1038/nchembio.1789 (2015).
Lin, Z., Deng, Q., Liu, X. Y. & Yang, D. Engineered large spider eggcase silk protein for strong artificial fibers. Adv Mater. 25, 1216–1220, https://doi.org/10.1002/adma.201204357 (2013).
Vollrath, F. & Knight, D. P. Liquid crystalline spinning of spider silk. Nature. 410, 541–548, https://doi.org/10.1038/35069000 (2001).
Scheibel, T. Spider silks: recombinant synthesis, assembly, spinning, and engineering of synthetic proteins. Microb Cell Fact. 3, 14, https://doi.org/10.1186/1475-2859-3-14 (2004).
Vollrath, F. & Knight, D. P. Structure and function of the silk production pathway in the spider Nephila edulis. Int J Biol Macromol. 24, 243–249, https://doi.org/10.1016/S0141-8130(98)00095-6 (1999).
Dicko, C., Kenney, J. M., Knight, D. & Vollrath, F. Transition to a beta-sheet-rich structure in spidroin in vitro: the effects of pH and cations. Biochemistry. 43, 14080–14087, https://doi.org/10.1021/bi0483413 (2004).
Adams, M. D. et al. Complementary DNA sequencing: expressed sequence tags and human genome project. Science. 252, 1651–1656, https://doi.org/10.1126/science.2047873 (1991).
Doedens, A., Loukas, A. & Maizels, R. M. A cDNA encoding Tc-MUC-5, a mucin from Toxocara canis larvae identified by expression screening. Acta Trop. 79, 211–217, https://doi.org/10.1016/S0001-706X(01)00137-1 (2001).
Gao, Z. et al. Structural Characterization of Minor Ampullate Spidroin Domains and their Distinct Roles in Fibroin Solubility and Fiber Formation. Plos One. submitted (2012).
Wang, S., Huang, W. & Yang, D. Structure and function of C-terminal domain of aciniform spidroin. Biomacromolecules. 15, 468–477, https://doi.org/10.1021/bm401709v (2014).
Wang, S., Huang, W. & Yang, D. NMR structure note: repetitive domain of aciniform spidroin 1 from Nephila antipodiana. J Biomol NMR. 54, 415–420, https://doi.org/10.1007/s10858-012-9679-5 (2012).
Meacock, S. L., Greenfield, J. J. & High, S. Protein targeting and translocation at the endoplasmic reticulum membrane–through the eye of a needle? Essays Biochem. 36, 1–13, https://doi.org/10.1042/bse0360001 (2000).
Clarke, A. R. Molecular chaperones in protein folding and translocation. Curr Opin Struct Biol. 6, 43–50, https://doi.org/10.1016/S0959-440X(96)80093-5 (1996).
Fink, A. L. Chaperone-mediated protein folding. Physiol Rev. 79, 425–449 (1999).
Welch, W. J. The role of heat-shock proteins as molecular chaperones. Curr Opin Cell Biol. 3, 1033–1038, https://doi.org/10.1016/0955-0674(91)90125-I (1991).
Pendeville, H. et al. The ornithine decarboxylase gene is essential for cell survival during early murine development. Mol Cell Biol. 21, 6549–6558, https://doi.org/10.1128/MCB.21.19.6549-6558.2001 (2001).
Babb, P. L. et al. The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression. Nat Genet. 49, 895–903, https://doi.org/10.1038/ng.3852 (2017).
Acknowledgements
Work at NMY is supported by National Natural Science Foundation of China under Award Numbers 81260243 and 31560042. Work at NUS is supported by a grant from the Ministry of Education, Singapore (R154000453112). Work at TJU is supported by a startup grant from Tianjin University.
Author information
Authors and Affiliations
Contributions
W.H., D.Y. and Z.L. designed the experiments. W.H., Y.Z., Y.C., Y.W., W.Y., N.Z. and Z.L. performed the experiments. W.H., T.Z.L., Z.G., D.Y. and Z.L. analyzed the data. W.H., D.Y. and Z.L. wrote the paper.
Corresponding authors
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Huang, W., Zhang, Y., Chen, Y. et al. From EST to novel spider silk gene identification for production of spidroin-based biomaterials. Sci Rep 7, 13354 (2017). https://doi.org/10.1038/s41598-017-13876-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-017-13876-1
This article is cited by
-
1H, 15N and 13C resonance assignments of a repetitive domain of tubuliform spidroin 2
Biomolecular NMR Assignments (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.