A chromosome-level genome assembly provides insights into the environmental adaptability and outbreaks of Chlorops oryzae

Zhou, Ailin; Huang, Cong; Li, Yi; Li, Xinwen; Zhang, Zhengbing; He, Hualiang; Ding, Wenbing; Xue, Jin; Li, Youzhi; Qiu, Lin

doi:10.1038/s42003-022-03850-7

Download PDF

Article
Open access
Published: 26 August 2022

A chromosome-level genome assembly provides insights into the environmental adaptability and outbreaks of Chlorops oryzae

Ailin Zhou^1,2,
Cong Huang³,
Yi Li⁴,
Xinwen Li⁴,
Zhengbing Zhang⁴,
Hualiang He¹,
Wenbing Ding^1,2,
Jin Xue¹,
Youzhi Li ORCID: orcid.org/0000-0003-1774-3575^1,2 &
…
Lin Qiu ORCID: orcid.org/0000-0003-3920-0950¹

Communications Biology volume 5, Article number: 881 (2022) Cite this article

1766 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Chlorops oryzae is a pest of rice that has caused severe damage to crops in major rice-growing areas in recent years. We generated a 447.60 Mb high-quality chromosome-level genome with contig and scaffold N50 values of 1.17 Mb and 117.57 Mb, respectively. Hi-C analysis anchored 93.22% scaffolds to 4 chromosomes. The relatively high expression level of Heat Shock Proteins (HSPs) and antioxidant genes in response to thermal stress suggests these genes may play a role in the environmental adaptability of C. oryzae. The identification of multiple pathways that regulate reproductive development (juvenile hormone, 20-hydroxyecdsone, and insulin signaling pathways) provides evidence that these pathways also play an important role in vitellogenesis and thus insect population maintenance. These findings identify possible reasons for the increased frequency of outbreaks of C. oryzae in recent years. Our chromosome-level genome assembly may provide a basis for further genetic studies of C. oryzae, and promote the development of novel, sustainable strategies to control this pest.

A chromosome-level genome assembly of yellow stem borer (Scirpophaga incertulas)

Article Open access 08 March 2024

A chromosome-level genome assembly of Stenchaetothrips biformis and comparative genomic analysis highlights distinct host adaptations among thrips

Article Open access 04 August 2023

A chromosome-level genome assembly of Sesamia inferens

Article Open access 25 January 2024

Introduction

Chlorops oryzae (Diptera, Chloropidae) is an important pest of rice. Newly hatched larvae burrow into the stems of rice plants, then move to the growing tips where they feed on developing leaves and young panicles¹ (Fig. 1). Over the last century, this species has become widespread throughout Japan and Korea causing severe damage to rice crops². In recent years, it has spread to mountainous and semi-mountainous regions and has caused severe crop damage in the country’s main rice-growing regions, becoming China’s most destructive rice pest. Despite the increasing economic impact of this pest, there is limited data on the major ecological characteristics of C. oryzae, such as environmental adaptability and frequency of outbreaks, due to a lack of genomic resources for this organism.

**Fig. 1: *Chlorops oryzae* larvae damage to rice plants.**

There are ~400 insect species with genomes available at NCBI (https://www.ncbi.nlm.nih.gov/genome/browse#!/overview/insects), and genomic tools, particularly the availability of a high-quality assembled genomes, could help explain the molecular mechanism for C. oryzae invasion adaptability and outbreak frequency^3,4. Several genome sequences have been reported in Diptera taxa, including Drosophila melanogaster⁵, Anopheles gambiae⁶, Musca domestica⁷, Ceratitis capitata⁸ and Bactrocera dorsalis⁴, which have helped determine the molecular and genetic mechanisms of many biological problems. Unfortunately, no genome, particularly a chromosome-level genome, is available for C. oryzae.

Changing environmental conditions such as global warming are known to influence the frequency of insect outbreaks. The growth, development, and reproduction of insects can all be directly affected by temperature. For example, in multivoltine taxa such as the Aphididae and some Lepidoptera (e.g Pieris brassicae), higher temperature decreases development time, potentially increasing the number of generations produced per year⁹. Temperature also affects insect distribution and abundance¹⁰. According to the Intergovernmental Panel on Climate Change (IPCC), the planet is warming by around 0.6 °C per annum, with global temperatures anticipated to rise by an average of 1.4–5.8 °C by 2100¹¹. Over the past 55 years, the average number of high-temperature days recorded in China has increased by 28.4%¹². Consequently, this raises the question of how C. oryzae adapts to a warming environment. Furthermore, the emergence of insecticide resistance may be an additional factor influencing pest outbreaks. C. oryzae has become widespread and is increasing rapidly, making crop protection more challenging¹³. Insecticides are currently the primary means of controlling C. oryzae in the field. Growing evidence suggests that resistance to chemical insecticides is caused by decreased sensitivity of target-site proteins and increased metabolic detoxification of insecticides¹⁴. Metabolic resistance arises from the overexpression of detoxification enzyme genes, which belong to three major metabolic detoxification gene families: carboxylesterase, glutathione-S-transferases and cytochrome P450¹³. P450s are one of the largest gene families in all organisms, performing highly diverse physiological and biochemical functions essential for the detoxification and/or activation of heterologous and endogenous compounds¹⁵. Understanding the genomic characteristics that underpin high temperature adaptation and insecticide resistance is essential for developing effective prevention and control measures for this pest.

To determine the internal mechanisms underlying the adaptability and frequency of outbreaks of C. oryzae in recent years, we generated a high-quality chromosome-level genome assembly of C. oryzae through the combined application of PacBio and Illumina sequencing, the HiSeq X Ten platform, and Hi-C. We then conducted a comparative analysis using available insect genomes to gain a better understanding of the genomic evolution of C. oryzae. This reference genome has facilitated the identification of xenobiotic detoxification enzymes such as cytochrome P450s. Additionally, by combining transcriptome and qPCR technologies, we further investigated the molecular basis underlying the ability of C. oryzae to adapt to novel and changing environmental conditions. Finally, we discuss the results of functional studies on the reproductive development of C. oryzae. Our findings may provide a genetic basis for future research on C. oryzae outbreaks and promote the development of effective strategies to control this pest.

Results

Assembly and annotation of the C. oryzae genome

C. oryzae has a diploid chromosome number (2n) of 8. To analyze this genetic resource, we assembled the C. oryzae genome using PacBio long reads and Hi-C chromatin contact information. Removing low-quality and short reads, left a total of 55.92 Gb clean reads for genome assembly, which k-mer analysis estimated to have a genome size of 462.76 Mb (Supplementary Fig. 1 and Supplementary Table 1). A total length of 448.90 Mb assembled genome obtained by using the third-generation sequencing, after correction with Illumina reads, we obtained 3407 contigs with a total length of 447.60 Mb and an N50 of 1.17 Mb. The assembly was then significantly improved, yielding 1575 scaffolds with a total length of 447.78 Mb and an N50 of 117.57 Mb. Hi-C chromatin contact information further supported 4 scaffolds being anchored, ordered, and oriented to give four chromosomes with >93% of assembled bases located on these (Fig. 2, Table 1 and Supplementary Table 2). The completeness of the genome, evaluated by calculating the genome coverage rate for a set of single-copy orthologous eukaryotes genes with BUSCO (v3.0.1), was estimated at 96.1% (Supplementary Table 3). Further analysis of GC content and sequencing coverage showed a normal distribution among assembled scaffolds (Supplementary Fig. 2), which is indicative of low contamination of the assembly. More than 97% of consensus transcripts mapped to the assembly (Supplementary Table 4). These results suggest high accuracy and completeness of the genome assembly.

**Fig. 2: Circular diagram depicting the characteristics of the *Chlorops oryzae* genome.**

Table 1 Features of the Chlorops oryzae genome assembly.

Full size table

In total, 256.8 Mb of repeat sequences were identified, comprising 56.30% of the C. oryzae genome (Supplementary Table 5). DNA transposons and retroelements accounted for 7.25% and 14.09%, respectively. 5.03% were classified as long interspersed elements (LINEs), 0.01% as short interspersed elements (SINEs) and 9.05% as long terminal repeats (LTRs) of the genome. The protein-coding genes in the reference genome were predicted by EVidenceModeler (EVM) (http://evidencemodeler.github.io/), a total of 17,259 gene models were predicted in the assembled genome as the reference gene set (Supplementary Table 6). Of these genes, 14,863 (86.12%) coding proteins were annotated by functional databases (Supplementary Table 7), including Uniprot, GO, KO, Map, NR, NT, PFAM, and eggNOG. Furthermore, we identified different types of noncoding RNAs (ncRNAs), including 1378 tRNAs with tRNAscan-SE (http://lowelab.ucsc.edu/tRNAscan-SE/), and 161 miRNAs, 130 rRNAs and 93 snRNAs, by referencing known noncoding RNA libraries, Rfam (http://rfam.xfam.org/) (Supplementary Table 8).

Gene orthology and phylogenetic analysis

OrthoMCL (http://OrthoMCL.org/OrthoMCL/) was used to identify orthologous genes in C. oryzae and 14 other insect species from six orders (Isoptera, Hemiptera, Hymenoptera, Coleoptera, Diptera and Lepidoptera). A total of 2298 single-copy orthologous genes and 1602 multiple-copy orthologous genes were identified (Supplementary Table 9). Protein sequences of the single-copy genes were used to infer phylogenetic relationships and estimate the divergence between species. The result indicates that C. oryzae diverged from C. capitata (both of which are members of the Cyclorrhapha) around 186 million years ago (Fig. 3).

**Fig. 3: Phylogenetic tree and gene orthology of *Chlorops oryzae* and 14 other insect genomes.**

Gene family expansion and contraction

We used CAFÉ software to study the gene family expansion and contraction of C. oryzae and related species during evolution. The results showed that compared with the common ancestor of C. oryzae and C. capitata, the C. oryzae genome displayed 561 expanded and 416 contracted gene families (Supplementary Fig. 3).

The C. oryzae cytochrome P450 gene

The enhanced metabolization of insecticides by cytochrome P450 monooxygenases is a common insecticide resistance mechanism¹⁶. We identified 69 cytochrome P450 genes that mapped to the C. oryzae chromosomes (Fig. 4). Phylogenetic analysis of P450s clearly represents four major clans, i.e., the CYP2, the CYP3, the CYP4, and the mitochondrial (Mito) clade (Fig. 4). The distribution of P450 genes across the genome revealed 5 gene clusters with three or more P450 genes (Supplementary Fig. 4).

**Fig. 4: Phylogenetic tree of the cytochrome P450 (P450) gene family of *Chlorops oryzae* and other insects.**

Thermal stress response

Thermal stress can alter the permeability of insects’ cell membranes, decrease their water content and affect their enzyme and protein activity. Heat Shock Proteins (HSPs) and antioxidant genes play important roles in protecting insects against these adverse consequences of thermal stress^17,18. We manually annotated the HSP and antioxidant gene families in the C. oryzae genome, including 6 HSP90, 14 HSP70, 14 HSP60, 10 HSP40, 13 small HSP (sHSP), 2 catalase (CAT), 8 peroxidase (POD), 5 superoxide dismutase (SOD) and 22 glutathione-S-transferase (GST) (Supplementary Data 1; Supplementary Fig. 5–13). Supplementary Fig. 14 and 15 show the distribution of HSP genes and antioxidant genes, respectively, across the genome.

To investigate the role of stress response and antioxidant genes in ameliorating the adverse effects of heat stress in C. oryzae, we conducted a comparative transcriptomic analysis to identify genes that were differentially expressed in larvae that had been exposed to either normal, or high, temperatures. All 41,064 assembled unigenes were submitted to BLASTX for annotation in the NR, NT, SwissProt, KEGG, KOG, Pfam and GO databases. Pairwise comparisons among different temperature treatment groups showed that 1519 transcripts were upregulated and 1823 downregulated, between 24 °C and 33 °C (24 °C as control group), that 1487 transcripts were upregulated and 6996 downregulated, between 24 °C and 39 °C (24 °C as control group), and that 1641 transcripts were upregulated and 6232 downregulated, between 33 °C and 39 °C (33 °C as control group) (Supplementary Data 2). All differentially expressed annotated genes were classified into three categories, “Biological process”, “Cellular component” and “Molecular function”, by Gene ontology (GO) analysis. “Cellular process” and “Metabolic process” were the most common subcategories in the “Biological process” category, whereas “Binding” and “Catalytic activity” were the most common subcategories in the “Molecular function” category (Supplementary Fig. 16–19).

We identified 62 candidate HSPs, 2 candidate CAT, 27 candidate GST, 10 candidate POD and 9 candidate SOD, genes in the different temperature treatment transcriptomes (Supplementary Fig. 20–24). We used qRT-PCR to validate RNA sequencing (RNA-seq) data by measuring the expression of stress response, and antioxidant, genes. The expression of antioxidant genes such as SOD, GST and POD was significantly affected by temperature (Fig. 5 and Supplementary Fig. 25), suggesting that these genes are involved in the response of C. oryzae to high temperature stress. Consistent with the RNA-seq results, some HSP genes, such as HSP83, HSP70, HSP68, HSP67B2, HSP27 and HSP23, were also upregulated (Fig. 5 and Supplementary Fig. 25).

**Fig. 5: Effects of temperature stress on mRNA levels of stress and antioxidant genes in *Chlorops oryzae* larvae.**

Reproductive development

It is well known that the reproductive capacity of insects is critical to insect population outbreaks. Ovarian maturity is fundamental for female insect reproduction, which can be regulated by juvenile hormone (JH), 20-ecdysterone (20E) or insulin through regulating vitellogenesis¹⁹. We performed RNAi experiments to disrupt JH, 20E and insulin-like, signaling in newly emerged adult females, focusing on how ovarian development is regulated by different upstream signals. RNAi knockdown of vitellogenin (Vg) completely prevented ovary maturation (Fig. 6 and Supplementary Fig. 26). Furthermore, RNAi knockdown of key genes in the JH pathway (Methoprene-tolerant (Met) and Krüppel homolog 1 (Kr-h1)) reduced yolk deposition, thereby inhibiting ovarian development, but RNAi knockdown of Taiman (Tai, a binding partner of Met) had no effect on ovarian development (Fig. 6 and Supplementary Fig. 26). RNAi knockdown of key genes in the insulin pathway (InR (insulin receptor), FOXO (Forkhead box-containing protein)), TOR (Target of Rapamycin), and PI3K (phosphatidylinositol 3-kinase) and USP (ultraspiracle protein) (a key gene in the 20E pathway) also affected oocyte maturation and prevented ovarian development (Fig. 6 and Supplementary Fig. 26). These results indicate that JH, insulin, and 20E are crucial for normal ovary maturation in C. oryzae.

**Fig. 6: Effects of RNAi knockdown of the *Vitellogenin* (Vg) and eight other genes involved in the JH, 20E and insulin-like peptide, signaling pathways, on ovarian development.**

Discussion

We generated a high-quality C. oryzae genome assembly by combing the PacBio Sequel system and HiSeq X Ten platform with Hi-C technology. Long-read sequencing and Hi-C assisted assembly strategies have previously produced high-quality genome assemblies of other animals^20,21 and plants^22,23. Our results, including contig N50, Scaffold N50, GC content, BUSCO evaluation and the full-length transcripts, indicate that the reference genome had high levels of completeness and accuracy and may provide a foundation for further genetic research on C. oryzae.

Cytochrome P450 is an ancient and large superfamily involved in the metabolism of exogenous and endogenous compounds²⁴. This gene family has been well studied in insects due to its contribution to adaptation to exogenous compounds and pesticide resistance^21,25. In some orders, the number of these detoxification family genes is related to the level of insecticide resistance. For example, compared to other blattodea species, the expanded cytochrome P450 monooxygenase gene family of American cockroaches Periplaneta americana is associated with higher insecticide resistance and survival under extreme conditions²⁶. We predicted fewer P450 genes in C. oryzae than in other Diptera. However, since other studies have found no obvious link between the size of detoxification gene families and resistance, C. oryzae may not have any less detoxification capacity than other insects and size could be corelated with the breadth of the host range²⁷. Further studies are required to identify key genes in the C. oryzae cytochrome P450 family.

Heat shock proteins are ubiquitous and evolutionarily conserved families of proteins in all living organisms that are critical for environmental adaptation²⁸. HSPs usually act as molecular chaperones, facilitating the correct refolding of proteins and preventing the aggregation of denatured proteins, but they also participate in diverse cellular processes such as signal transduction, DNA replication, metabolic detoxification and immune defense reactions²⁹. Furthermore, HSPs can be induced by extreme temperatures, oxidation, UV and heavy metals to help organisms withstand adverse environmental conditions²⁸. Since their discovery in D. melanogaster larvae, HSPs have been found to be involved in the heat stress responses of many insects^30,31. Our results show that the expression of several HSPs (HSP83, HSP70, HSP68, HSP67B2, HSP27 and HSP23) was significantly upregulated in the high temperature treatment groups relative to the control, which suggests that these genes play an important role in counteracting heat stress in C. oryzae.

HSP83 is a member of HSP90 family. HSP90 proteins often regulate the ability of animals to adapt to adverse environmental conditions and serve as the primary self-protection mechanism. Consistent with our results, HSP83 was upregulated in Sesamia nonagrioides after exposure to an elevated temperature (40 °C)³². The others, HSP70, HSP68 and HSP67B2 belong to the most conserved HSP gene family-the HSP70 family^31,33. Several studies have found that HSP70s play important roles in resistance to heat stress. In Drosophila, thermotolerance was significantly improved by the inducible expression of HSP70s³³. Furthermore, the survival rate of female B. tabaci subject to heat stress dramatically decreased after HSP70 knockdown³⁴ and in Diaphorina citri HSP70 was significantly upregulated in insects subject to heat stress¹⁸. Besides that, HSP27 and HSP23 are members of the sHSP family, which has the most diverse functions among stress-response proteins. Consistent with the results of a study on Chironomus riparius³⁵, we found that expression of HSP27 and HSP23 markedly increased at high temperatures. Apart from responding to heat and oxidative stress, sHSPs may also be involved in diapause³⁶, embryo formation, physiological regulation³⁷ and metamorphosis³⁸. Overall, our results indicate that HSP83, HSP70, HSP68, HSP67B2, HSP27 and HSP23 play important roles in the ability of C. oryzae to tolerate thermal stress. Further research is, however, required to determine the specific roles of these genes in C. oryzae.

Heat stress causes a variety of physiological stress responses in insects, including increased production of reactive oxygen species (ROS) that can cause oxidative damage. Oxidative damage in proteins ranges from specific amino acid modifications and peptide breakage to the loss of enzyme activity³⁹. To prevent such damage, organisms have developed antioxidant defense mechanisms, such as specific antioxidant systems (e.g., vitamins, glutathione, antioxidant enzymes, etc.)⁴⁰. Our results show that exposure to high temperatures significantly affected the expression of SOD, GST and POD, which suggests that these antioxidant enzymes are involved in antioxidant responses to thermal stress in C. oryzae. SOD is known to play an important role in reducing the level of superoxide radicals induced by low, or high, ambient temperatures⁴¹. In Monochamus alternatus, the SOD gene was upregulated in larvae exposed to 40 °C¹⁷. Similarly, we found that SOD expression was significantly higher in the 39 °C treatment group than in the 24 °C control group, suggesting that SOD was induced by exposure to high temperatures. SOD scavenges superoxide anions thereby protecting insects from thermal stress. GST is thought to participate in the inactivation of accumulated toxic, lipid, peroxidation products caused by oxidative damage and xenobiotics treatment⁴². We found that expression of GST was significantly higher in the 36 °C treatment group than in the control, and similar results have been reported in Ostrinia furnacalis⁴³ and Panonychus citri⁴⁴. Although GST was not significantly upregulated in the 39 °C treatment group compared to the control, this could be because lipid peroxidation was mitigated by other antioxidant mechanisms. In addition to SOD and GST, insects also have POD, which breaks down H₂O₂⁴⁵. We found that POD expression was significantly lower in the temperature treatment groups relative to the control. Conversely, exposure to a high temperature (35 °C) for 1 h dramatically increased POD activity in Aphidius gifuensis⁴⁶, whereas there was no significant difference in POD expression between high temperature treatment groups and the control (25 °C) in P. citri, even after the duration of exposure to high temperatures was increased⁴⁴. Further research is required to understand the role of POD in the responses of insects to thermal stress.

Ovarian development, the most important part of the reproductive system of female insects¹⁹, is essential for maintaining insect populations. Ovarian maturity can be regulated by JH, 20E or insulin through regulating vitellogenesis¹⁹. Our RNAi experiments on newly emerged females demonstrate that JH, insulin and 20E are critical to the regulation of oocyte maturation and ovarian development in C. oryzae. Apparently, vitellogenesis and egg maturation are coordinated by three hormonal signals in this pest, which is not unexpected given the complex reproductive processes in dipterans¹⁹. And it also appears to be the case in P. americana²⁶. These results highlight the importance of understanding the molecular mechanisms underlying hormonal signaling pathways during ovarian maturation. JH, which acts via Met, controls vitellogenesis and oocyte maturation. Knockdown of Met consequently inhibits JH-induced Vg expression, ovarian development and lipid accumulation⁴⁷. As an early response gene in the JH signaling pathway, Kr-h1 has been confirmed to play an important role in yolk formation and ovarian development in Bactrocera dorsalis, Locusta migratoria and Helicoverpa armigera^47,48,49. Consistent with these findings, our results show that Met and Kr-h1 RNAi depletion block ovarian maturation in C. oryzae. Similarly, knockdown of the endoplasmic reticulum glucose-regulated chaperone Grp78 gene, which is also regulated by JH, significantly inhibited follicular cell development and reproduction in L. migratoria⁵⁰. These findings show that the JH signaling pathway regulates insect reproduction via multiple factors.

We found that knockdown of key genes in the insulin pathway (InR, FOXO, TOR, and PI3K) decreased yolk deposition and blocked ovarian development. The insulin signaling pathway is dependent on adequate nutrition, only female mosquitoes that have obtained a blood-meal can complete normal ovarian development. After a blood meal, amino acids activate TOR signaling, phosphorylate transcription activator S6K and transcription inhibitor 4E-BP, and RNAi Rheb, S6K-mediated gene, blocks Vg expression and egg maturation⁵¹. TOR signaling often regulates insect reproduction in combination with insulin signaling. Insulin binds to the insulin receptor InR, inducing phosphorylation of InR and interacting with the substrate to activate the PI3K pathway after which the normal transcription of Vg is regulated by cascade phosphorylation (PDK), protease B (Akt/PKB) and FOXO⁵². RNAi-mediated silencing of InR has been found to have negative effects on insect reproduction in several species, confirming the role of the insulin pathway in controlling reproductive processes^53,54. In addition to the direct effect of the insulin pathway on vitellogenesis, an interaction between this and other pathways may also regulate ovarian development in C. oryzae. In some insects, insulin/TOR signaling regulates ovarian maturation by affecting JH signaling or JH biosynthesis^55,56. In adult Drosophila, InR mutations led to a decrease in JH titer⁵⁷, whereas silencing TOR and starvation caused a significant decrease in the levels of JH synthase and JH synthesis mRNA in the corpora allata of adult female cockroaches⁵⁸. The TOR nutritional signaling pathway was found to have a similar effect on JH biosynthesis in A. aegypti and N. lugens^59,60. In Blattella germanica, knockdown of InR was found to inhibit JH biosynthesis and decrease Vg expression, thereby blocking ovarian development⁵³. The regulatory networks controlling ovarian maturation in insects are clearly complicated, and additional research is therefore required to understand the molecular mechanisms controlling this process in C. oryzae.

C. oryzae outbreaks frequently in recent years. Adaptation to warmer temperatures may have increased the frequency of outbreaks. In addition, pesticide resistance also facilitates C. oryzae outbreaks by making the species more difficult to control. Reproductive development is also crucial for maintaining insect populations. In summary, this paper provides insights on possible reasons for C. oryzae frequent outbreaks in recent years. Our chromosome-level genome assembly should both facilitate future genetic research on the causes of C. oryzae outbreaks and support the development of sustainable control strategies for this pest.

Methods

Insects

C. oryzae larvae were collected in 2019 in Hanshou County, Hunan province, China, and reared on fresh rice stems in the laboratory. Larvae were kept at 24 ± 1 °C and >80% relative humidity, under a photoperiod of 16:8 (L:D) h. Larvae were used for Illumina sequencing for transcriptome analysis, and 100 female adults were collected for Illumina, PacBio, and Hi-C sequencing for genome analysis.

Genome sequencing

The whole genome was sequenced on the PacBio Sequel System (https://www.pacb.com/products-and-services/pacbio-systems/sequel/) based on single-molecule real-time (SMRT) sequencing technology. The template library was constructed using a SMRTbell Template Prep Kit 1.0 and a SMRTbell Damage Repair Kit. Following the procedure described in the PacBio brochure “>20 kb Template Preparation Using BluePippin^™ Size-Selection System (15–20 kb Cutoff) for Sequel^™ Systems”, the quality DNA was fragmented with g-TUBE (covaries, 520079), concentrated with AMPure^® PB magnetic beads and the fragments eluted with Pacific Biosciences^® Elution Buffer. The fragments were damage-repaired with ExoVII, end-repaired with End Repair Mix and ligated with the blunt adapter. After removing failed ligation products with ExoIII and ExoVII, ligation products were purified twice with AMPure^® PB Beads, and selected for size with the BluePippin^™ Size-Selection System. The fragments obtained were bead-purified, damage-repaired, and used as ~20 kb SMRTbell templates. These templates were annealed with primers and bound to DNA polymerase using a PacBio DNA/Polymerase Kit and magnetic beads, and loaded into the PacBio Sequel™ System for sequencing.

Generation of short reads for genome correction

In order to collect Illumina paired-end reads, we used agarose electrophoresis (1% agarose gels) to check for possible degradation and contamination of genomic DNA, determined its purity with a NanoPhotometer® (IMPLEN, CA, USA) and measured its concentration with a Qubit^® 2.0 Fluorometer (Life Technologies, CA, USA). Only genomic DNA that passed these quality controls was included in the short fragment library constructed following the TruSeq DNA Sample Preparation Guide (Illumina, 15026486 Rev. C). This procedure mainly included the steps of DNA fragmentation, end-repairing, base “A” tailing, ligation of adapters, the recovery of DNA of the required size from gels and PCR amplification of the recovered DNA. Amplification products were used as libraries for sequencing once they passed quality checks. In brief, amplification products were quantified with Qubit2.0 and their size range determined with Agilent 2100. If fragments were within the expected size range, the library was accurately quantified with a Bio-RAD CFX 96 real time quantitative PCR thermocycler and a Bio-RAD KIT iQ SYBR GRN Q-PCR thermocycler. The quality library was sequenced on a HiSeq X Ten Platform set to the PE150 program and paired-end reads obtained.

K-mer analysis

Before genome assembly, genome features can be estimated from the sequences obtained by sequencing. We used the analysis method based on K-mer to estimate the genome size. We iteratively selected the sequence with the length of K bases from a continuous sequence. If the length of each sequence is L, the length of K-mer is K, then L-K + 1 K-mer can be obtained. Here, we took K = 21 for analysis. The distribution of K-mers depends on the characteristics of the genome and follow a Poisson’s distribution.

Genome assembly

The quality of the reads exported by Sequel™ Systems was evaluated with the in-built High Quality Region Finder (HQRF), which identifies the longest high quality region generated for each read by a singly-loaded DNA polymerase according to the signal to noise ratio. Upon generation by the system, all bases were marked with “!” in order to perfect the format. High-quality reads (or regions) were marked with “0.8” and low-quality ones with “0”.

High-quality reads were assembled into contigs using Canu (v1.5 https://github.com/marbl/canu) by setting the parameters as follows: canu -pacbio-raw sample.subreads.fasta -p sample -d sample-canu genomeSize = 40 m gridEngineMemoryOption = “-l vf = MEMORY”. Canu used all-versus-all overlap information to correct individual reads. It selected these overlaps in a two-step filtration process comprised of global and local filtration. Global filtration identified targets where a read may provide correction support, whereas local filtration allowed a read to accept, or reject, the correction evidence provided by other reads. At the trimming stage, Canu identified the region of each read without correction support, trimming or splicing reads into their longest regions with correction support. These regions were subject to a final check for sequencing errors, and then used to construct the best overlap graph based on the output contigs and to compile summary statistics.

Errors in the primary assembly were identified and corrected with BLASR (v5.1, https://github.com/ Pacific Biosciences/blasr) and Arrow (v2.2.1), a tool built in Smrt Link (https://downloads.pacbcloud.com/public/software/installers/smrtlink_5.0.1.9585.zip). The PacBio reads were first mapped to the raw contigs using BLASR with the parameters:—bam—bestn 5—minMatch 18—nproc 4—minSubreadLength 1000—minAlnLength 500—minPctSimila rity 70—minPctAccuracy70—hitPolicy randombest—randomSeed 1, after which consensus sequences and variant calls were obtained via Arrow (v2.2.1) with the default parameters.

The consensus genome was subject to a final round of base-error correction (polishing) by referring to the Illumina reads with BWA (v0.7.9a) and Pilon (v1.22, https://github.com/broadinstitute/pilon). The Illumina paired-end reads were mapped to the contigs by BWA (parameter, -k 30), after which Pilon (v1.22) (default parameters) used this alignment to correct the assembly. The quality of the genome sequence obtained was further evaluated with BUSCO (v3.0.1, http://busco.ezlab.org/) (default parameters) based on a set of single-copy orthologous eukaryotes genes.

Hi-C

We performed Hi-C sequencing to facilitate assembly of the C. oryzae genome. After crosslinking, samples were used for quality control. The Hi-C library was then prepared and sequenced on the Illumina Novaseq platform with 2 × 150 bp reads at Annoroad Gene Technology Co. Ltd. (Supplementary Table 10; Supplementary Fig. 27). We first used the bowtie 2 end-to-end algorithm to align cleaned reads with the reference genome⁶¹. Unmapped reads mainly consisted of chimeric fragments that spanned the ligation junction. According to the Hi-C introduction and fill-in strategy, HiC-Pro (v2.7.8) was used to detect the ligation site with an exact matching program and to align the 5' segment read on the genome⁶². The results of each mapping step were then merged into a single alignment file. Lachesis, the assembly package, was used to cluster, order and orient reads. Finally, we cut the chromosomes predicted by Lachesis into equal-length bins, such as 1 Mb or 500Kb, and constructed a heat map according to the interaction signals revealed by the effective mapping between bins (Supplementary Fig. 27).

Genome annotation

Two methods, homologous sequence prediction and ab initio prediction, were used to predict repetitive sequences. Homologous sequence prediction is based on RepBase (https://www.girinst.org/server/RepBase/index.php), a repeat sequence database. RepeatMasker and RepeatProteinMask were used to predict sequences similar to known repeat sequences⁶³. RepeatModeler (http://www.repeatmasker.org/RepeatModeler/) was used in ab initio prediction. First, a de novo repeat sequence library was established by RepeatModeler, and then repeat sequences were predicted by RepeatMasker. In addition, the ab initio prediction method was also used to find tandem repeat sequences in the genome with TRF software⁶⁴.

Gene structure prediction was performed using three strategies: evidential support of transcriptional data, homologous prediction, and ab initio prediction. For the evidential support of transcriptional data, we used EST/CDA sequence and genome alignment to predict gene structure, with the commonly used software PASA (http://pasa.sourceforge.net/)⁶⁵. In homologous prediction, the coding protein sequences of known homologous species (Drosophila melanogaster, Ceratitis capitata, and Lucilia cuprina) were compared with genome sequences of C. oryzae and the gene structure was predicted by BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi)⁶⁶, Genewise (http://www.ebi.ac.uk/~birney/wise2/)⁶⁷. Software based on the statistical characteristics of genomic sequence data (such as codon frequency, exon-intron distribution) was used to predict gene structure in ab initio prediction. The most commonly used software packages are Augustus (http://augustus.gobics.de/), SNAP (https://github.com/KorfLab/SNAP) and GeneMark (http://exon.gatech.edu/GeneMark/). Finally, to synthesize the above forecast results, the gene sets predicted by each strategy were integrated into a non-redundant and more complete gene set with EVidenceModeler (EVM) (http://evidencemodeler.github.io/)⁶⁸.

Functional annotation of genes was performed based on the best match to the Swissprot (https://web.expasy.org/docs/swiss-prot_guideline.html), NT (https://www.ncbi.nlm.nih.gov/nucleotide/), NR (ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/nr.gz), PFAM (http://xfam.org/), eggNOG (http://eggnogdb.embl.de/), GO (http://geneontology.org/page/go-database) and KEGG (http://www.genome.jp/kegg/), database. For the final integration, information from different functional annotation sources were combined for each gene.

Non-coding RNA, including ribosome RNA (rRNA), small nuclear RNA (snRNA), microRNA (miRNA) and transfer RNA (tRNA) were identified. rRNA, snRNA, and miRNA were predicted by comparison with known noncoding RNA libraries, Rfam (http://rfam.xfam.org/), and tRNA was annotated using the tRNAscan-SE (http://lowelab.ucsc.edu/tRNAscan-SE/) software package⁶⁹.

Orthology and phylogeny

In addition to C. oryzae, 14 representative insect species: Zootermopsis nevadensis (accession number: GCA_000696155.1) (Isoptera, outgroup), Nilaparvata lugens (downloaded from OMIGA annotation project) and B. tabaci (accession number: GCF_001854935.1) (Hemiptera), Apis mellifera (accession number: GCF_003254395.2) and Nasonia vitripennis (accession number: GCF_000002325.3) (Hymenoptera), Tribolium castaneum (accession number: GCF_000002335.3) and Anoplophora glabripennis (accession number: GCA_000390285.1) (Coleoptera), Bombyx mori (accession number: GCF_000151625.1) and Chilo suppressalis (downloaded from InsectBase v2.0) (Lepidoptera), D. melanogaster (accession number: GCF_000001215.4), C. capitata (accession number: GCF_000347755.1), L. cuprina (accession number: GCF_000699065.1), Anopheles gambiae (downloaded from VectorBase) and Aedes aegypti (accession number: GCF_002204515.2) (Diptera), were selected for orthology analysis. Protein sequences translated from the longest transcripts of each gene were aligned to identify conserved orthologs with BLASTP (E-value = 1e-5). Finally, OrthoMCL was used to cluster gene families based on the BLASTP results⁷⁰.

We used protein sequences of identified single-copy genes to reconstruct the phylogeny and MUSCLE (v3.8.31) (http://www.drive5.com/muscle/) to perform multiple alignment of the protein sequence of each orthologous group. Phylogenetic analysis in PhyML (v3.0) was performed using the maximum likelihood methods with 100 bootstrap replicates⁷¹. The mcmctree (http://abacus.gene.ucl.ac.uk/software/paml.html) (burn-in = 20,000, sample-frequency = 2) in PAML (v4.9) packagewas used to estimate divergence time with the BRMC method⁷². Calibration time from the TimeTree (http://www.timetree.org/) was used to calibrate divergence time.

Gene family expansion and contraction analysis

According to the cluster analysis results of gene families, filtering them to remove gene families whose number of genes is >200 in one species and <2 in other species, and gene families whose total number of genes in the gene family is less than the number of species family. Then, using the CAFÉ software (http://sourceforge.net/projects/cafehahnlab) with PGM (probabilistic graphical models) model to simulate the acquisition and loss of genes under the specified evolutionary tree, and analyzing the expansion and contraction of gene family through hypothesis test.

Gene family identification and analysis

We first downloaded a set of reference protein sequences from NCBI GenBank and used Hidden Markov models (HMMs) to obtain references for gene identification. SPDE (v1.2)⁷³ (E-value ≤ 1e-5) was used to search for candidate genes in the C. oryzae genome. Then, genes were manually annotated by BLASTP and GENEWISE, and the number of genes is consistent with that the number we identified. 62 genes are complete and 7 genes needed to be fixed. After which a neighbor-joining tree was constructed using MEGA7⁷⁴ with the Poisson correction method and 1000 bootstrap replicate searches. The final phylogenetic tree was prepared in iTOL (v5) (http://itol.embl.de) and Adobe Illustrator (Adobe Systems, San Jose, CA, USA). The phylogenetic tree of the target gene family was constructed using genes from C. oryzae, D. melanogaster, L. cuprina, and C. capilata (Supplementary Data 3).

Location of P450 genes, HSP genes and antioxidant genes on the chromosome

To locate all identified genes on the chromosome, we first used SPDE (v1.2)⁷³ (E-value ≤ 1e-5) to search for candidate genes in the C. oryzae genome and mapped those found onto the chromosome using a GFF3 file and TBtools (v1.071)⁷⁵.

Transcriptome sequencing and analysis during larval temperature stress experiments

C. oryzae causes economically damage to rice crops and can complete second and third generations under high temperatures, which can result in outbreaks. To understand the responses of C. oryzae larvae to temperature stress, larvae were randomly assigned to one of three temperature treatment groups: 33 °C, 36 °C and 39 °C. Each group was comprised of 20 larvae and had three biological replicates. Larvae in each group were subjected to one of the above temperature treatments for 2 h whereas the control group was kept at 24 °C. At the end of the experiment larvae were frozen in liquid nitrogen for 5 min, then stored at −80 °C until required. 1.5 μg of RNA from each sample was used to construct a cDNA (Complementary DNA) library. An RNA-seq library was sequenced on an Illumina Hiseq platform. SOAPnuke, a self-developed filtering software, was used to compile statistics, and raw reads were cleaned by filtering them against reads containing adapters, poly-N and low-quality reads with trimmomatic. De novo assembly of clean reads (the removal of PCR duplicates to improve assembly efficiency) was conducted using Trinity, after which the assembled transcripts were clustered and de-duplicated using Tgicl to obtain Unigenes. Clean reads were aligned to the genomic sequence with Bowtie 2⁶¹, after which the gene expression level of each sample was calculated using RSEM⁷⁶. The DEGseq method is based on the Poisson distribution. DEGs were detected using the method described in Wang, Feng, Wang, Wang, and Zhang (2010)⁷⁷. P-values were adjusted using Benjamini and Storey’s approach to control false positives. Genes with a ≥ 2-fold difference in expression, and an adjusted P-value ≤ 0.001, were considered significantly differentially expressed.

RNA interference

RNAi was used to determine the function of target genes in oocyte maturation using the EGFP (enhanced green fluorescent protein, GenBank Accession No. U55762) gene as a parallel control. For double-strand RNA (dsRNA) preparation specific primers (Supplementary Data 4) conjugated with the T7 promoter sequence were first used for PCR amplification, and the resultant PCR products were used as templates for dsRNA synthesis. dsRNA was synthesized using the T7 RiboMAX Express RNAi System (Promega, Madison, WI, United States) according to the manufacturer’s instructions. 500 ng dsRNA was injected into each newly emerged adult female in the treatment group and the same dosage of dsEGFP injected into those in the control group. Three biological replicates were performed. The ovarian morphology of females in each group was observed under a stereomicroscope (Motic SMZ-161, Motic Group Co., Xiamen, China) 72 h after dsRNA injection.

Total RNA extraction and quantitative real-time PCR

Total RNA was extracted using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. RNA purity was verified with gel electrophoresis and its concentration measured with a Qubit^® RNA Assay Kit in Qubit® 2.0 Flurometer (Life Technologies, CA, USA). qPCR primers were designed using the NCBI profile server (http://www.ncbi.nlm.nih.gov/tools/primer-blast)(see Supplementary Data 4 for a list of the primers used). cDNA was synthesized using a PrimeScript RT Reagent Kit with gDNA Eraser (Perfect Real Time) (Takara, Dalian, China) according to the manufacturer’s instructions. cDNA templates were diluted 5 times with deionized water. qRT-PCR was performed on a CFX96 Touch^TM Real-Time PCR Detection System (Bio-Rad Laboratories, Hercules, CA, USA) in a reaction volume of 20 μl using TB Green^TM Premix Ex Taq^TM II (Takara), according to the manufacturer’s instructions. RPS15 (ribosomal protein S15) and RP49 (ribosomal protein 49) were the internal references genes⁷⁸. A two-step program was performed as follows: 95 °C for 30 s, 40 cycles at 95 °C for 10 s and 59 °C for 30 s. A melting curve analysis was performed from 55 °C to 95 °C to determine the specificity of the qPCR primers and their efficiency was verified by calculating a standard curve (cDNA concentration vs. Ct) based on the dilution gradient of the templates. The 2^-ΔΔCt method was used to calculate the relative expression levels of target genes⁷⁹.

Statistics and reproducibility

Statistical analysis of the qRT-PCR results was conducted in GraphPad Prism 8 software (GraphPad Software Inc., San Diego, CA, United States). Data are presented as mean ± standard error (SE). The statistical significance of differences in the expression of genes among temperature treatment groups was analyzed with one-way analysis of variance (ANOVA) followed by Tukey’s honestly significant difference test for multiple sample comparisons (P < 0.05). The statistical significance of differences in gene expression between the RNAi treatment and control group was evaluated with Student’s t-test (^∗P < 0.05, ^∗∗P < 0.01, ^∗∗∗P < 0.001). Go term analysis of upregulated pathways was performed using the OmicShare tools, a free online platform for data analysis (https://www.omicshare.com/tools)⁸⁰.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All raw data from the Chlorops oryzae genome have been deposited in the SRA under SRR14340331 (BioProject PRJNA728371). This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession JAIPUU000000000. The C. oryzae transcriptome data for genome annotation are also stored in the SRA (SRR7528441, SRR7528446, SRR7528467, SRR7529086, SRR7529100, SRR7533623, SRR7534236, SRR7534658 and SRR7534603) (BioProject PRJNA481388, PRJNA 481391, PRJNA 481407, PRJNA 481440, PRJNA 481449, PRJNA 481587, PRJNA 481589, PRJNA 481604, PRJNA 481612). The Hi-C data have been deposited in the NCBI GEO under the accession number GSE210874. All other relevant data are available upon request.

References

Takeda, M. Genetic basis of photoperiodic control of summer and winter diapause in geographic ecotypes of the rice stem maggot, Chlorops oryzae. Entomol. Exp. appl. 86, 59–70 (1998).
Article Google Scholar
Hirao, J. Comparative studies on the development of geographical populations from the 2- and 3-generation areas in the rice stem maggot, Chlorops oryzae Matsumura. Bull. Tohoku. Nat. Agric. 39, 137–170 (1970).
Google Scholar
Li, F. et al. Insect genomes: progress and challenges. Insect Mol. Biol. 28, 739–758 (2019).
Article CAS PubMed Google Scholar
Jiang, F., Liang, L., Wang, J. & Zhu, S. Chromosome-level genome assembly of Bactrocera dorsalis reveals its adaptation and invasion mechanisms. Commun. Biol. 5, 25 (2022).
Article CAS PubMed PubMed Central Google Scholar
Adams, M. D. et al. The genome sequence of Drosophila melanogaster. Science 287, 2185–2195 (2000).
Article PubMed Google Scholar
Holt, R. A. et al. The genome sequence of the Malaria Mosquito Anopheles gambiae. Science 298, 129–149 (2002).
Article CAS PubMed Google Scholar
Scott, J. G. et al. Genome of the house fly, Musca domestica L., a global vector of diseases with adaptations to a septic environment. Genome Biol. 15, 466 (2014).
Article PubMed PubMed Central Google Scholar
Papanicolaou, A. et al. The whole genome sequence of the Mediterranean fruit fly, Ceratitis capitata (Wiedemann), reveals insights into the biology and adaptive evolution of a highly invasive pest species. Genome Biol. 17, 192 (2016).
Article PubMed PubMed Central CAS Google Scholar
Pollard, E. & Yates, T. J. Monitoring Butterflies for Ecology and Conservation (Chapman & Hall, London,1993).
Bale, J. S. et al. Herbivory in global climate change research: direct effects of rising temperature on insect herbivores. Glob. Chang. Biol. 8, 1–16 (2002).
Article Google Scholar
IPCC. Climate Change 2013 -Quotations (IPCC, 2014).
Wang, Y. J., Zhou, B. T., Ren, Y. Y. & Sun, C. H. Impacts of global climate change on China climate security. J. Appl. Meteorol. Sci. 27, 750–758 (2016). (in Chinese).
Google Scholar
Su, H. et al. Comparative transcriptome profiling reveals candidate genes related to insecticide resistance of Glyphodes pyloalis. Bull. Entomol. Res. 110, 57–67 (2019).
Article PubMed CAS Google Scholar
Liu, N. Insecticide resistance in mosquitoes: impact, mechanisms, and research directions. Annu. Rev. Entomol. 60, 537–559 (2015).
Article CAS PubMed Google Scholar
Feyereisen, R. In Comprehensive Molecular Insect Science (Gilbert, L. I. et al.) 1–77 (Elsevier BV, Amsterdam, 2005).
Ranson, H. et al. Evolution of supergene families associated with insecticide resistance. Science 298, 179–181 (2002).
Article CAS PubMed Google Scholar
Li, H. et al. Comparative transcriptome analysis of the heat stress response in Monochamus alternatus Hope (Coleoptera: Cerambycidae). Front. Physiol. 10, 1568 (2020).
Article PubMed PubMed Central Google Scholar
Xiong, Y. et al. Comparative transcriptome analysis reveals differentially expressed genes in the Asian citrus psyllid (Diaphorina citri) upon heat shock. Comp. Biochem. Physiol., Part D: Genomics Proteom. 30, 256–261 (2019).
CAS Google Scholar
Roy, S., Saha, T. T., Zou, Z. & Raikhel, A. S. Regulatory pathways controlling female insect reproduction. Annu. Rev. Entomol. 63, 489–511 (2018).
Article CAS PubMed Google Scholar
Li, Y. et al. Chromosome-level assembly of the mustache toad genome using third-generation DNA sequencing and Hi-C analysis. GigaScience https://doi.org/10.1093/gigascience/giz114 (2019).
Wan, F. H. A chromosome-level genome assembly of Cydia pomonella provides insights into chemical ecology and insecticide resistance. Nat. Commun. 10, 4237 (2019).
Article PubMed PubMed Central CAS Google Scholar
Schmidt, M. H. W. De Novo assembly of a new Solanum pennellii accession using Nanopore sequencing. Plant Cell 29, 2336–2348 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wu, H. et al. A high-quality Actinidia chinensis (kiwifruit) genome. Hortic. Res. 6, 117 (2019).
Article PubMed PubMed Central CAS Google Scholar
Scott, J. G. & Wen, Z. Cytochromes P450 of insects: the tip of the iceberg. Pest Manag. Sci. 57, 958–967 (2001).
Article CAS PubMed Google Scholar
Wang, H. CYP6AE gene cluster knockout in Helicoverpa armigera reveals role in detoxification of phytochemicals and insecticides. Nat. Commun. 9, 4820 (2018).
Article PubMed PubMed Central CAS Google Scholar
Li, S. et al. The genomic and functional landscapes of developmental plasticity in the American cockroach. Nat. Commun. 9, 1–11 (2018).
CAS Google Scholar
Rane, R. V. et al. Are feeding preferences and insecticide resistance associated with the size of detoxifying enzyme families in insect herbivores? Curr. Opin. Insect Sci. 13, 70–76 (2016).
Article PubMed Google Scholar
King, A. M. & MacRae, T. H. Insect heat shock proteins during stress and diapause. Annu. Rev. Entomol. 60, 59–75 (2015).
Article CAS PubMed Google Scholar
Richter, K., Haslbeck, M. & Buchner, J. The heat shock response: life on the verge of death. Mol. Cell 40, 253–266 (2010).
Article CAS PubMed Google Scholar
Guo, X. & Feng, J. Comparisons of expression levels of heat shock proteins (hsp70 and hsp90) from Anaphothrips obscurus (Thysanoptera: Thripidae) in polymorphic adults exposed to different heat shock treatments. J. Insect Sci. 18, 1–10 (2018).
Article CAS Google Scholar
Wang, X. R. et al. Genome-wide identification and characterization of HSP gene superfamily in whitefly (Bemisia tabaci) and expression profiling analysis under temperature stress. Insect Sci. 1, 44–57 (2019).
Article CAS Google Scholar
Gkouvitsas, T., Kontogiannatos, D. & Kourti, A. Expression of the Hsp83 gene in response to diapause and thermal stress in the moth Sesamia nonagrioides. Insect Mol. Biol. 18, 759–768 (2009).
Article CAS PubMed Google Scholar
Bettencourt, B. R., Hogan, C. C., Nimali, M. & Drohan, B. W. Inducible and constitutive heat shock gene expression responds to modification of Hsp70 copy number in Drosophila melanogaster but does not compensate for loss of thermotolerance in Hsp70 null flies. BMC Biol. 6, 5 (2008).
Article PubMed PubMed Central CAS Google Scholar
Lu, Z. C. & Wan, F. H. Using double-stranded RNA to explore the role of heat shock protein genes in heat tolerance in Bemisia tabaci (Gennadius). J. Exp. Biol. 214, 764–769 (2011).
Article CAS PubMed Google Scholar
Raquel, M. F., Mercedes de la, F., Gloria, M. & José-Luis, M. G. Characterization of six small HSP genes from Chironomus riparius (Diptera, Chironomidae): Differential expression under conditions of normal growth and heat-induced stress. Comp. Biochem. Physiol., Part A: Mol. Integr. Physiol. 188, 76–86 (2015).
Article CAS Google Scholar
Ponnuvel, K. M., Murthy, G. N., Awasthi, A. K., Rao, G. & Vijayaprakash, N. B. Differential gene expression during early embryonic development in diapause and non-diapause eggs of multivoltine silkworm Bombyx mori. Indian J. Exp. Biol. 48, 1143–1151 (2010).
CAS PubMed Google Scholar
Nguyen, T. M., Bressac, C. & Chevrier, C. Heat stress affects male reproduction in a parasitoid wasp. J. Insect Physiol. 59, 248–254 (2013).
Article CAS PubMed Google Scholar
Gu, J., Huang, L. X., Shen, Y., Huang, L. H. & Feng, Q. L. Hsp70 and small Hsps are the major heat shock protein members involved in midgut metamorphosis in the common cutworm, Spodoptera litura. Insect Mol. Biol. 21, 535–543 (2012).
Article CAS PubMed Google Scholar
Stadtman, E. R. & Levine, R. L. Free radical-mediated oxidation of free amino acids and amino acid residues in proteins. Amino Acids 25, 207–218 (2003).
Article CAS PubMed Google Scholar
Cossu, C. et al. Glutathione reductase, selenium-dependent glutathione peroxidase, glutathione levels and lipid peroxidation in freshwater bivalves, Unio tumidus, as biomarkers of aquatic contamination in field studies. Ecotoxicol. Environ. Saf. 38, 122–131 (1997).
Article CAS PubMed Google Scholar
Park, M. S., Jo, P. G., Choi, Y. K., An, K. W. & Choi, C. Y. Characterization and mRNA expression of Mn-SOD and physiological responses to stresses in the Pacific oyster Crassostrea gigas. Mar. Biol. Res. 5, 451–461 (2009).
Article Google Scholar
Qin, G. et al. Characterization and functional analysis of four glutathione S transferases from the migratory locust, Locusta migratoria. PLoS ONE 8, e58410 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chen, K. K. et al. Transcription analysis of the stress and immune response genes to temperature stress in Ostrinia furnacalis. Front. Physiol. 10, 1289 (2019).
Article PubMed PubMed Central Google Scholar
Yang, L. H., Huang, H. & Wang, J. J. Antioxidant responses of citrus red mite, Panonychus citri (McGregor) (Acari: Tetranychidae), exposed to thermal stress. J. Insect Physiol. 56, 1871–1876 (2010).
Article CAS PubMed Google Scholar
Lee, K. Characterization of a silkworm thioredoxin peroxidase that is induced by external temperature stimulus and viral infection. Insect Biochem. Mol. Biol. 35, 73–84 (2005).
Article CAS PubMed Google Scholar
Kang, Z. W. et al. The potential coordination of the heat-shock proteins and antioxidant enzyme genes of Aphidius gifuensis in response to thermal stress. Front. Physiol. 8, 976 (2017).
Article PubMed PubMed Central Google Scholar
Yue, Y. et al. Involvement of Met and Kr-h1 in JH-mediated reproduction of female Bactrocera dorsalis (Hendel). Front. Physiol. 9, 482 (2018).
Article PubMed PubMed Central Google Scholar
Song, J., Wu, Z., Wang, Z., Deng, S. & Zhou, S. Kruppel-homolog 1 mediates juvenile hormone action to promote vitellogenesis and oocyte maturation in the migratory locust. Insect Biochem. Mol. Biol. 52, 94–101 (2014).
Article CAS PubMed Google Scholar
Zhang, W. N. et al. Dissecting the role of Kruppel homolog 1 in the metamorphosis and female reproduction of the cotton bollworm, Helicoverpa armigera. Insect Mol. Biol. 27, 492–504 (2018).
Article CAS PubMed Google Scholar
Luo, M. et al. Juvenile hormone differentially regulates two Grp78 genes encoding protein chaperones required for insect fat body cell homeostasis and vitellogenesis. J. Biol. Chem. 292, 8823–8834 (2017).
Article CAS PubMed PubMed Central Google Scholar
Roy, S. G. & Raikhel, A. S. The small GTPase Rheb is a key component linking amino acid signaling and TOR in the nutritional pathway that controls mosquito egg development. Insect Biochem. Mol. Biol. 41, 62–69 (2011).
Article CAS PubMed Google Scholar
Sheng, Z., Xu, J., Bai, H., Zhu, F. & Palli, S. R. Juvenile hormone regulates vitellogenin gene expression through insulin-like peptide signaling pathway in the red flour beetle, Tribolium castaneum. J. Biol. Chem. 286, 41924–41936 (2011).
Article CAS PubMed PubMed Central Google Scholar
Abrisqueta, M., Suren-Castillo, S. & Maestro, J. L. Insulin receptor mediated nutritional signalling regulates juvenile hormone biosynthesis and vitellogenin production in the German cockroach. Insect Biochem. Mol. Biol. 49, 14–23 (2014).
Article CAS PubMed Google Scholar
Brown, M. R. et al. An insulin-like peptide regulates egg maturation and metabolism in the mosquito. Aedes aegypti. Proc. Natl Acad. Sci. USA 105, 5716–5721 (2008).
Article CAS PubMed Google Scholar
Tatar, M. et al. A mutant Drosophila insulin receptor homolog that extends life-span and impairs neuroendocrine function. Science 292, 107–110 (2001).
Article CAS PubMed Google Scholar
Xu, J., Sheng, Z. & Palli, S. R. Juvenile hormone and insulin regulate trehalose homeostasis in the red flour beetle, Tribolium castaneum. PLoS Genet. 9, e1003535 (2013).
Article CAS PubMed PubMed Central Google Scholar
Tu, M. P., Yin, C. M. & Tatar, M. Mutations in insulin signaling alter juvenile hormone synthesis in Drosophila melanogaster. Gen. Comp. Endocrinol. 142, 347–356 (2005).
Article CAS PubMed Google Scholar
Maestro, J. L., Cobo, J. & Bellés, X. Target of rapamycin (TOR) mediates the transduction of nutritional signals into juvenile hormone production. J. Biol. Chem. 284, 5506–5513 (2009).
Article CAS PubMed Google Scholar
Lu, K., Chen, X., Liu, W. T. & Zhou, Q. TOR pathway-mediated juvenile hormone synthesis regulates nutrient-dependent female reproduction in Nilaparvata lugens (Sta˚l). Int. J. Mol. Sci. 17, 438 (2016).
Article PubMed PubMed Central CAS Google Scholar
Pérez-Hedo, M., Rivera-Perez, C. & Noriega, F. G. The insulin/TOR signal transduction pathway is involved in the nutritional regulation of juvenile hormone synthesis in Aedes aegypti. Insect Biochem. Mol. Biol. 43, 495–500 (2013).
Article PubMed PubMed Central CAS Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259–269 (2015).
Article PubMed PubMed Central CAS Google Scholar
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinf. 25, 1–14 (2009).
Article Google Scholar
Gary, B. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
Article Google Scholar
Roberts, A., Pimentel, H., Trapnell, C. & Pachter, L. Identification of novel transcripts in annotated genomes using RNA-Seq. Bioinformatics 27, 2325–2329 (2011).
Article CAS PubMed Google Scholar
McGinnis, S. & Madden, T. L. BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res. 32, W20–W25 (2004).
Article CAS PubMed PubMed Central Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and genomewise. Genome Res. 14, 988–995 (2004).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 9, R7 (2008).
Article PubMed PubMed Central CAS Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Article CAS PubMed PubMed Central Google Scholar
Li, L., Stoeckert, C. J. & Roos, D. S. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 13, 2178–2189 (2003).
Article CAS PubMed PubMed Central Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS PubMed Google Scholar
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Article CAS PubMed Google Scholar
Xu, D. et al. SPDE: A multi-functional software for sequence processing and data extraction. Bioinformatics 37, 3686–3687 (2020).
Article CAS Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202 (2020).
Article CAS PubMed Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinf. 12, 323 (2011).
Article CAS Google Scholar
Wang, L., Feng, Z., Wang, X., Wang, X. & Zhang, X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics 26, 136–138 (2010).
Article PubMed CAS Google Scholar
Tian, P. Evaluation of appropriate reference genes for investigating gene expression in Chlorops oryzae (Diptera: Chloropidae). J. Econ. Entomol. 112, 1–8 (2019).
Article CAS Google Scholar
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2^-ΔΔCT method. Methods 25, 402–408 (2001).
Article CAS PubMed Google Scholar
Qie, C. et al. Single-cell RNA-Seq reveals the transcriptional landscape and heterogeneity of skin macrophages in Vsir^-/- murine psoriasis. Theranostics 10, 10483–10497 (2020).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Annoroad Gene Technology (Beijing) Co., Ltd. for technical assistance. This work was supported by the Changsha Municipal Natural Science Foundation [grant number kp2007019], the Double first-class construction project of Hunan Agricultural University [grant number SYL2019029], and the Hunan Provincial Department of Education Project [grant number 18B096].

Author information

Authors and Affiliations

Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, College of Plant Protection, Hunan Agricultural University, Changsha, 410128, China
Ailin Zhou, Hualiang He, Wenbing Ding, Jin Xue, Youzhi Li & Lin Qiu
Hunan Provincial Engineering & Technology Research Center for Biopesticide and Formulation Processing, Changsha, 410128, China
Ailin Zhou, Wenbing Ding & Youzhi Li
Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
Cong Huang
Plant Protection and Inspection Station, Agriculture and Rural Development of Hunan Province, Changsha, 410005, China
Yi Li, Xinwen Li & Zhengbing Zhang

Authors

Ailin Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Cong Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Li
View author publications
You can also search for this author in PubMed Google Scholar
Xinwen Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhengbing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hualiang He
View author publications
You can also search for this author in PubMed Google Scholar
Wenbing Ding
View author publications
You can also search for this author in PubMed Google Scholar
Jin Xue
View author publications
You can also search for this author in PubMed Google Scholar
Youzhi Li
View author publications
You can also search for this author in PubMed Google Scholar
Lin Qiu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.Q. and Yo.L. conceived and designed the works of the whole paper. Yi.L., X.L. and Z.Z. prepared the samples for sequencing. C.H., A.Z. and H.H. performed gene family analysis. W.D. and J.X. did the chromosome location analysis. A.Z. performed functional experiments. A.Z. and L.Q. interpreted the data and wrote the manuscript. C.H., Yo.L. and L.Q. improved the manuscript.

Corresponding authors

Correspondence to Youzhi Li or Lin Qiu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Zachary Cohen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: George Inglis.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhou, A., Huang, C., Li, Y. et al. A chromosome-level genome assembly provides insights into the environmental adaptability and outbreaks of Chlorops oryzae. Commun Biol 5, 881 (2022). https://doi.org/10.1038/s42003-022-03850-7

Download citation

Received: 23 February 2022
Accepted: 16 August 2022
Published: 26 August 2022
DOI: https://doi.org/10.1038/s42003-022-03850-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Assembly and annotation of the C. oryzae genome

Gene orthology and phylogenetic analysis

Gene family expansion and contraction

The C. oryzae cytochrome P450 gene

Thermal stress response

Reproductive development

Discussion

Methods

Insects

Genome sequencing

Generation of short reads for genome correction

K-mer analysis

Genome assembly

Hi-C

Genome annotation

Orthology and phylogeny

Gene family expansion and contraction analysis

Gene family identification and analysis

Location of P450 genes, HSP genes and antioxidant genes on the chromosome

Transcriptome sequencing and analysis during larval temperature stress experiments

RNA interference

Total RNA extraction and quantitative real-time PCR

Statistics and reproducibility

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links