Data integration aids understanding of butterfly–host plant networks

Muto-Fujita, Ai; Takemoto, Kazuhiro; Kanaya, Shigehiko; Nakazato, Takeru; Tokimatsu, Toshiaki; Matsumoto, Natsushi; Kono, Mayo; Chubachi, Yuko; Ozaki, Katsuhisa; Kotera, Masaaki

doi:10.1038/srep43368

Download PDF

Article
Open access
Published: 06 March 2017

Data integration aids understanding of butterfly–host plant networks

Ai Muto-Fujita¹^na1,
Kazuhiro Takemoto²^na1,
Shigehiko Kanaya³,
Takeru Nakazato⁴,
Toshiaki Tokimatsu⁵,
Natsushi Matsumoto⁶,
Mayo Kono⁷,
Yuko Chubachi⁷,
Katsuhisa Ozaki⁸ &
…
Masaaki Kotera⁷

Scientific Reports volume 7, Article number: 43368 (2017) Cite this article

6146 Accesses
21 Citations
17 Altmetric
Metrics details

Subjects

Abstract

Although host-plant selection is a central topic in ecology, its general underpinnings are poorly understood. Here, we performed a case study focusing on the publicly available data on Japanese butterflies. A combined statistical analysis of plant–herbivore relationships and taxonomy revealed that some butterfly subfamilies in different families feed on the same plant families, and the occurrence of this phenomenon more than just by chance, thus indicating the independent acquisition of adaptive phenotypes to the same hosts. We consequently integrated plant–herbivore and plant–compound relationship data and conducted a statistical analysis to identify compounds unique to host plants of specific butterfly families. Some of the identified plant compounds are known to attract certain butterfly groups while repelling others. The additional incorporation of insect–compound relationship data revealed potential metabolic processes that are related to host plant selection. Our results demonstrate that data integration enables the computational detection of compounds putatively involved in particular interspecies interactions and that further data enrichment and integration of genomic and transcriptomic data facilitates the unveiling of the molecular mechanisms involved in host plant selection.

Genome-wide macroevolutionary signatures of key innovations in butterflies colonizing new host plants

Article Open access 13 January 2021

Rémi Allio, Benoit Nabholz, … Fabien L. Condamine

Evaluating insect-host interactions as a driver of species divergence in palm flower weevils

Article Open access 09 December 2020

Bruno A. S. de Medeiros & Brian D. Farrell

Symbioses shape feeding niches and diversification across insects

Article Open access 18 May 2023

Charlie K. Cornwallis, Anouk van ’t Padje, … Lee M. Henry

Introduction

Herbivorous insects and their host plants have been engaged in a chemical arms race for more than 420 million years¹. Whilst host plants have developed various chemicals for defence against herbivorous insect damage, herbivorous insects have evolved various countermeasures^2,3. Some insects even use plant chemicals for their own benefit⁴. Most chemical ecology studies have focused on a small number of specific species rather than systematically exploring a wide range of species.

Many herbivorous butterflies feed on specific host plants, thus forming plant–herbivore networks. In recent years, large-scale data on ecological networks, including plant–herbivore networks, have become available with the development of new observation techniques and with improvements in databases and other infrastructure. Network analysis techniques resulting from the development of network science⁵ have been used in ecology to actively investigate ecological networks, with respect to both basic scientific research (e.g., structure–stability relationships) and applied ecology (e.g., biodiversity maintenance and environmental assessment)^6,7,8.

Taxonomic families of herbivorous butterflies generally correspond to those of host plants, thus reflecting the close co-evolutionary relationship between butterflies and plants. A correlation has been observed between changes in host plants and species diversification in the butterfly superfamily Papilionidea⁹. Numerous observations of host–plant selection have established that this selection is controlled by chemical constraints, i.e., several insect species recognise plants by detecting their chemical components^{10,11,12,13,14,15,16}. Nevertheless, the molecular mechanisms that determine host–plant shifts are poorly understood¹⁶. A general understanding of the molecular factors involved in host–plant selection requires detailed chemical and genomic studies on a wide range of insects and plants.

To address this problem, we integrated biological data to gain a comprehensive picture of the contribution of plant chemical compounds to host plant selection and to identify candidate plant compounds involved in specific interactions between butterfly and plant families. This required us to first comprehensively collect data on plant–herbivore relationships involving Japanese butterflies, which were not previously available in the HOSTS lepidopteran host plant database [ http://www.nhm.ac.uk/our-science/data/hostplants/].

The Japanese Islands lie at the convergence of three terrestrial ecozones, namely, the Palearctic (including Europe, Asia north of the Himalayan foothills, northern Africa, and the northern and central Arabian Peninsula), the Oriental (extending across most of South and Southeast Asia and into southern East Asia), and Oceania. Although the flora and fauna of the Japanese Islands are influenced by all three ecozones, they retain their unique status in these islands, which extend from the subarctic to subtropical zones and include many highlands. These geographic and ecological features, as well as an abundance of accumulated knowledge from observations of area-specific populations of butterflies, are attractive targets for research on species diversification. We have generated a complete plant–herbivore network for the Japanese Islands. This network, designated the InsectInDB database (http://insect-plant.org/), comprises 545 insects, 1,922 plants and 3,435 insect–plant relationships.

In this study, we used network analysis combined with taxonomic information to clearly demonstrate that butterfly subfamilies in different families feed on the same plant groups, thereby indicating the independent acquisition of the same adaptive phenotypes (i.e., to the same hosts). The incorporation of chemical compound data allowed us to consider the contributions of plant compounds to butterfly host–plant selection by comparing two hypotheses: a “feed-on-family” hypothesis in which butterflies preferentially feed on host plants in the same family, and a “feed-on-compound” hypothesis in which butterflies select plants that produce similar compounds. Statistical analysis enabled the identification of compounds specific to host plants of particular butterfly families. We determined that some of these compounds are known to attract limited groups of butterflies and repel others. An integration of the insect–compound relationship data revealed potential metabolic processes related to host-plant selection. Our results successfully demonstrated the value of data integration for the computational detection of candidate host plant selection compounds. Finally, the further integration of genomic and transcriptomic data allowed us to unveil the putative molecular mechanism of host plant selection.

Results

Study strategy overview

Before describing the approach used in this study, we first will define the terms used in this paper to describe different plant–insect relationships. Several sets of words, including generalist/specialist and monophagous/oligophagous/polyphagous, have been used to represent the range of insect host preferences¹⁷. When discussing plant–herbivore relationships, a specialist is an herbivore that has evolved a specific mechanism to feed on a particular host plant, whereas a generalist is one that has not. Monophagous, oligophagous and polyphagous refer to the number of host plants fed on, i.e., only one, a few, or many host plants, respectively. Alternatively, oligophagous is defined as the use of multiple host plant species from the same family¹⁸. Generalist/specialist can also refer to the breadth of plant selection by pollinator insects¹⁹. In many cases, however, these words are ambiguous. For example, “specialist” and “monophagous” typically indicate that an herbivore feeds on a single host plant species, but they are sometimes applied to an herbivore feeding on an entire family. To avoid confusion, we define the number of host plant species and host plant families as “mono-species-phagous/oligo-species-phagous/poly-species-phagous” and “mono-family-phagous/oligo-family-phagous/poly-family-phagous”, respectively (Fig. 1a). Similar to oligophagous/polyphagous, the distinction between “oligo-” (several) and “poly-” (many) is somewhat subjective. In this paper, we use these words to avoid ambiguity when discussing plant–herbivore relationships at the species and family levels.

**Figure 1: Overview of the strategy used in this study.**

To achieve a general understanding of host plant selection by Japanese butterflies, we considered plant–herbivore relationships (Fig. 1b) and the presence of compounds in various plant species (Fig. 1c) as follows. First, plant–herbivore relationships between butterflies and their host plants were represented as a bipartite network (Fig. 1d) in which a node represents a butterfly or plant species and an edge indicates that the butterfly feeds on the host plant. Second, the network was divided into sub-networks according to taxonomic family (Fig. 1e). The combined use of plant–herbivore relationship (Fig. 1b) and plant–compound relationship (Fig. 1c) data allowed us to identify compounds common to a group of host plants that are specific to a group of butterflies (in order to distinguish from just “common” compounds, we refer to this type of compounds as “common specific” compounds in this paper). For example, suppose that butterflies B4 and B5 feed specifically on plants P3 and P4 (Fig. 1b). Among the compounds shown in Fig. 1c, C1 is “common” to all plants, which indicates that C1 may be important for plant survival in general but may not be involved in interactions with specific butterflies. Compound C2 is common specific to plants P3 and P4 but not P1 and P2, indicating the possibility that C2 functions to attract butterflies B4 and B5 and repel B1, B2 and B3 (Fig. 1f). When we examined the contributions of these common specific compounds to host plant selection, we found that some were, in fact, important in the plant–herbivore relationship. Finally, a comparison of compounds identified as participants in plant–herbivore relationships unveiled insect metabolic processes that may be involved in the interaction (Fig. 1g).

General architecture of the Japanese butterfly plant–herbivore network

Several seminal studies have revealed that taxonomically related host plants are used by particular butterfly families^20,21. In the present study, we adopted the most recently reported phylogenies of angiosperm plants (APGIII)²² and butterflies²³ to examine the relationships between butterfly–host plant associations in the Japanese Islands. To analyse the network architecture as a whole, we conducted a comprehensive literature survey in which 3,435 plant–herbivore relationships encompassing 1,922 plant species and all 545 butterfly species living in Japan were examined. All members of the superfamily Rhopalocera except for Riodinidae are found in the Japanese Islands; these butterflies are classified into the families Hesperiidae, Papilionidae, Pieridae, Lycaenidae and Nymphalidae. Figure 2a shows the distribution of butterfly families in the Japanese Islands and global ecozones. We conducted a similarity analysis based on Pearson’s correlation coefficients (r) of the relative proportions of butterfly families in different ecozones, which indicated that the butterfly composition of the Japanese Islands is similar to those of the Holarctic (r = 0.92) and Oriental (r = 0.91) ecozones. This result may imply that butterfly distributions in the Japanese Islands are more influenced by distributions in the Holarctic and Oriental ecozones than by those in Oceania or other ecozones, however, it still lacks solid evidence and analysis. The number of butterfly species in Japan (302) is much smaller than the numbers in other ecozones (2,224, 2,411, 1,274, 3,964 and 7,784 in Holarctic, Oriental, Oceania, Afrotropical and Neotropical ecozones, respectively). Another factor that may need to be considered is that Papilionidae butterflies are especially popular in Japan and have been extensively catalogued.

Figure 2b, which shows the relationship between the number of host-plant families and butterfly species, clearly demonstrates that host plant preferences differ among butterfly families. For example, Hesperiidae and Lycaenidae both feed on a wide range of host plant species; however, the butterflies in the first family feed on members of one or two specific host plant families, whereas butterflies in the latter family feed on a wider variety of host plant families. To distinguish between these behaviours, we therefore prefer to use mono-species-phagous/oligo-species-phagous/poly-species-phagous and mono-family-phagous/oligo-family-phagous/poly-family-phagous, as proposed above, rather than the existing terms, specialist/generalist and monophagous/oligophagous/polyphagous. The matrix representation in Fig. 2c clearly reveals some characteristics of the host plant selectivity of Japanese butterflies. For example, the genus Papilio in Papilionidae and the subfamilies Pierinae in Pieridae and Heliconiinae in Nymphalidae are generally mono-family-phagous and have specific relationships with host plants in Rutaceae, Brassicaceae and Violaceae, respectively. In contrast, some butterfly subfamilies in different families exhibit shared host plant selectivity: for example, the subfamily Coliadinae in Pieridae and the subfamily Polyommatinae in Lycaenidae both feed on plants in Fabaceae, whereas the subfamily Hesperiinae in Hesperiidae and the subfamily Satyrinae in Nymphalidae both interact with host plants in Poaceae. Poly-family-phagous butterflies can also be identified from the matrix. For example, butterflies in the subfamily Theclinae in Lycaenidae generally feed on more than three host-plant families.

An examination of the entire network of plant–herbivore relationships (Fig. 3) also reveals that butterfly species in the same family tend to share the same host-plant species and families. The Papilionidae, Nymphalidae and Pieridae families have their own predominant sub-networks (upper left, upper middle and upper right in Fig. 3, respectively). Rutaceae and Brassicaceae are examples of plant families predominantly fed on by single butterfly families (Papilionidae and Pieridae, respectively). In contrast, two or more butterfly families (Hesperiidae, Lycaenidae and Nymphalidae) share for the plants of the families Poaceae and Fabaceae.

**Figure 3: Plant–herbivore network of all Japanese butterfly species.**

To further analyse plant–herbivore relationships, we simplified the network to describe significant interactions between plant and butterfly families (Fig. 4a) with significance evaluated based on real vs. randomised network Z-scores (see Methods). Positive Z-scores indicated that plants in a particular family are significantly more likely to be eaten by insects from a certain family than by those in the randomised networks. As shown in Fig. 4a, the five butterfly families clearly form selective plant–herbivore relationships. For example, Papilionidae butterflies interact specifically with the families Rutaceae, Apiaceae, Aristolochiaceae and Lauraceae, thus forming an isolated network. The remaining four butterfly families also have their own specific host plant families, with two exceptions: Coliadinae subfamily in Pieridae and Polyommatinae subfamily in Lycaenidae butterflies share Fabaceae plants, and Hesperiinae subfamily in Hesperiidae and Satyrinae subfamily in Nymphalidae butterflies share plants in Poaceae. These butterfly subfamilies are in different families. Nonetheless, the families have independently acquired the ability to use the same groups of plants as hosts.

Figure 4b shows significant plant–herbivore relationships between plant families and butterfly subfamilies. For example, the two subfamilies of Papilionidae (Parnassiinae and Papilioninae) have significantly different host plant families, and the same is true for the two Pieridae families (Pierinae and Coliadinae). Among the butterfly subfamilies in Lycaenidae, Curetinae and Miletinae have no specific host plant families, Polyommatinae and Theclinae are poly-family-phagous, and Lycaeninae is mono-family-phagous. The subfamily Satyrinae in Nymphalidae and the subfamily Hesperiinae in Hesperiidae might be regarded as generalist taxa because they feed on two or more host plant families. Because their host-plants are mostly restricted to a single order, we instead consider these latter butterflies to be mono-order-phagous.

With respect to butterfly families (Fig. 4a), the plant families Poaceae and Fabaceae are significantly shared by two butterfly families. At the butterfly subfamily level (Fig. 4b), plants in Fabaceae are consumed by the subfamily Coliadinae in Pieridae and by the subfamily Polyommatinae in Lycaenidae. Members of Poaceae serve as host plants for both the subfamily Satyrinae in Nymphalidae and the subfamily Hesperiinae in Hesperiidae. We found that these two plant families differ with respect to the types of butterflies that consume them. In particular, Poaceae is significantly eaten by subfamilies that are mono-order-phagous (Hesperiinae and Satyrinae), whilst Fabaceae is eaten at a significant level by poly-family-phagous subfamilies (Coliadinae, Polyommatinae, Cyrestinae and Limenitidinae).

A negative Z-score indicates that plants belonging to a particular family are significantly less subject to predation by a given insect family compared with those in randomised networks. Thus far, we have shown that Poaceae and Fabaceae are eaten by a wide range of butterflies (Fig. 4a). However, having analyzed the relationships with butterfly subfamilies, we also found that Poaceae and Fabaceae are consumed differently (Fig. 4b). Poaceae members have no significant relationships exhibiting negative Z-scores, indicating that there are no butterfly subfamilies that significantly avoid Poaceae plants. On the other hand, Fabaceae members have some relationships exhibiting negative Z-scores, i.e., significantly less numbers of host-plant relationships with Papilioninae, Theclinae, Nymphalinae and Satyrinae. To explain this difference, we hypothesised the presence/absence of some factors that work as repellent for some butterflies but as attractant for some other butterflies. In other words, Fabaceae plants are eaten by a wide range of butterflies because they possess some factors that exert both significantly positive (i.e., attractive) and negative (i.e., repellent) effects, whereas Poaceae plants are devoured by a wide range of butterflies because they have attraction factors but not repellent factors.

Influence of plants on host-plant selection

In the previous section, we demonstrated that host-plant selection is highly associated with butterfly taxonomic classifications and implied the existence of factors influencing host-plant selection. To provide more conclusive evidence for this idea, we explored two possible hypotheses: the “feed-on-family” hypothesis or the “feed-on-compound” hypothesis (Fig. 1h). To test the likelihood of the feed-on-family hypothesis, we calculated Z-score statistics based on family-based evenness for butterfly category (family or subfamily) i, E^real(i), and the evenness of randomised networks, E^rand(i) (see Methods). We found that the average E^real(i) (0.03) was significantly lower than the average E^rand(i) (0.16) (Z = –61.9; p < 2.2 × 10^–16 using the Z-test).

We likewise examined the possibility of the feed-on-compound hypothesis. To determine whether a set of host plants of a butterfly species tended to possess a higher number of common specific compounds compared with those from randomly selected sets, we applied the food-pairing hypothesis²⁴ to our datasets. For the statistical test, we used the average number of compounds shared among host plants by butterfly species k, N_s(k)²⁴. From the original plant–herbivore network, we extracted all plant–herbivore relationships in which at least one compound was present in the host plant according to the species-metabolite relationship database KNApSAcK [25]. This extracted network, which consists of 216 butterfly and 405 plant species, is subsequently referred to as the real-world network. A comparison of real-world and randomised network N_s(k) values (N_s^real(k) and N_s^rand(k), respectively) revealed that the average N_s^real (k) (0.37) was significantly larger than the average N_s^rand (k) (0.09) (Z = 8.9; p < 2.2 × 10^–16 using the Z-test).

We focused on the statistics for each butterfly family for further evaluation. In particular, we used the average family-based evenness for butterfly family i, E(i), and the average number of compounds shared among host plants by butterfly species k, N_s(k). Figure 5a shows the relationship between Z-scores of average plant family-based evenness for butterfly families, Z^E(i) = (E^real(i) – E^rand(i))/SD_E(i)^rand, and the Z-scores of the average number of common specific compounds in host plants, Z^Ns(k) = (N_s^real(k) – N_s^rand(k))/SD_Ns(k)^rand. E^real(i) [N_s^real(k)] and E^rand(i) [N_s^rand(k)] correspond to E(i) [N_s(k)] obtained from the real-world network and randomized networks, respectively. SD_E(i)^rand, and SD_Ns(k)^rand are the standard deviations for E^rand(i) and N_s^rand(k), respectively (Data and Methods). Figure 5b is the same scatterplot for the butterfly subfamilies. For butterfly families and subfamilies, the Z-scores of evenness were mostly negative, which indicated that the actual host-plant relationship was more selective than random. The Z-scores of N_s were positive for the butterfly families Pieridae, Lycaenidae, and Papilionidae as well as for many subfamilies, which indicated that the host plants possessed more common specific compounds than would be expected by random chance. As clearly shown in Fig. 5a and b, the Z-scores of evenness and those of N_s were negatively correlated, indicating that the common specific compounds positively contributed to host-plant selection. For example, Pieridae and its subfamilies (Pierinae and Coliadinae) are significantly lower in evenness and higher in N_s compared with other families and subfamilies. Together with the previously known phenomenon that many phytophagous insects are highly adapted to defense chemicals produced by plants and use them as the specific host-finding cues^14,15, the results obtained in this study provide quantitative support for our hypothesis that insects tend to selectively feed on closely-related plants sharing common specific chemical compounds.

We assumed that the global trend of host-plant selectivity was a mixture of different levels of feed-on-compound effects, i.e., some plants express a strong feed-on-compound effect, whereas others do not. Based on this idea, we determined the compound contribution of each plant species i, denoted as χ_i (see Methods). According to our calculations, most plant species had a positive value of χ_i, thus contributing positively to the feed-on-compound effect (Fig. 5c and d).

The relationship between χ_i and degree (i.e., the number of butterflies that predate on the respective plant species) is shown in Fig. 5c. Two Rutaceae species (Zanthoxylum ailanthoides and Phellodendron amurense) exerted particularly high feed-on-compound effects, whilst a species in Poaceae (Oryza sativa) had a negative effect. We found that χ_i was moderately positively correlated with degree (Spearman’s rank correlation coefficient r_s = 0.53, p < 2.2 × 10^–16). This correlation indicated that the presence of common specific plant compounds generally attracts phytophagous butterflies, an observation that supports the feed-on-compound hypothesis.

Figure 5d shows the relationship between plant family χ_i and average neighbour degree (i.e., the average number of host-plant families consumed by the butterfly herbivores of the respective plant species). Although the figure indicates that most plant families are eaten by mono-family-phagous butterflies (lower neighbour degree), we also observed that poly-family-phagous butterflies tend to feed on host plants with a lower χ_i. This result reveals the relationship between χ_i and host selectivity.

Common specific compounds in host plants

To identify specific compounds shared by host plants of a particular butterfly family, we used Fisher’s exact test, which, analogous to its application in genome-wide association studies, is the simplest method for inferring associations or correlations. We extracted all plant–herbivore relationships involving at least one plant compound registered in KNApSAcK²⁵ from the original plant–herbivore network mentioned in the previous section, thereby obtaining 890 plant–herbivore relationships encompassing 217 butterfly species, 406 plant species and 3,507 plant compounds. Statistical significance based on Fisher’s exact test was evaluated using Cramer’s V; for example, given a sample size of 890 and a Cramer’s V value of ~0.6, the threshold P-value was set to 1.0 × 10⁻⁶.

Common specific compounds in host plants of Pierinae (family Pieridae), Lycaenidae, and Papilionae (family Papilionidae) butterflies are shown in Fig. 6a,b and c, respectively. Host plants of the Pierinae specifically possess a number of glucosinolates and flavonoids (Fig. 6a), whereas Papilionae (family Papilionidae) and Lycaenidae host plants are characterised by skimmianine and genistein, respectively (Fig. 6b,c). Among these identified compounds, glucosinolates are attractive to certain members of Pierinae but repel other butterflies^10,11,26. Some plants, such as members of Brassicaceae, have a glucosinolate-myrosinase chemical defence system (Fig. 7a); to avoid this system, some Pieridae butterflies have a nitrile-specifier protein (NSP) (Fig. 7b)¹⁴. In addition, skimmianine (Fig. 6c) has been reported to attract certain Papilionidae butterflies, whereas a study on the interactions between Papilionidae and Rutaceae families²⁷ has revealed that skimmianine has a prohibitive effect on these other butterflies. As clearly shown in the plant-compound matrix in Fig. 6d, plants belonging to the Brassicaceae and Rutaceae families typically possess glucosinolates and skimmianine, respectively. We found that genistein is specific to host plants of Lycaenidae butterflies; however, those plants belong to the family Fabaceae, which hosts a wide variety of Lycaenidae, Nymphalidae and Pieridae butterflies (Fig. 6e). Taking all of these results into consideration, we concluded that the identified common specific compounds contribute to the host-plant selectivity of mono-family-phagous butterflies.

**Figure 6: Compounds specific to host plants of a given butterfly family.**

**Figure 7: Known and deduced plant–herbivore metabolism of glucosinolates.**

Enzyme catalysis in plant–herbivore relationships

Because herbivorous butterflies incorporate plant chemicals whilst feeding, an herbivorous butterfly and its host plants can be generally assumed to share more compounds (C1 in Fig. 1g) than other butterfly–plant pairs. This assumption is consistent with the results of previous studies. For example, some insects have been found to use plant-derived chemicals as signalling materials or as precursors of chemical messengers (e.g., pheromones)¹⁵. In another study, volatile organic compounds produced by insects and plants had an overlap of 87%. Lepidoptera (butterflies and moths) members share significantly more aromatics than do Hymenoptera (bees and ants), whilst no significant difference in shared monoterpenes has been observed between the two orders²⁸.

In this study, we expanded the above assumption to compounds not shared between butterflies and their host plants (C2, C2′, C5 and C6 in Fig. 1g). Even when compounds in a butterfly are not exactly the same as those in host plants, they may be derived from the latter (C2 and C2′ in Fig. 1g) via the insect’s metabolism. To overcome this problem, we used the Pherobase²⁹ and KNApSAcK²⁵ databases to find potential substrate–product pairs for all possible combinations of compound pairs (see Methods for details)^30,31.

Our database search revealed four insect compounds that could possibly be such products (right-hand side of Fig. 7d–g). The corresponding substrates of all four compounds (Fig. 7d–g, left) are present in Brassicaceae plants; of these four compounds, three (Fig. 7d,e and g, left) were found to be Pieridae host-plant specific (Fig. 6a). We used the E-zyme webserver (see Methods) to search for known enzymatic reactions having the same chemical transformation patterns as the query reaction^32,33. Only one of the four reactions (Fig. 7g) was shown to have the same chemical transformation pattern as the known glucosinolate-myrosinase reaction (Fig. 7a). The remaining three reactions (Fig. 7d–f) had the same chemical transformation patterns as those of an avoidance reaction (Fig. 7b,c)^11,26. Among the possible products (Fig. 7d–g, right), only allylnitrile (Fig. 7e, right) was found to be possessed by a butterfly species, Pieris brassicae. No butterfly species possessed any other products, according to Pherobase. Although there is no direct evidence, we showed that the integration of plant–insect relationships, plant compounds and insect compounds provides indirect evidences to infer potential enzyme catalysis. If we integrate additional data, i.e., gene or protein sequence information, further analysis would be available.

To cope with glucosinolates, Pieridae family butterflies have developed the NSP protein³⁴. In the presence of NSP, insect myrosinase acts on glucosinolates to produce nitrile compounds (Fig. 7b,c) that are less toxic than the isothiocyanate compounds produced when NSP is absent (Fig. 7a). These two alternative reactions correspond to the reactions predicted in Fig. 7 (i.e., the reactions in Fig. 7d–f and Fig. 7g correspond to the myrosinase reaction in the presence or absence of NSP, respectively). This NSP protein is not found in any other organisms and is thought to be specific to Pieridae family butterflies.

We therefore searched genome and transcriptome sequences to determine whether myrosinase and NSP genes are present in a wider range of insects. At the time of our survey, the National Center for Biotechnology Information (NCBI) database contained 17 complete and 36 draft genome sequences of insect species, whilst Sequence Read Archive (SRA)³⁵ had genome and transcriptome sequences for 48 and 131 insect species, respectively. Although some protein sequences of Pieridae butterflies have been registered at NCBI, none of their genomes have yet been sequenced. We conducted a comprehensive search for myrosinase and NSP against all published insect genomic and transcriptomic data (see Methods). Myrosinase-like sequences were detected in all sequenced butterflies and moths (Bombyx mori, Chilo suppressalis, Danaus plexippus, Graphium sarpedon, Heliconius melpomene, Manduca sexta, Melitaea cinxia, Papilio glaucus, Papilio machaon, Papilio memnon, Papilio polytes, Papilio xuthus and Plutella xylostella), but no NSP-like sequences were detected. This result is consistent with butterfly host selection of glucosinolate-containing plants¹¹.

Discussion

We have demonstrated that insects selectively feed on closely-related plants containing common specific compounds. Nevertheless, distinguishing between taxonomic and compound effects on plant–herbivore interactions (or host-plant selection) is difficult because plants in the same family may share many compounds. To overcome this complication, more careful examinations may be required. In particular, phylogenetic signals must be removed when evaluating the association between chemical compounds and plant–herbivore interactions. In this context, though comparative phylogenetic analysis³⁶ is useful, this approach generally considers a simple evolutionary model in which species traits change in a Brownian fashion on a phylogenetic tree with accurate branch lengths. Phylogenetic comparative analysis was not applied in this study for two reasons. First, the phylogenetic tree was not accurate; thus, a phylogenetic comparative analysis may have led to misleading conclusions. Second, our dataset included only a few samples of specific plant–herbivore interactions and plant–compound relationships. In comparative phylogenetic analysis, a loss of statistical power is known to occur in small datasets³⁷. Although definite conclusions were not obtained, we expect that the effect of compounds on plant–herbivore interactions is dominant^14,15. In particular, only weak (although significant) phylogenetic signals have been observed in studies of plant–pollinator³⁸ and seed-dispersal³⁹ networks.

Our study has shown that the common specific compounds are mostly toxic compounds (Fig. 6), however, such common specific compounds were not found in Poaceae or Fabaceae, either of which is shared by some butterfly subfamilies in different families (Figs 3 and 4). This result may be due to the limited information in the examined databases but may also be explained by another hypothesis: Poaceae and Fabaceae do not possess particularly toxic compounds, and these less toxic plant species may serve as stepping stones during a transition to new host plants, because the butterflies were able to use their existing detoxification mechanisms to adapt to these plants. At least in Papilionidae, the diversification of butterfly species is thought to be correlated with changes in host plants⁹. Papilio machaon, one of many Papilionidae butterfly species that strongly depend on Rutaceae plants (Fig. 4), is thought to have evolved recently and feeds on a wide range of host plants, including Apiaceae. Rather than expanding to Apiaceae from Rutaceae plants in a single bound, a reasonable hypothesis is that Papilio machaon temporarily adopted less toxic hosts, such as Fabaceae plants, during its evolution.

When investigating host plant selection in this study, we only considered plant compounds; however, many other factors (such as hardness, leaf shape, and especially trichomes) are known to affect host plant edibility. Although these factors are described in field guides, encyclopaedias, and databases, their comprehensive and quantitative acquisition for use in informatics analysis is difficult. Moreover, the effects of phylogenetic signals were not considered in this study despite the importance of phylogenetic comparative analysis⁴⁰. This is because an accurate phylogenetic tree is still unavailable. However, the absence of this analysis poses little problem because several studies have reported that phylogenetic signals are weak in ecological networks^41,42. Despite these limitations, we have successfully shown that plant compounds generally contribute to the host-plant selection of phytophagous butterflies. Although molecular phylogeny is a powerful tool for understanding evolution, we strongly maintain that the integration of various types of data currently scattered across published articles and databases would enhance the study of natural evolutionary diversity.

Concluding remarks

Diversity is one of the most appealing characteristics of insects. Because of this diversity, however, unstructured knowledge is scattered throughout the literature and various databases, hindering comprehensive and systematic analyses. The strength of the current study is the integration of various types of data, including those pertaining to plant–herbivore relationships and plant–compound relationships, which enabled us to obtain a list of compounds that putatively contributed to host plant selection. By itself, this approach is insufficient to enable conclusions, but it can provide insights about molecular mechanisms drawn from the relationship data. Further enrichment of these relationship data would enhance and widen the scope of possible analyses, such as the prediction of host plant relationships and the inference of responsible genes or proteins using various machine-learning techniques.

Data and Methods

Plant–herbivore relationships

Although the HOSTS database (http://www.nhm.ac.uk/research-curation/research/projects/hostplants/) maintained by the Natural History Museum in the UK contains information on plant–herbivore relationships involving butterflies and their host plants worldwide, it lacks data on some relationships, especially those in the Japanese Islands. We therefore conducted a literature survey to collect plant–herbivore relationship information relevant to the Japanese Islands. The data collected and used in this study are summarised in the InsectInDB database (http://insect-plant.org/). The current InsectInDB release (as of January 2016) contains approximately 545 insects, 1,922 edible plants and 3,435 plant–herbivore relationships. InsectInDB provides links to plant–herbivore relationships and metabolomic and genomic data via the scientific names of insects or plants.

Plant-compound and insect-compound relationships

We collected plant compound data (Fig. 1c) using the KNApSAcK database²⁵. We used Pherobase (http://www.pherobase.com), a major online resource cataloguing pheromone compounds that influence the behaviour of insects and other animals²⁹, for the analysis of compounds common to insects and their host plants (Fig. 1g).

Interaction frequency between insect and plant families

To evaluate the strength of the interaction between insect family i and plant family j, we computed W_ij, the number of interactions between the insects in family i and the plants in family j appearing in the plant–herbivore network. We also evaluated the statistical significance of W_ij in the observed network using values from randomised networks (see below for details).

Randomised networks and significance

A comparison with randomised model networks was performed to evaluate the statistical significance of the observed network measures^24,43,44,45. We used the null model 2 ⁴⁶, which is similar to the fixed row-fixed column null model used by Bascompte et al.⁴⁷. This null model generates random bipartite networks with degree sequences identical to those of real networks. In the null model, the probability that a plant connects to an animal is proportional to the product of node degrees for a given plant or animal.

The significance of the network measure X was evaluated based on the Z-score: Z_X = (X_real – X_rand)/SD_rand, where X_real is the network measure of a real-world network, X_rand is the average value of the network measure, and SD_rand is the standard deviation obtained from 1,000 randomised model networks.

Family-based evenness index

To measure the diversity of host plants of a specific insect category, we used a family-based evenness index based on Pielou’s evenness index⁴⁸, which is a Shannon diversity index used to evaluate species evenness. In particular, the evenness index of insect category (species, subfamily or family) i was defined as follows:

where m_f indicates the set of plant families appearing in the plant–herbivore network and |m_f| is the number of plant families. S_i and N_ij are the number of plants interacting with insect category i (i.e., node degree of insect category i) and the number of plants in family j connecting to insect category i, respectively.

To characterise the overall trend of the evenness, we also considered the average family-based evenness:, where A and |A| denote the set and number of insect species, respectively.

Compound contribution to plant–herbivore networks

Drawing on terminology established for flavour networks²⁴, we defined a coefficient of compound contribution, χ_i, to evaluate the contribution of plant i to the feed-on-compound effect in plant–herbivore networks as follows:

where A and P denote sets of insect and plant species, respectively, A(i) and P(j) are the set of insects feeding on plant i and the set of plant species consumed by insect j, respectively, |X| is the size (number of elements) of set X, C_i is the number of compounds in plant i, f_i represents the number of insects feeding on plant i, and S (i) is the average number of plant species that serve as prey in A(i). The first term indicates the average degree of the overlap of compounds between plant i and other plants in A(i) for the observed data. The second term is the expected value of the average degree obtained when assuming no host plant selectivity (i.e., insects feed on randomly selected plants). Thus, a positive or negative χ_i indicates the positive or negative contribution, respectively, of plant i to the feed-on-compound effect.

Chemical compound-based reaction prediction

The reaction prediction based on chemical compounds consists of two steps. The first step is the problem of enzymatic reaction-likeness, i.e., whether a pair of metabolic compounds can be the substrate and product of a single enzymatic reaction³⁰. The second step is the prediction of the actual enzymes responsible for the putative enzymatic reactions³³. To accomplish the first step, we used known chemical transformation patterns encoded as SMIRKS strings⁴⁹ to search for pairs of metabolic compounds potentially possessing the same chemical transformation pattern. For the second step, we used the E-zyme³² and E-zyme2³³ webservers. E-zyme predicts putative enzyme classifications (referred to as Enzyme Commission numbers) with generated values reflecting the similarity between the given transformation patterns and known patterns. E-zyme2 predicts putative enzyme orthologues (i.e., homologous proteins/genes thought to have the same function). The values output by this program reflect the similarities of the transformation patterns as well as those of the surrounding conserved substructures.

Massive-scale sequencing data

DNA and RNA sequences were obtained from NCBI (http://www.ncbi.nlm.nih.gov/) and DDBJ SRA (http://trace.ddbj.nig.ac.jp/dra/index_e.html) databases³⁵. Raw RNA-sequencing data for each insect were downloaded from SRA and then assembled with Trinity⁵⁰, one of the most effective RNA-sequencing assemblers. The obtained Trinity contigs were used as training datasets in the AUGUSTUS program⁵¹ to predict protein-coding sequences in the corresponding insect genome downloaded from NCBI. This analysis generated a list of putative protein sequences encoded in the corresponding insect genomes.

Data Availability

The data collected in this study are summarised in a publicly accessible database, InsectInDB (http://insect-plant.org).

Additional Information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Jensen, N. B. et al. Convergent evolution in biosynthesis of cyanogenic defence compounds in plants and insects. Nat. Commun. 2, 273 (2011).
ADS PubMed Google Scholar
Gatehouse, J. A. Plant resistance towards insect herbivores: a dynamic interaction. New Phytol. 156, 145–169 (2002).
CAS PubMed Google Scholar
Konno, K. Plant latex and other exudates as plant defense systems: roles of various defense chemicals and proteins contained therein. Phytochemistry. 72 (13), 1510–1530 (2011).
CAS PubMed Google Scholar
Fürstenberg-Hägg, J., Zagrobelny, M. & Bak, S. Plant Defense against Insect Herbivores. Int. J. Mol. Sci. 14, 10242–10297 (2013).
PubMed PubMed Central Google Scholar
Barabási A.-L. Network science. Philos. Trans. R. Soc. A. 371, 20120375 (2013).
ADS Google Scholar
Bascompte, J. Structure and dynamics of ecological networks. Science. 329, 765–766 (2010).
ADS CAS PubMed Google Scholar
Allesina, S. & Tang, S. Stability criteria for complex ecosystems. Nature. 483, 205–208 (2012).
ADS CAS PubMed Google Scholar
Mougi, A. & Kondoh, M. Diversity of interaction types and ecological community stability. Science. 337, 349–351 (2012).
ADS MathSciNet CAS PubMed MATH Google Scholar
Thompson, J. N. Evolutionary genetics of oviposition preference in swallowtail butterflies. Evolution. 42, 1223–1234 (1988).
PubMed Google Scholar
Renwick, J. A. A., Radke, C. D., Sachdev-Gupta, K. & Städler, E. Leaf surface chemicals stimulating oviposition by Pieris rapae (Lepidoptera: Pieridae) on cabbage. Chemoecology. 3, 33–38 (1992).
CAS Google Scholar
Agerbirk, N., Olsen, C. E., Topbjerg, H. B. & Sørensen, J. C. Host plant-dependent metabolism of 4-hydroxybenzylglucosinolate in Pieris rapae: substrate specificity and effects of genetic modification and plant nitrile hydratase. Insect Biochem. Mol. Biol. 37, 1119–1130 (2007).
CAS PubMed Google Scholar
Nakayama, T. & Honda, K. Chemical basis for differential acceptance of two sympatric rutaceous plants by ovipositing females of a swallowtail butterfly, Papilio polytes (Lepidoptera, Papilionidae), Chemoecology. 14, 199–205 (2004).
CAS Google Scholar
Ozaki, K. et al. A gustatory receptor involved in host plant recognition for oviposition of a swallowtail butterfly. Nat. Commun. 2, 542 (2011).
ADS PubMed Google Scholar
Nishida, R. Chemical ecology of insect-plant interactions: ecological significance of plant secondary metabolites. Biosci. Biotechnol. Biochem. 78, 1–13 (2014).
CAS PubMed Google Scholar
Heidel-Fischer, H.-M. & Vogel, H. Molecular mechanisms of insect adaptation to plant secondary compounds. Curr. Opin. Insect Sci. 8, 8–14 (2015).
PubMed Google Scholar
Bass, C. et al. Gene amplification and microsatellite polymorphism underlie a recent insect host shift. Proc. Natl. Acad. Sci. USA 110, 19460–19465 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Bidart-Bouzat, M. G. & Kliebenstein . An ecological genomic approach challenging the paradigm of differential plant responses to specialist versus generalist insect herbivores. Oecologia. 167, 677–689 (2011).
ADS PubMed Google Scholar
Wiklund, C. Generalist vs. Specialist Oviposition Behaviour in Papilio Machaon (Lepidoptera) and Functional Aspects on the Hierarchy of Oviposition Preferences. Oikos, 36, 163–170 (1981).
Google Scholar
Tudor, O., Dennis, R. L. H., Greatorex-Davies, J. N. & Sparks, T. H. Flower preference of woodland butterflies in the UK: nectaring specialists are species of conservation concern. Biol. Conserv. 119, 397–403 (2004).
Google Scholar
Ehrlich, P. R. & Raven, P. H. Butterflies and plants: A study in coevolution. Evolution 18, 586–608 (1964).
Google Scholar
Ferrer-Paris, J. R., Sánchez-Mercado, A., Viloria, A. L. & Donaldson, J. Congruence and Diversity of Butterfly-Host Plant Associations at Higher Taxonomic Levels. PLoS One. 8, e63570 (2013).
ADS CAS PubMed PubMed Central Google Scholar
The Angiosperm Phylogeny group. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Bot. J. Linn. Soc. 161, 105–121 (2009).
De Jong, R., Vane-Wright, R. I. & Ackery, P. R. The higher classification of butterflies (Lepidoptera): problems and prospects. Insect Systematics & Evolution, 27, 1, 65–101 (1996).
Google Scholar
Ahn, Y. Y., Ahnert, S. E., Bagrow, J. P. & Barabási, A. L. Flavor network and the principles of food pairing. Sci. Rep. 1, 196 (2011).
CAS PubMed PubMed Central Google Scholar
Afendi, F. M. et al. KNApSAcK family databases: integrated metabolite-plant species databases for multifaceted plant research. Plant Cell Physiol. 53, e1 (2012).
CAS PubMed Google Scholar
Nastruzzi, C. et al. In vitro cytotoxic activity of some glucosinolate-derived products generated by myrosinase hydrolysis. J. Agric. Food Chem. 44, 1014–1021 (1996).
CAS Google Scholar
Heininger, E. E. Effects of furocoumarin and furoquinoline allelochemicals on host plant utilization by Papilionidae. Graduate Dissertations and Theses at Illinois. (1989).
Schiestl, F. P. The evolution of floral scent and insect chemical communication. Ecol. Lett. 13, 643–656 (2010).
PubMed Google Scholar
Symonds, M. R. & Elgar, M. A. The evolution of pheromone diversity. Trends Ecol. Evol. 23, 220–228 (2008).
PubMed Google Scholar
Kotera, M. et al. Metabolome-scale prediction of intermediate compounds in multistep metabolic pathways with a recursive supervised approach. Bioinformatics. 30, i165–i174 (2014).
ADS CAS PubMed PubMed Central Google Scholar
Yamanishi, Y., Tabei, Y. & Kotera, M. Metabolome-scale de novo pathway reconstruction using regioisomer-sensitive graph alignments. Bioinformatics. 31, i161–i170 (2015).
CAS PubMed PubMed Central Google Scholar
Yamanishi, Y., Hattori, M., Kotera, M., Goto, S. & Kanehisa, M. E-zyme: predicting potential EC numbers from the chemical transformation pattern of substrate-product pairs. Bioinformatics. 25, i179–86 (2009).
CAS PubMed PubMed Central Google Scholar
Moriya, Y. et al. Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs. J. Chem. Inf. Model. 56, 510–516 (2016).
CAS PubMed Google Scholar
Wheat, C. W., Vogel, H., Wittstock, U., Braby, M. F., Underwood, D. & Mitchell-Olds, T. The genetic basis of a plant-insect coevolutionary key innovation. Proc. Natl. Acad. Sci. USA 104, 20427–204231 (2007).
ADS CAS PubMed PubMed Central Google Scholar
Nakazato, T., Ohta, T. & Bono, H. Experimental design-based functional mining and characterization of high-throughput sequencing data in the sequence read archive. PLoS One. 8, e77910 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Garland, T., Bennett, A. F. & Rezende, E.-L. Phylogenetic approaches in comparative physiology. J. Exp. Biol. 208, 3015–3035 (2005).
PubMed Google Scholar
Griffith, O. L., Moodie, G. E. E. & Civetta, A. Genome size and longevity in fish. Exp. Gerontol. 38, 333–337 (2003).
CAS PubMed Google Scholar
Rezende, E. L., Jordano, P. & Bascompte, J. Effects of phenotypic comple- mentarity and phylogeny on the nested structure of mutualistic networks. Oikos. 116, 1919–1929 (2007).
Google Scholar
Schleuning, M. et al. Ecological, historical and evolutionary determinants of modularity in weighted seed-dispersal networks. Ecol. Lett. 17, 454–463 (2014).
PubMed Google Scholar
Garland, T., Bennett, A. F. & Rezende, E. L. Phylogenetic approaches in comparative physiology. J. Exp. Biol. 208, 3015–3035 (2005).
PubMed Google Scholar
Rezende, E. L., Jordano, P. & Bascompte, J. Effects of phenotypic complementarity and phylogeny on the nested structure of mutualistic networks. Oikos. 116, 1919–1929 (2007).
Google Scholar
Schleuning, M., Ingmann, L., Strau, R., Fritz, S. A., Dalsgaard, B. & Matthias Dehling, D. et al. Ecological, historical and evolutionary determinants of modularity in weighted seed-dispersal networks. Ecol Lett. 17, 454–463 (2014).
PubMed Google Scholar
Thébault, E. & Fontaine, C. Stability of ecological communities and the architecture of mutualistic and trophic networks. Science. 329, 853–856 (2010).
ADS PubMed Google Scholar
Toju, H., Guimarães, P. R., Olesen, J. M. & Thompson, J. N. Assembly of complex plant-fungus networks. Nat. Commun. 5, 5273 (2014).
ADS CAS PubMed Google Scholar
Feng, W. & Takemoto, K. Heterogeneity in ecological mutualistic networks dominantly determines community stability. Sci. Rep. 4, 5912 (2014).
ADS CAS PubMed PubMed Central Google Scholar
Bascompte, J., Jordano, P., Melián, C. J. & Olesen, J. M. The nested assembly of plant-animal mutualistic networks. Proc. Natl. Acad. Sci. USA 100, 9383–9387 (2003).
ADS CAS PubMed PubMed Central Google Scholar
Almeida-Neto, M., Guimaraes, P., Guimaraes P. R., Loyola, R. D. & Ulrich, W. A consistent metric for nestedness analysis in ecological systems: reconciling concept and measurement. Oikos 117, 1227–1239 (2008).
Google Scholar
Hill, M. O. Diversity and Evenness: A Unifying Notation and Its Consequences. Ecology. 54, 427–432 (1973).
Google Scholar
Daylight Theory Manual, Chapter 5. http://www.daylight.com/dayhtml/doc/theory/theory.smirks.html (accessed August 1, 2016).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
CAS PubMed Google Scholar
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–439 (2006).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was financially supported by the Ministry of Education, Culture, Sports, Science and Technology of Japan (MEXT), the Japan Society for the Promotion of Science (JSPS) (MEXT/JSPS Kakenhi grant number 25870379), and the Japan Science and Technology Agency. This work was also supported by the JST/MEXT Programme to Promote the Tenure Track System in Tokyo Institute of Technology.

Author information

Ai Muto-Fujita and Kazuhiro Takemoto: These authors contributed equally to this work.

Authors and Affiliations

Graduate School of Biological Sciences, Nara Institute of Science and Technology (NAIST), 8916-5, Takayama, Ikoma, 630-0192, Nara, Japan
Ai Muto-Fujita
Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Kawazu 680-4, Iizuka, 820-8502, Fukuoka, Japan
Kazuhiro Takemoto
Graduate School of Information Sciences, Nara Institute of Science and Technology (NAIST), 8916-5, Ikoma, Takayama, 630-0192, Nara, Japan
Shigehiko Kanaya
Database Center for Life Science (DBCLS), Research Organization of Information and Systems, Yata 1111, Mishima, 411-8540, Shizuoka, Japan
Takeru Nakazato
DDBJ Center, National Institute of Genetics, Research Organization of Information and Systems, Yata 1111, Mishima, 411-8540, Shizuoka, Japan
Toshiaki Tokimatsu
Neko-System Inc., 8-31, Konyamachi, Takatsuki, 569-0804, Osaka, Japan
Natsushi Matsumoto
School of Life Science and Technology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, 152-8550, Tokyo, Japan
Mayo Kono, Yuko Chubachi & Masaaki Kotera
JT Biohistory Research Hall, 1-1 Murasaki-cho, Takatsuki, 569-1125, Osaka, Japan
Katsuhisa Ozaki

Authors

Ai Muto-Fujita
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Takemoto
View author publications
You can also search for this author in PubMed Google Scholar
Shigehiko Kanaya
View author publications
You can also search for this author in PubMed Google Scholar
Takeru Nakazato
View author publications
You can also search for this author in PubMed Google Scholar
Toshiaki Tokimatsu
View author publications
You can also search for this author in PubMed Google Scholar
Natsushi Matsumoto
View author publications
You can also search for this author in PubMed Google Scholar
Mayo Kono
View author publications
You can also search for this author in PubMed Google Scholar
Yuko Chubachi
View author publications
You can also search for this author in PubMed Google Scholar
Katsuhisa Ozaki
View author publications
You can also search for this author in PubMed Google Scholar
Masaaki Kotera
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M., K.T., K.O. and M. Kotera designed the study. K.T. and M. Kotera performed the network analysis. A.M. and M. Kotera performed the metabolomic and genomic analyses and interpretation. K.O., T.N. and T.T. provided advice on the analyses. S.K. and N.M. helped with data integration and management. Y.C. and M. Kono collected and organised information on phytophagous relationships. A.M., K.T., and M. Kotera drafted the manuscript. All authors completed and approved the final manuscript.

Corresponding authors

Correspondence to Katsuhisa Ozaki or Masaaki Kotera.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Muto-Fujita, A., Takemoto, K., Kanaya, S. et al. Data integration aids understanding of butterfly–host plant networks. Sci Rep 7, 43368 (2017). https://doi.org/10.1038/srep43368

Download citation

Received: 21 October 2016
Accepted: 23 January 2017
Published: 06 March 2017
DOI: https://doi.org/10.1038/srep43368

This article is cited by

Polyhydroxy Acids as Fabaceous Plant Components Induce Oviposition of the Common Grass Yellow Butterfly, Eurema Mandarina
- Chisato Matsunaga
- Naoki Kanazawa
- Hisashi Ômura
Journal of Chemical Ecology (2023)
Oviposition stimulants underlying different preferences between host races in the leaf-mining moth Acrocercops transecta (Lepidoptera: Gracillariidae)
- Tomoko Katte
- Shota Shimoda
- Hajime Ono
Scientific Reports (2022)
LepTraits 1.0 A globally comprehensive dataset of butterfly traits
- Vaughn Shirey
- Elise Larsen
- Leslie Ries
Scientific Data (2022)
Machine learning for the meta-analyses of microbial pathogens’ volatile signatures
- Susana I. C. J. Palma
- Ana P. Traguedo
- Ana C. A. Roque
Scientific Reports (2018)
Oviposition inhibitor in umbelliferous medicinal plants for the common yellow swallowtail (Papilio machaon)
- Chisato Morino
- Yusuke Morita
- Ken Tanaka
Journal of Natural Medicines (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.