Introduction

Influenza A viruses (IAVs) belong to the family Orthomyxoviridae. Based on the antigenic properties of two surface glycoproteins hemagglutinin (HA) and neuraminidase (NA), IAVs are clustered into 18 HA (H1–H18) and 11 NA (N1–N11) subtypes1,2,3. The ecology of IAVs is complicated involving multiple host species and viral genes. So far, except that H17, H18, N10 and N11 were restrictively identified from bat samples in forms of H17N10 and H18N114, all other subtypes viruses can circulate in avian species1,2,5, aquatic birds (or waterfowls) in particular, which are therefore considered the natural reservoir of IAVs6,7,8,9,10,11. Occasionally, avian influenza A viruses (AIVs) can transmit to mammals from avian species, which may lead to the development of human pandemic strains by direct or indirect transmission.

A successful transmission between species depends on both host and virus factors, and some period of adaptation of the virus to the new species. Many host factors interacting with the component proteins of IAVs have been identified and their role in the host range expansion and interspecies transmission has been clearly stated8,9. Several viral proteins of IAVs are also known to be responsible for host adaptation or interspecies transmission6,12,13,14,15,16,17,18, of which HA membrane protein is the major determinant for crossing the species barrier8,19,20. Binding to sialic acid receptor, HA initiates fusion of the viral envelope with the host cell membrane5,21,22. The sialic acid receptor can be linked to galactose by α2,6-linkages (SAα2,6 Gal) or α2,3-linkages (SAα2,3 Gal). It is generally believed that HAs of AIVs preferentially bind to SAα2,3 Gal on intestinal epithelial cells of aquatic birds, whereas the HAs of human IAVs prefer SAα2,6 Gal on tracheal epithelium. The adaptation of AIVs to human or other mammalian hosts (mammalian influenza A viruses are abbreviated as MIVs in this study) is connected with a switch in HA ability to bind SAα2,6 Gal instead of SAα2,3 Gal20,23,24,25,26,27,28. In pig tracheal epithelium, there exist both SAα2,3 Gal and SAα2,6 Gal; HA of both AIVs and human influenza viruses may find the receptors. Given these features, pigs are considered as a plausible intermediate host for the generation of human pandemic strains by gene reassortment5,29,30,31,32,33,34,35. This potential to generate novel influenza viruses has resulted in swine being labelled ‘mixing vessels’36,37,38.

The variety of AIVs combined with the high ability of adaptation constitutes the main risk factor for crossing the species barriers, but it is difficult to predict which virus might induce a human pandemic8,39. In order to identify precursor viruses of potential pandemics, an active surveillance and collecting of AIVs from different species, especially the species that can be served as mixing vessels, are crucial. In this study, through screening the host originations of all subtypes of HA and NA sequences available in public databases, we analyzed their host tropisms and attempted to provide the target hosts other than pigs for surveillance of influenza pandemics.

Methods

Screen and count the host originations of IAVs

As mentioned above, HA can initiate fusion of the viral envelope with the host cell membrane, which is the prerequisite for viral replication and transmission. While the balance between HA receptor-binding affinity and NA receptor destroying activity is critical for the efficient growth of IAVs, NA also contributes to influenza virus species specificity40,41. Therefore, HA and NA nucleotide sequences were analyzed in this study.

The host originations of HA and NA nucleotide sequences were screened and counted in two databases, the Influenza Virus Resource of NCBI (http://www.ncbi.nlm.nih.gov/genomes/FLU/aboutdatabase.html) and the Global Initiative on Sharing Avian Influenza Data (GISAID, http://platform.gisaid.org/epi3/frontend) by the end of March 12, 2018. NCBI was used as the main database while the latter was used as a supplementary under the set of only GISAID uploaded isolates. Two screen strategies were engaged in this study. Considering that large amount of isolates of IAV were not sequenced and submitted completely to the databases, the host originations of HA and NA sequences were screened separately. The other strategy that the host originations were screened and counted by subtypes of HxNy (x = 1, 2, 3…18, y = 1, 2, 3…11), was carried out when the preferential HA-NA balances of IAVs were taken into account.

Measure for α-diversity indices of IAVs established from mammalian hosts

Microenvironment in host animal provides the material basis for the growth and proliferation of viruses; at the same time, antibodies and receptor types can also restrict the IAV infections. Relationship or interaction between the microenvironment of a host and viruses is somewhat similar to that between ecological environment and the populations living in it. When a host can be infected with different subtypes of IAVs, it is more likely to be a mixing vessel or natural reservoir for the viruses. Each subtype of HA or NA was regarded as a species population, while the sequence frequencies recorded in the databases were regarded as the observed individuals of the corresponding populations. We can use use some ecological indices such as species diversity, richness, and evenness of IAVs within a species of mammalian host to measure the complexity of a relationship between host microenvironment and IAVs42:

  1. (1)

    Margalef (1951, 1957, and 1958) index (focuses on richness)

    $${{D}}_{r}=({\rm{S}}-1)/\mathrm{lnN}$$

    S is the total subtype number of HA or NA that established from a species of mammalian hosts, and N is the total sequence frequency of HA or NA established from this host species.

  2. (2)

    Simpson’s index (focuses on dominance)

    $${{D}}_{s}=1-{{\rm{\Sigma }}\mathrm{Pi}}^{2}$$

    Pi refers to the ratios of the number of ni subtype of HA or NA to the total sequence frequency of HA or NA established from a species of mammalian hosts, i.e., Pi = ni/N.

  3. (3)

    Shannon-wiener diversity index

    $$H^{\prime} =-\,{\rm{\Sigma }}\mathrm{PilnPi}$$

    The meaning of Pi is the same as above.

  4. (4)

    Pielou evenness index

$$E={\rm{H}}/{\rm{lnS}}$$

H is the observed species diversity index, which equals to Shannon-wiener index H′, i. e., H = H′ = −ΣPilnPi.

Sequence analyses for focused HAs and NAs of AIVs

A species of mammalian host may have the tendency to become a mixing vessel or natural reservoir for human IAVs or other MIVs if they can be infected with a large number of subtypes of IAVs, AIVs in particular. We further studies the species of mammalian hosts infected with the IAVs that had the highest richness, diversities, and evenness. After consulting the information in GenBank and GISAID in detail, tracking references, and removing repeated sequence submissions, each of the HA and NA sequences of AIVs was analyzed by using the Basic Local Alignment Search Tool (BLAST, https://blast.ncbi.nlm.nih.gov/Blast.cgi) with the set of Max target sequences being 1000, and then, every 1000 sequences were downloaded to a local computer. After alignment by FFT-NS-2 methods in multiple alignment program for amino acid or nucleotide sequences (MAFFT version 7, https://mafft.cbrc.jp), they were translated and compared by the MegAlign module of the Lasergene 7.0 software. For HAs, the parts of sequences encoding the signal peptides were cut off beforehand corresponding to each reference sequence of the respective subtype (https://www.ncbi.nlm.nih.gov/refseq). The comparisons were carried out between each sequence of HA or NA and its 999 most similar sequences, and variations on the sites that are known as being relevant to the host tropism of IAVs were focused on3,23,24,39,43,44,45,46,47,48,49,50,51.

Ethics approval

This study is a serial of phylogenetic analyses based on large scale of existing gene sequences; all these sequences can be searched and downloaded from two public databases, the NCBI Influenza Virus Sequence Database and the Global Initiative on Sharing Avian Influenza Data (GISAID) database. No institutional review board approval was required from the research ethics committee of School of Public Health, Fudan University, and animals’ ethics approval was applicable neither.

Results

From H1 to H18, and from N1 to N11, the ratios of sequences with mammalian host origination to those with avian host origination are displayed in Supplementary Figs 1, 2 and Supplementary Tables 13. Except for 65 sequences (33 HA and 32 NA) that were labeled as mammalian origination but no definite species records, 26 species of nonhuman mammal hosts of IAVs were retrieved from the databases. Further checking confirmed that the hosts labeled as feline are cats rather than the taxonomic family of feline. Bovine and mouse had entries but no sequence records. Thus, 23 species of nonhuman mammals were included for the subsequent analysis.

The mammalian species of bat, boar, camel, canine, cat, equine, ferret, mink, muskrat, seal, swine, and whale can be infected by more than one subtype of IAVs. For a long time, swine is considered as a mixing vessel for reassortment or recombination of IAVs. Although isolates established from swine are indeed abundant, the subtypes of them are restricted mainly to MIVs, of which, H1, H3, N1 and N2 account for the overwhelming majority (99·18% of HAs and 99·58% of NAs). The α-diversity related indices including the Shannon-wiener index, the Simpson’s diversity index, the Margalef richness, and the Pielou evenness and they were 0.88, 0.38, 0.94 and 0.27 for HAs that derived from swine, and were 1·00, 0·49, 0·64, and 0·36 for NAs. For HAs, the indices were even lower in swine as compared with those in cat, ferret, camel, bat and muskrat, and for NAs, they were not higher in swine than those in cat and camel. It seemed that swine can only be infected with limited subtypes of IAVs, and sporadic infections caused by subtypes other than H1N1, H1N2 and H3N2 occasionally occurred by chance of accidental spillover. The same happened in dogs and horses. Although the sequences of HA and NA established from them were abundant enough, the subtypes of IAVs were restricted to one or more specific subtypes of MIVs, of which H3N8 and H3N2 accounted for the overwhelming majority.

Interestingly, a neglected mammalian host, mink, was infected by more subtypes of IAVs. Isolates including both MIVs (H3N2 and H1N1) and AIVs (H5N1, H9N2, and H10N4), had considerably higher α-diversity related indices. The Shannon-wiener index, the Simpson’s diversity index, the Margalef richness, and the Pielou evenness were 2·20, 0·77, 1·56, and 0·95 for HAs, and 1·46, 0·61, 0·81, and 0·92 for NAs. The α-diversity related indices of HAs and NAs derived from different mammalian hosts are displayed in Table 1 and Fig. 1.

Table 1 The α diversity related indices of HA and NA derived from mammalian hosts.
Figure 1
figure 1

Shannon-wiener index of IAVs’ HA and NA derived from different mammals*. *Only those are greater than zero are shown. As for HA (left side), from high to low are mink, cat, ferret, camel, bat, seal, muskrat, swine, boar, canine, equine, in turn; as for NA (right side), they are cat, mink, canine, seal, swine, whale, camel, bat, muskrat, boar, ferret, equine, respectively.

Fourteen HA and thirteen NA sequences were found to be established from minks, of which nine pairs of HA and NA were of the typical AIVs, including two H10N4, three H5N1, and four H9N2. BLAST analysis showed that one variation of G212R in two strains of H10N4, and one variation of N173H in the isolate China/01/2014 (H9N2) might involve the binding epitopes of the globular head of HA protein, and the rest variations did not site in the known binding epitopes of the host cell receptor (Table 2).

Table 2 Variations of HAs and NAs of AIVs established from minks compared to majorities.

Discussion

Swine is an important host for IAVs for reasons of being involved in genetic reassortment and interspecies transmission. In swine population, H1N1, H3N2, H1N2 viruses are circulating worldwide, and most swine influenza viruses (SIVs) are reassortants originated from human, avian and swine influenza viruses24,52,53. However, our study indicated that the spillover infections of swine occurred only occasionally. Rare spillover infections were similar for dogs, horses and cats.

Remarkably, our results suggest that mink should be taken more seriously in influenza surveillance. Mink (Neovison vison) is a semiaquatic mammal (or riverside mammal, mammals occurring close to the water and sometimes within it, such as Neovison vison, Lutra lutra, Delphinidae, and Phocidae) species of the genus Mustela of the family Mustelidae; there are 15 subspecies of mink widely distributed in the Americas or being introduced into other continents54. IAVs including both AIVs (H9N2, H5N1, and H10N4) and MIVs (H3N2 and H1N1), were isolated from minks with the highest species/subtype diversities, richness and evenness. Influenza A has caused several outbreaks in minks55,56,57,58. The same strain of MIV or AIV can be repeatedly established during one outbreak55,58, and different subtypes of AIVs also can be isolated from an outbreak in the same period and same breeding farm59,60. All these testimonies prove the susceptibility of mink to IAVs and the transmission features within the populations. Peng et al. and Yu et al. have reported that receptors in tracheal epithelium of mink are mainly linked to SAα2,6 Gal, but receptors of SAα2,3 Gal and SAα2,6 Gal are detected equally or with predominance to SAα2,3 Gal in gastrointestinal mucosa of it58,61. As we know, HAs of AIVs preferential receptors of SAα2,3 Gal are coincidentally on intestinal epithelial cells of aquatic birds26,31,62. Such a molecular basis of the existence of both AIVs and MIVs specific receptors within minks, as well as their characteristic distribution, imply that minks not only could infect with MIVs by intra-tracheal inoculation or horizontal transmission from other minks within populations, but also could infect with AIVs either by eating (preying or feeding) on virus-infected birds or by faecal-oral route within habitat environment63,64. These findings suggest minks may be another intermediate host to spread the virus from wild waterfowls to human. Mink infection may contribute to the adaptation of AIVs to human and other mammals by genetic reassortment or other mechanisms.

From the viewpoint of niche, habitats of riverside mammals such as minks and wild waterfowls overlap each other, which greatly facilitate the interspecies transmission among them. Possibly, some other species of riverside mammals, in addition to terrestrial and domesticated pigs, might also have this potential. Waterfowls have long been considered natural gene pools for IAVs. While the receptors on the surface of gastrointestinal mucosa can recur the infections caused by AIVs within waterfowls, minks may be of significance in sustaining IAVs’ genes and the species may be both a mixing vessel and natural reservoir for IAVs. Co-infection greatly increases the chances of generating novel viruses through genetic reassortment or recombination, which can introduce a novel subtype of IAV in human population. In a free stall barn system, usually in some areas of South Asia, Southeast Asia, Southern and Eastern China, the traditional methods of free-range or outdoor breeding always exacerbates the risk of infection of poultry and backyard livestock through contact with contaminated water or feces65,66,67. The emergence of novel IAVs can lead to a rapid epidemic within terrestrial animals or human. Circulation of IAVs among minks or other riverside mammals, waterfowl, domestic poultry, terrestrial mammals and human is illustrated in Fig. 2.

Figure 2
figure 2

The illustration of adaptation and transmission of Human AIVs. An adaptation from AIVs to human AIVs includes two circulations, the aquatic habitat circulation and the land habitat circulation. In the aquatic habitat circulation, AIVs are transmitting, mutating, and adapting between aquatic birds and minks (as well as other semiaquatic mammals). This adaptation may or may not change their infectivity to avian, but can significantly increase the infectivity to human and terrestrial mammals. Poultries such as duck, goose can be infected through contacting with epidemic water. In a free stall barn system, usually in some areas of South Asia, Southeast Asia, Southern and Eastern China, it will inevitably lead to a land habitat circulation including human beings. The blue pathway is transmitted by faecal-oral route, while the red one is transmitted by intra-tracheal inoculation. The conception for this scene is partly based on the observation of daily lives; e.g., in rural areas of South Asia, Southeast Asia, Southern and Eastern China, pigs and poultries, in particular chick, are often observed to eat each other’s feces. Pigs also eat duck feces, but ducks seldom eat pig feces; and partly based on the available reports, e.g., human infected by a human-adapted AIV from the live poultry market was often reported in China.

This study demonstrates that mink (Neovison vison) might be a potential mixing vessel or intermediate host for the generation of novel human IAVs. Minks, possibly some other semiaquatic mammals (riverside mammals) as well, might play a pivotal role in the process of adapting and transmitting AIVs to human and other terrestrial animals. The significances of mink and other riverside mammal hosts in influenza surveillance and early warning should be paid an attention. In epidemic areas, mink should be considered as one of important sentinel species of hosts for influenza surveillance.

There are several limitations of our study should be mentioned. In this study, we only used the existing databases with no additional laboratory evidence. Secondly, the number of IAVs established from the mammalian species here including those in minks is still small. Hence, our conclusions need to be consolidated.