Introduction

Wild waterfowl (Anseriformes) and shorebirds (Charadriiformes) are the known reservoir of influenza A viruses (IAVs) of the Orthomyxoviridae family1. Barring the two hemagglutinin (HA; H17 and H18) and two neuraminidase (NA; N10 and N11) subtypes recently discovered in bats, these birds carry all known IAV subtypes (H1–H16 and N1–N9). Wild waterfowl and shorebirds can carry IAV genotypes across flyways, and, to a lesser degree, across oceans2. Open farming allows the sharing of food and water resources between wild and domestic waterfowl that may lead to the transmission of IAVs from wild to domestic birds3. Domestic waterfowl, especially ducks, can then act as intermediate hosts for the transmission of IAVs to poultry, primarily chicken (Galliformes).

Certain H5 and H7 IAV strains can cause severe disease in chicken and domestic ducks and are thus called highly pathogenic avian influenza viruses (HPAIVs)4. Poultry infected with HPAIVs exhibit severe clinical signs, including paralysis and sudden death, with up to 100% mortality during outbreaks. Because HPAIVs can rapidly spread within a farm and to adjacent farms, HPAIV outbreak control requires culling all birds in affected farms, thereby causing huge economic losses5. The effects of HPAIVs to wild birds are traditionally considered mild. However, reports of HPAIV-related deaths in wild birds have been growing in number, warranting the reassessment of the effects of HPAIVs to wild birds, especially to protected species6,7. HPAIV outbreaks among poultry also expose poultry farm workers to the virus and can cause a range of mild to severe flu symptoms in humans, with approximately 50% case fatality for the H5N1 HPAIV subtype8. Although HPAIVs have not yet caused sustained chains of human-to-human transmission, they are regarded as potential pandemic threats. Meanwhile, low pathogenic avian influenza viruses (LPAIVs) are also known to contribute genetic material to pandemic influenza virus strains and are thus considered precursors of future pandemic influenza viruses9. Genetic material from LPAIVs can also be used to design vaccines for poultry. Therefore, the surveillance of circulating avian influenza viruses, including LPAIVs, is imperative to the understanding of IAV ecology, which impacts both animal and human health.

South Korea has experienced HPAIV outbreaks that have been linked to circulating HPAIVs or precursors carried by migratory wild birds10,11,12. In 2008, the government of South Korea implemented a national influenza virus surveillance program to identify IAVs in birds that have the potential to cause HPAIV outbreaks among poultry and may be a threat to humans13. Here, we report the isolation and phylogenetic analyses of avian influenza viruses from fecal matter of wild birds collected in migratory bird stopover sites in South Korea in winters throughout November 2014 to January 2018 as part of the nationwide surveillance program.

Results

Prevalence of avian influenza viruses in South Korea in the winters of 2014 to 2018

Samples of fresh bird fecal matter were collected from different sites in South Korea (Fig. 1a) during winters within November 2014 to January 2018. Each winter was defined as November to January of two consecutive years (i.e., November 2014 to January 2015 was defined as the winter of 2014–2015). The sampling sites are known stopover points of migratory birds, especially those coming from Far East Siberia during the annual winter migration along the East Asian-Australasian Flyway (EAAF). As in our previous study that covered winters of 2009 to 201314, the stopover sites were selected for their proximity to previous sites of HPAI outbreaks. A total of 6758 fecal samples were collected throughout the study (Fig. 1a, Table 1). Of these, 75 samples tested positive for IAV through the standard chicken allantoic fluid isolation method and RT-PCR, indicating isolation from 1.11% of the total number of samples for the entire duration of the study. Percent IAV-positivity of samples were highly variable per site and per year (Fig. 1a,b, Table 1), with the winter of 2014–2015 having the highest percent positivity (2.55%), although all the IAV-positive samples that season came from only one of two sampling sites. The winter season with the lowest percent IAV-positivity was 2015–2016 (0.28%), despite a relatively large number of collected samples. Not all sites yielded positive samples, as no IAV was isolated from wild bird fecal matter collected at Gocheonam Reservoir in Jeonnam Province, in contrast with our 2009–2013 study where we reported isolation of IAVs from this site.

Figure 1
figure 1

Isolation of influenza A viruses (IAVs) from fecal matter collected at five migratory bird stopover sites in South Korea (winters of November 2014 to January 2018). (a) Fecal samples were collected at five migratory bird stopover sites during the winter seasons from November 2014 to January 2018. The number of samples that tested positive for IAV over the total number of samples collected per site are indicated. Numbers in parentheses indicate percentage of IAV-positive samples per site. (b) Number of samples that tested positive for IAV per year over the total number of samples collected per year are indicated. Numbers in parentheses indicate percentage of IAV-positive samples per year. Distribution of positive samples per site per year are also shown (color-matched with a).

Table 1 Number of isolates of avian influenza viruses per year and per site in South Korea in winters within November 2014 to January 2018.

HA and NA subtype prevalence and combinations

Not all HA and NA subtypes and combinations thereof were represented over the duration of the study (Fig. 2a–c, Supplementary Table S1S3). For the study period, the most common HA subtypes were H1 (n = 23; 30.67%), H6 (n = 22; 29.33%), and H5 (n = 14; 18.67%) (Fig. 2a, Supplementary Table S1). No virus of the H4, H9, or H10 subtype was identified. However, we were unable to identify the HA subtypes of 3 samples (HX). Different years also had different dominant HA subtypes; the H1 subtype was the most prevalent in 2014–2015, while the H6 subtype was the most prevalent in the succeeding years. The H5 subtype was also among the most prevalent subtypes in most years, except in 2015–2016, when no H5-subtype IAV was isolated.

Figure 2
figure 2

Subtypes of influenza A isolates collected in migratory bird stopover sites in South Korea during winter from November 2014 to January 2018. (a) Hemagglutinin (HA), (b) neuraminidase (NA) subtypes, and (c) combinations thereof of all influenza A virus isolates collected over the entire surveillance period. (d) Host species carrying the influenza A virus isolates for winter 2016–2017 were determined through sequencing of the COX1 gene amplified from DNA isolated from the fecal samples.

For the duration of the study, the most common NA subtypes were N1 (n = 32; 42.67%), N3 (n = 17; 22.67%), and N2 (n = 12; 16.00%) (Fig. 2b, Supplementary Table S2). No virus belonging to the N6, N10, and N11 subtype was isolated; however, the NA subtype of one sample could not be identified (NX). Isolates belonging to the N1 subtype were the most prevalent in the winters of 2014–2015, 2016–2017, and 2017–2018, whereas isolates of the N2 subtype were the most prevalent during the winter of 2015–2016. The most common subtype combinations were H1N1 (n = 17; 22.67%), H6N1 (n = 13; 17.33%), H5N3 (n = 11; 14.67), and H6N2 (n = 8; 10.67%) (Fig. 2c, Supplementary Table S3). The prevalent subtype combinations varied per year, with H1N1 as the most prevalent subtype in 2014–2015; H6N2 in 2015–2016; and H6N1 in 2016–2017 and 2017–2018.

HA and NA sequence identities

The HA and NA sequences for the isolates were assembled to generate full and partial coding sequences. These sequences were deposited to GenBank (Supplementary Table S4) and subjected to BLASTn searches in both NCBI and GISAID to identify reported HA and NA isolates that were most similar to our isolates (Supplementary Table S5). As expected, our isolates matched with samples collected from either South Korea or nearby locations (China, Japan, and Mongolia), all of which are part of the EAAF. The closest sequence matches were isolated within similar periods of time (2014–2021). None of the H5 and H7 sequences matched with reported HPAIVs, suggesting that all isolated viruses in this study were LPAIVs. A closer look at the H5 sequences showed the lack of the multibasic HA cleavage site characteristic of HPAIVs (Supplementary Fig. S1a), further indicating that our H5 isolates were LPAI. However, all the H5 isolates carried reported markers for potential mammalian transmission (Supplementary Table S6) as indicated by the “Identify Sequence Features in Segments” tool in the Influenza Research Database. Specifically, majority of the H5 isolates carried the N171, A172, and P251 variations, which were all reported to enhance HA binding to the human ⍺2,6-linked sialic acid15,16. The algorithm did not report notable variations at the 226 amino acid position in the H5 isolates. Meanwhile, the HA amino acid sequences of the two H7 isolates in this study as well as the closely related A/crane/Kor/17 (H7N7) also lacked the multibasic cleavage site, further verifying that they were LPAIVs (Supplementary Fig. S1b).

Host species identification for IAVs isolated in the winter of 2016–2017

To obtain a snapshot of the host that carried some of our isolates, we extracted genomic DNA from the IAV-positive fecal samples collected during the winter of 2016–2017 (n = 24). The cytochrome C oxidase 1 (COI) gene was amplified from genomic DNA extracted from the fecal matter of the winter 2016–2017 samples, and the sequences of these amplicons were used for identifying the host species using the Barcode of Life Data (BOLD) system. All IAV isolates were carried by members of the genus Anser (wild geese) of the family Anatidae in the order Anseriformes (Fig. 2d, Supplementary Table S7). We were unable to identify the host at the species level for all the samples owing to nearly identical COI sequences among members of the genus Anser. However, the top hits included A. fabalis (bean goose), A. albifrons (greater white-fronted goose), A. anser (greylag goose), and A. cygnoides (swan goose). Both A. fabalis and A. albifrons are among the most common annual visitors to South Korea during winter and are the most probable avian hosts for the isolated IAVs15. We were unable to sequence the COI genes of 7 out of 24 samples owing to the poor quality of DNA extracted from the fecal samples.

Phylogenetic analyses of predominant HA subtypes

Full and partial sequences of the three most prevalent HA subtypes (H1, H5, H6) were compared with sequences pulled from GenBank and GISAID for phylogenetic analyses (Fig. 3). Our H1 isolates clustered with H1 samples that have been isolated from various locations along the EAAF including South Korea since 2004 (Fig. 3a). This indicates that the current H1 gene pool in South Korea has long been circulating in South Korea and other regions along EAAF. However, a recently reported sequence (2020) from Alaska also clustered with our isolates, indicating that the current EAAF H1 gene pool may have been recently carried over to a different continent by migratory birds either through long-distance migration or through interactions with other migratory bird species from the Pacific flyway.

Figure 3
figure 3

Phylogenetic analyses of the three most prevalent hemagglutinin subtypes collected in South Korea during winter from November 2014 to January 2018. Assembled full and partial (a) H1, (b) H5, and (c) H6 sequences of isolates (blue, bold-italic) compared with full coding sequences available at GenBank and GISAID. Isolates in bold red (b, H5 tree) indicate highly pathogenic avian influenza viruses (HPAI). Isolates collected from the Americas are labeled in magenta. Circles at nodes indicate transfer support values from 60 to 100%; the size of the circle is proportional to the bootstrap score. AB aquatic bird, Ck chicken, Dk duck, Gs goose, Md mallard, Kor Korea.

In contrast, the H5 and H6 gene pools appear to be more dynamic than that of H1. Our H5 isolates clustered with H5 viruses that have long been circulating since at least 2002 all over Eurasia and Australia, covering all the Eurasian, African, and Australian flyways (Fig. 3b). Meanwhile, our H6 isolates clustered into two major groups: some isolates clustered with H6 samples from North America, while some isolates clustered with H6 samples primarily from Eurasia (Fig. 3c). However, considering that the North American sequences that clustered with our samples were obtained much later (2012 onwards; Fig. 3c, Eurasian Group A) than the earlier Asian samples in the same cluster (2004–2009; Fig. 3c, Eurasian Group A), the North American gene pool in this group may have arisen from the Eurasian gene pool.

Phylogenetic analyses of predominant NA subtypes

Full and partial sequences of the three most prevalent NA subtypes (N1, N2, and N3) were compared with sequences pulled from GenBank and GISAID for phylogenetic analyses (Fig. 4). The NA genes of our isolates clustered with isolates from the EAAF, and all belong to the Eurasian lineage. Notably, some of our N1 isolates clustered with samples isolated from North America, suggesting that some of the North American N1 gene in circulation have arisen from the Eurasian gene pool. N2 and N3 sequences of viruses isolated in North America also grouped with samples from Eurasia, indicating that Eurasian genes are carried by migratory birds to North America.

Figure 4
figure 4

Phylogenetic analyses of the three most prevalent neuraminidase subtypes collected in South Korea during winter from November 2014 to January 2018. Assembled full and partial (a) N1, (b) N2, and (c) N3 sequences of isolates (blue, bold-italic) compared with full coding sequences available at GenBank and GISAID. Isolates collected from the Americas are labeled in magenta. Circles at nodes indicate transfer support values from 60 to 100%; the size of the circle is proportional to the bootstrap score. AB aquatic bird, Ck chicken, Dk duck, Gs goose, Md mallard, Kor Korea.

Because N1 and N2 subtype influenza viruses are the most associated with human illness and have the potential for reassortment with human influenza virus genes, we determined whether the N1 and N2 subtypes in this study were susceptible to approved NA inhibitors using the “Antiviral Resistance Risk Assessment” tool in the Influenza Research Database. All N1 isolates were found to be susceptible to oseltamivir, zanamivir, and peramivir. Meanwhile, all N2 isolates were found to be susceptible to oseltamivir and zanamivir.

Discussion

Similar to our previous report on avian influenza surveillance from 2009 to 2013 in South Korea14, percent IAV-positivity of bird fecal samples and prevalent IAV subtypes varied per year and per site over the winters of November 2014 to January 2018. All H5 and H7 IAVs isolated during the surveillance period were LPAIVs and were highly similar to isolates obtained from different locations along the EAAF. As expected, most of the HA and NA genes of the isolates clustered with those from isolates obtained from other countries along the EAAF, including China, Japan, and Mongolia. Supporting recent reports that IAVs from wild birds do not always follow rigid flyway boundaries and that there is intercontinental movement of IAV gene pools carried by wild fowl, some of the HA and NA genes of our isolates also cluster with a number of North American isolates. In addition, as shown by the present study and previous results on the surveillance of avian influenza viruses in South Korea14, percent IAV-positivity of samples tends to fluctuate across the years; however, we currently have no specific explanation for this interannual variability.

Influenza host species identification using the COI gene amplified from genomic DNA extracted from bird droppings collected during the winter of 2016–2017 indicates that majority of the isolates during the season were carried by migratory geese (Anser spp.). Although we were unable to identify the host at the species level, we speculate that the isolated IAVs came from either the taiga bean goose (A. fabalis) or the greater white-fronted goose (A. albifrons), which are the most dominant goose species that winter in South Korea from breeding sites in Far East Siberia17. IAVs, including HPAIVs, have been previously isolated from these goose species in other studies in South Korea13,18,19; migratory geese are therefore presumed to contribute to HPAIV outbreaks in the country.

Similar to our surveillance study in previous years, IAVs of the H6 subtype were among the most frequently isolated during successive winters in South Korea. This is corroborated by studies that suggest that H6-subtype IAVs are the most frequently isolated subtypes from wild waterfowl especially in Asia20. A study from China reported that 11.07% of the isolates from wild birds in 2016–2017 were of the H6 subtype, making H6 the most predominant subtype in that study21. In our study, IAVs of the H6 subtype were likewise the most prevalent over the winter of 2016–2017, which may indicate similar distributions of IAV subtypes along the EAAF that year. Another study has reported that 49.44% (44/89) of isolates from wild birds in China along the EAAF over 2015–2019 were of the H6 subtype22. In our study, the H6-subtype IAVs were consistently isolated in all four years, suggesting its persistence in migratory waterfowl populations that traverse the EAAF. Recent studies from China suggest that H6NX viruses from wild birds, including A/Anser fabalis/China/Anhui/L221/2014 (H6N1), which is 99% identical to one of our isolates (A/Kor/CN69/2014 [H6N2]), can infect mice and replicate at high titers in mouse lungs without prior adaptation23. While H6-subtype IAVs are considered LPAI and are non-reportable, they may still cause damage to the poultry industry (~ 30% drop in egg production)20. Given the potential ability of H6-subtype IAVs to infect mammals, their persistence in the wild, and their effects on poultry, closer investigation of H6-subtype IAVs is warranted.

Meanwhile, in our study, viruses of the H1 subtype were predominant in the winter of 2014–2015, accounting for 50% (18/36) of the isolates that season. This is in line with a previous report of an H1NX outbreak among wild fowl in Anhui, China24 and from another study in bird wintering wetlands in Yangtze River, China25.

Despite H5 HPAI outbreaks in poultry in South Korea over the study period, we found no H5 HPAIV during our surveillance. However, H5-subtype LPAIVs, especially H5N3 LPAIVs, were among the most prevalent subtypes, particularly in the winters of 2014–2015 (27%) and 2016–2017 (12.5%). Another surveillance study in South Korea also reported the prevalence of H5 LPAIVs from wild bird habitats in 2015 (13.1%) and 2017 (14.4%), coinciding with our results26. In contrast to our findings, this previous study reported the isolation of H5-subtype LPAIVs in 2017 in the Jeonnam Province of South Korea, while we obtained no isolate from Gocheonam Reservoir in Jeonnam Province. However, the Jeonnam isolates of this study were obtained in December 2017, while we performed sampling in the same region in November 2016. The differences in isolation rates of H5 and other AIV isolates in this province may be due to sampling bias, as we only sampled a very small portion of a large migratory bird stopover site, and only performed sampling on a single day in a single winter season. Obtaining more avian fecal samples later in the season and on other years may have yielded different results.

All our H5 isolates carried markers of increased transmission to mammals. These markers include the N171and T172 variations which, when present together, have been reported to increase binding of the A/Viet Nam/1203/2004 (H5N1) HA to the human ⍺2,6-linked sialic acid without loss of binding to the avian ⍺2,3-linked sialic acid15. Another variation found in most of our isolates was P251, which has been reported to slightly increase the binding of the A/duck/Egypt/D1Br12/2007 (H5N1) HA binding to ⍺2,6-linked sialic acid16. The L226 variation was not present in any of our H5 isolates. These findings suggest that H5 genotypes circulating in wild birds passing through South Korea and adjacent countries continue to gain potential to infect mammals. The persistence of these H5 sequences in the wild increases their chances of reassortment with other IAV genes that may lead to eventual crossing to the mammalian species barrier and perhaps to sustained transmission among mammals including humans. Considering that H5 LPAIVs have been reported as precursors of HPAIVs, continued monitoring of H5 LPAIVs is needed to guide mitigation and control measures for H5 IAV infections in both wild and domestic fowl.

Only 2 fecal samples (2.6% of total) yielded H7-subtype isolates, both of which were H7N7 LPAIVs. Interestingly, both the HA and NA genes of the isolates were highly identical to those of an H7N7 isolate from a red-crowned crane (A/red-crowned crane/Korea/H1026/2017) in a zoo in South Korea isolated in March 201727. Our H7N7 samples were collected in January 2017, placing this particular H7N7 gene pool in South Korea earlier than closely related H7N7 isolates collected from migratory mallards in February to March of 2017. Notably, H7 isolation was more frequent in the other study (43.6% of all isolates in early 2017) than in ours (8.3%), possibly owing to differences in sample timing, host species, and sampling method28. Regardless, transmission of H7 IAVs from migratory fowl to other bird species was highlighted by the isolation of the H7N7 isolate in the captive crane, which further emphasizes the likelihood of transmission of H7 IAVs from migratory wild birds to domestic birds.

As in previous years of our study, the IAV isolation rate was variable per year. This may be a result of changes in seasonal temperatures across different years. Migration of fowl, including bean geese and white-fronted geese, is highly affected by temperatures. The timing of bird migration to wintering sites may change based on temperatures in both the breeding and wintering sites29. This would then affect the composition of migratory fowl populations at stopover sites and, in turn, affect observed IAV prevalence. Environmental temperatures also affect the persistence of IAVs in aquatic environments and may affect the maintenance of IAVs in wild birds30; changes in temperatures across the years would then alter the prevalence of IAV infection in wild birds. Thus, the timing of IAV surveillance studies such as this should take into account potential variabilities in environmental temperatures and timing of bird migration to be able to obtain a more comprehensive view of IAV prevalence in breeding, stopover, or wintering sites.

One limitation of our study is that, at the time of writing, the sequences we report are already at least five years old. Several studies have already reported more recent data (2019 to 2022) on circulating avian influenza viruses among wildfowl that visit South Korea, although most have focused on H5-, H7-, and H9-subtype influenza viruses19,31,32,33. Given the continued threat of AIVs to both animal and human health, large amounts of genetic data are important to allow us to track the movement of avian influenza virus gene pools across time and geographic locations. Ensuring that sequences of influenza viruses over large periods of time are reported will fill in potential gaps of information in tracing back the evolution of future avian-origin influenza virus pandemics. This has been highlighted by the recent coronavirus disease (COVID-19) pandemic, where, owing to very few surveillance studies in the past and very little existing genetic information on severe acute respiratory syndrome (SARS)-related viruses prior to the pandemic, scientists have difficulty in fully understanding the emergence of SARS coronavirus 2 (SARS-CoV-2). Global public health will undoubtedly benefit from all available genetic information on AIVs especially within the past couple of decades, as these would be the first resources to investigate should an avian-origin pandemic arise. It would also allow us to further understand the dynamics of the continued spread and circulation of avian influenza viruses in the wild in the hope of preparing for large outbreaks in both wild and domestic birds.

Additionally, most AIV surveillance studies in South Korea focus on H5-, H7-, and H9-subtype influenza viruses. While these subtypes pose the highest and most immediate risk to veterinary and human health, other subtypes of AIVs may contribute genetic material to future threats, and, in rare cases, cause disease and death in humans34. Thus, we believe that long-term surveillance studies on IAVs that include LPAIVs and HPAIVs and the reportable and non-reportable subtypes can provide the scientific community a wealth of genetic information that can be utilized when the need arises.

All in all, in this study, we were able to collect a total of 6758 bird fecal samples from five migratory bird stopover sites in South Korea during the winters of 2014 to 2018 (November 2014 to January 2018). Of these, 75 (1.11%) tested positive for IAVs based on egg isolation and RT-PCR. The most common HA subtypes were H1, H6, and H5, and the most common NA subtypes were N1, N3, and N2. Based on sequence analysis, all isolates, including the H5- and H7-subtypes, were LPAIVs and were most similar to isolates collected along the EAAF. As in previous years, isolation rates and predominant subtypes varied per site and per year. Although we were unable to isolate HPAIVs from the bird fecal samples despite HPAIVs outbreaks in South Korea during the study period, this surveillance study provides insight into the dynamics of IAVs carried by migratory birds into South Korea.

Methods

Sample collection and virus isolation

Through the winter seasons of 2014 to 2018 (November 2014 to January 2018), a total of 6758 fecal samples were collected from stopover sites of wild migratory birds in South Korea. Fecal samples were stored in transport medium consisting of phosphate buffered saline (PBS) and glycerol (50%) with antibiotics (1000 U/ml penicillin G and polymyxin B, 500 U/ml nystatin, 250 µg/ml gentamicin, 60 µg/ml ofloxacin, and 200 µg/ml sulfamethoxazole) (Sigma-Aldrich, St. Louis, MO). The collected samples were stored at − 80 °C until analysis or egg passage.

The samples were suspended in antibiotic-supplemented PBS and inoculated into the allantoic cavity of 10-day-old embryonated chicken eggs35. The allantoic fluid was collected; turbid fluid was assumed to be contaminated with bacteria or fungi and was discarded. All viral isolates were collected from the first egg passage. Viral presence was identified through the hemagglutination assay using 0.5% chicken red blood cells.

RT-PCR and sequence analysis

Viral gene amplification using samples that tested positive in the hemagglutination assay was performed as previously described30. Using the RNeasy Mini Kit (Qiagen, Valencia, CA), viral RNA was extracted from the hemagglutination-positive allantoic fluid. For HA and NA gene subtyping, one-step RT-PCR was carried out using the One Step RT-PCR Kit (Qiagen) with HA- or NA-specific universal primer sets designed and described in a previous study36. The RT-PCR reactions were set up according to manufacturer instructions. Each 50-μl reaction contained 1.5 μl of each primer (20 pmol/μl) and 2 μl of viral RNA (1 pg–2 µg). Reverse transcription was performed at 50 °C for 30 min, and standard PCR was performed with an initial denaturation at 94 °C for 10 min; followed by 35 cycles of 94 °C for 30 s, 56 °C for 30 s, and 72 °C for 2 min; and a final extension at 72 °C for 10 min. After purification with the QIAquick Gel Extraction Kit (Qiagen), the amplified gene segments were commercially sequenced at Cosmogenetech Co. (Seoul, South Korea). Sequencing was based on the Sanger sequencing technology using the high-throughput DNA Analyzer (Applied Biosystems 3730xl DNA Analyzer, Thermo-Fisher Scientific, Waltham, MA). Full-length coding sequences (HA: 1710 bp; NA: 1410 bp) were assembled using the Lasergene sequence analysis software package (DNASTAR, Madison, WI). A BLASTn query was performed in NCBI to identify the HA and NA subtypes of the isolates based on the obtained sequences and to identify isolates reported in GenBank that had the highest HA and NA coding sequence similarities to our isolates. The search results were supplemented with a BLAST analysis in GISAID especially for sequences that had hits with < 99% sequence identity on GenBank. Full and partial HA and NA sequences of the isolates were deposited to GenBank, and accession numbers are listed in Supplementary Table S1.

Host identification using the COI gene

Genomic DNA was extracted from wild bird fecal droppings collected in winter 2016–2017 using the QIAamp DNA Stool Mini Kit (Qiagen) according to the manufacturer's protocol. The COI gene was isolated thru PCR amplification using bird-specific COI primers: forward primer: 5′-TTCTCCAACCACAAAGACATTGGCAC-3′, and reverse primer: 5′-ACGTGGGAGATAATTCCAAATCCTG-3′37, followed by purification using a QIAquick Gel Extraction kit (Qiagen). The amplified COI genes were commercially sequenced (Cosmogenetech), and the full sequences of the COI genes were assembled using the Lasergene analysis software. The host species were determined using the web version of BOLD SYSTEMS Identification Engine (https://www.boldsystems.org/index.php/IDS_OpenIdEngine), which is widely used for species-level identification38. The “All Barcode Records on BOLD” was selected as the search database, and the hits were determined based on % similarity with the assembled COI genes of the samples.

Sequence alignment to determine sequence motifs

The H5 and H7 nucleotide sequences of the isolates were translated into amino acid sequences using the EMBOSS Transeq translation tool (https://www.ebi.ac.uk/Tools/st/emboss_transeq/). The amino acid sequences were aligned with sequences of reported HPAI H5 and H7 isolates using UGENE to determine whether the HA of the isolates had monobasic or multibasic cleavage sites.

Phylogenetic analyses of HA and NA coding sequences

The assembled full and partial coding sequences of H1, H5, H6, N1, N2, and N3 isolates were aligned using CLUSTAL V39. Maximum likelihood phylogenetic trees were prepared through the web version of PhyML (1,000 transfer bootstrap replicates; http://www.atgc-montpellier.fr/phyml/)40. The trees were rooted using the NJ Plot software41 and visualized using Interactive Tree of Live (iTOL; https://itol.embl.de/)42.

Identification of HA and NA sequence features

To assess whether the H5 sequences carried markers for mammalian infection, we submitted the nucleotide coding sequences of the H5 isolates to the “Identify Sequence Features in Segments” analysis tool in the Influenza Research Database (https://legacy.fludb.org). Amino acid features with hits are listed for the H5 isolates.

To determine whether our N1 and N2 isolates are susceptible to approved influenza NA inhibitors, we first obtained the amino acid sequences of the N1 and N2 isolates using the EMBOSS Transeq translation tool. The amino acid sequences were then submitted to the “Antiviral Resistance Risk Assessment” analysis tool of the Influenza Research Database.