Main

Microbial source tracking (MST) is currently receiving increasing attention for its use in prevention of water contamination and in accurate assessment of human health risks. MST is a tool used to identify the origin of fecal contamination in water through various molecular and biochemical methods (Simpson et al., 2002; Seurinck et al., 2005; Gourmelon et al., 2007). Target microorganisms for MST typically include conventional fecal indicator microorganisms, including Escherichia coli, enterococci and bacteriophages (Scott et al., 2002). Traditionally, these target microorganisms are isolated from samples, and various genotypic and/or phenotypic methods are applied to properly categorize the fecal origin of the microorganisms. Recently, culture-independent molecular markers, such as the 16S rRNA gene or taxon-specific genes (for example, the Enterococcus spp. esp gene and the Bacteroidales-specific 16S rRNA gene), have been identified and used for MST analysis (Fogarty and Voytek, 2005; King et al., 2007; Byappanahalli et al., 2008). However, use of this technique is limited by the fact that not all of these molecular or biochemical markers can provide the correct identification of fecal sources.

Comparison of fecal microbiota represents another method that could be used for the identification of the most likely source organism in MST analysis. Recently, 454 pyrosequencing has become a powerful tool that can be used to analyze millions of nucleic acid sequences with a low level of error (<1%) to determine microbiota composition (Margulies et al., 2005; Andersson et al., 2008). However, to date, few studies have characterized the fecal microbiota in human and animal feces (Andersson et al., 2008; Turnbaugh et al., 2009), and this method has not been used for MST. Thus, the objectives of this study were to characterize the compositions of the microbial communities in human and animal feces to identify specific microorganisms in each host for potential use in MST analysis.

In total, 38 fecal samples, including 14 human, 6 chicken, 6 cow, 6 pig and 6 wild geese, were collected for this study. Human fecal samples were obtained from healthy adults with normal body mass indices. Livestock fecal samples were collected from various farms located in Kyunggi Province, South Korea as described in a previous study (Lee et al., 2009). Fecal samples from wild geese were collected from their habitats in the Cheonra area of the southeastern part of South Korea.

Total DNA was extracted from feces using previously described methods (Turnbaugh et al., 2009). The 16S rRNA gene was then amplified from isolated DNA using a previously reported primer set (Fierer et al., 2008). The 27F and 338R primers were used as forward and reverse primers and contained the 454 Life Sciences (Branford, CT, USA) primers B and A, respectively. A unique 5-bp error-correcting bar code was used to tag each PCR product. Amplified PCR products were purified using the QIAquick PCR Purification Kit (Qiagen, Valencia, CA, USA). The resulting samples were sent to the National Instrumentation Center for Environmental Management at Seoul National University for pyrosequencing using a 454 Life Sciences Genome Sequencer FLX machine (Roche, Basel, Switzerland).

Sequences were processed and analyzed as previously described (Fierer et al., 2008; Ley et al., 2008). For each of the 38 fecal samples, 3000 reads of the V2 region were randomly selected. The nucleic acid sequences of the 16S rRNA gene were trimmed using the Greengenes database with a quality score of <20. A multiple sequence alignment was generated using NAST (parameters: minimum alignment length, 200 bp; sequence identity, 70%) (DeSantis et al., 2006a). The taxonomy of each phylotype was classified based on the Greengenes database (DeSantis et al., 2006b) using the Ribosomal Database Project taxonomy. Pre-aligned sequences were added to a neighbor-joining tree available from the Greengenes core set database using ARB project (http://www.arb-home.de). Then, the UniFrac program was used to conduct principal coordinate analysis (Lozupone and Knight, 2005). Finally, a network-based analysis using Cytoscape 2.6.3 was performed to identify core and specific operational taxonomic units in each host (Shannon et al., 2003).

The V2 region of the 16S rRNA gene was pyrosequenced with an average of 7252±2107 reads per sample, and 3000 reads were randomly selected for each sample and subjected to analysis as described in a previous study (Turnbaugh et al., 2009). The vast majority (>99%) of classified sequences belonged to four phyla, Firmicutes, Bacteroidetes, Actinobacteria and Proteobacteria (Supplementary Figure S1). When the phylogenetic characteristics of the gut microbiota in each host were compared based on the average unweighted UniFrac distance, the human gut microbiota was found to be significantly different from that of all other animals. The microbiota in chicken fecal matter exhibited the greatest difference from human fecal samples (***P<10−6), and the microbiota in pig samples exhibited the least difference from human fecal samples (*P<0.01) (Supplementary Figure S3). The Shannon–Weaver indices were highest in cows (4.53), followed by humans (4.14), chickens (4.02), pigs (3.90) and geese (3.67) (Supplementary Table S1, Supplementary Figure S2).

Principal coordinate analysis revealed clustering based on the origin of the fecal samples (Figure 1a), with the exception of human samples. Human samples were found to be scattered throughout the plot. When principal coordinate analysis was analyzed for each of the phyla represented, members of the Firmicutes phylum were found to be primarily responsible for the clustering observed in the principal coordinate analysis (Supplementary Figure S4), indicating that bacteria belonging to this phylum may represent useful microbiological and molecular targets for MST.

Figure 1
figure 1

Clustering of fecal bacterial communities by principal coordinate analysis (PCoA) using UniFrac (• chicken, ▴ cow, ▪ goose, ♦ pig and – human) (a) and results from network-based analysis of microbiota (b and c) (N=38). Nodes represent operational taxonomic units (OTUs), and each line indicates that an OTU was identified in the same source.

Our results indicate that the general compositions of the gut microbiota in humans and other vertebrates were similar overall (Supplementary Table S2). However, specific differences, such as alterations in microbial diversity and the presence of specific microorganisms, were identified in each of the gut microbiota samples analyzed. Several previous studies have suggested that high levels of variation are present in the gut microbiota of different populations (Andersson et al., 2008; Ley et al., 2008). In particular, the gut microbiota composition can be affected by various factors, including diet, age and obesity (Ley et al., 2006). For these reasons, body mass index and age in human subjects were carefully controlled in this study.

Figures 1b and c depict a network-based analysis of fecal microbiota obtained using the Cytoscape program (Ley et al., 2008). This analysis revealed that microbial communities from the same fecal origin were highly similar (Figure 1b). In addition to a common core microbiota, several microorganisms were found specifically in samples of a particular fecal origin (Figure 1c). For example, Bifidobacterium spp. were identified only in human samples (21% of analyzed samples). Therefore, Bifidobacterium could be considered to be a specific fecal microorganism representing fecal contamination from humans. These results were consistent with previous studies (Long et al., 2005; Dorai-Raj et al., 2009). In addition, Yania spp. were found in 33% of chicken fecal samples and were specifically identified only in chickens (Table 1). These results suggest that Yania could be useful as a specific indicator bacterium for fecal contamination from chickens. Other operational taxonomic units specific to fecal origin included Agromyces spp. (goose) and Marinicola spp. (pig). The presence of these host-specific microorganisms could be because of various biological and ecological characteristics, such as diet and environmental exposure (Turnbaugh et al., 2009), and may be useful as host-specific biomarkers for MST. However, future studies assessing the use of these markers for MST should characterize the fate and transport of these fecal microorganisms in a water-based environment, as this study analyzed only fecal samples. In conclusion, we have characterized and identified specific microbiota in human and animal fecal samples. Further studies using a larger sample size representative of different geographic regions should be performed in the future to better characterize the distributions of differences in microbiota across a variety of species.

Table 1 Specific genera identified in human and animal samples