Diversity of macaque microbiota compared to the human counterparts

Studies on the microbial communities in non-human primate hosts provide unique insights in both evolution and function of microbes related to human health and diseases. Using 16S rRNA gene amplicon profiling, we examined the oral, anal and vaginal microbiota in a group of non-captive rhesus macaques (N = 116) and compared the compositions with the healthy communities from Human Microbiome Project. The macaque microbiota was dominated by Bacteroidetes, Firmicutes and Proteobacteria; however, there were marked differences in phylotypes enriched across body sites indicative of strong niche specialization. Compared to human gut microbiota where Bacteroides predominately enriched, the surveyed macaque anal community exhibited increased abundance of Prevotella. In contrast to the conserved human vaginal microbiota extremely dominated by Lactobacillus, the macaque vaginal microbial composition was highly diverse while lactobacilli were rare. A constant decrease of the vaginal microbiota diversity was observed among macaque samples from juvenile, adult without tubectomy, and adult with tubectomy, with the most notable distinction being the enrichment of Halomonas in juvenile and Saccharofermentans in contracepted adults. Both macaque and human oral microbiota were colonized with three most common oral bacterial genera: Streptococcus, Haemophilus and Veillonella, and shared relatively conserved communities to each other. A number of bacteria related to human pathogens were consistently detected in macaques. The findings delineate the range of structure and diversity of microbial communities in a wild macaque population, and enable the application of macaque as an animal model for future characterization of microbes in transmission, genomics and function.


Results
Diversity of microbial composition in healthy macaques and humans. Oral, perianal and vaginal (female only) swabs were collected from 117 rhesus macaques (84 females and 33 males) (Tables 1 and S1). Samples from one female animal in pregnancy was excluded from this study. All surveyed animals appeared healthy according to the body condition and temperature measured by veterinarians. We performed amplicon sequencing on the V4 region of the 16S rRNA gene, producing 13,583,251 high-quality sequences from a total of 318 samples using an Illumina MiSeq sequencing (mean 42,715 ± 15,848 reads per sample). In order to describe the surveyed macaque-associated microbial communities in the context of the human counterparts, we downloaded 2,689,425 16S rRNA gene V3-V5 sequences from the Human Microbiome Project (HMP) (SRA accession PRJNA48489) (Table S2). These HMP 16S sequences were generated by 454 pyrosequencing from 420 samples (mean 5,110 ± 2,652 reads per sample) consisting of paired saliva, stool and vaginal (female only) sites from 173 healthy human individuals (74 females and 99 males) 3,26 . All samples with high-quality reads less than 2,000 were removed from the further analysis (Table S1).
Using a threshold of 97% sequence similarity, we identified 755 and 1,644 operational taxonomic units (OTUs) from the human and macaque cohorts, respectively. The samples showed variable species richness when comparing the three body sites (rarefiled to 2,000 reads per samples), but the variability was independent of host species indicating there was unlikely obvious batch bias associated with these two datasets (Fig. S1). Microbial community composition was represented using unweighted UniFrac distances as this metric resulted in more distinct clustering of samples by body site and host species in ordination analysis when compared to the weighted UniFrac distances (entire variance, 94.886 vs 83.761) (Fig. S2). A principal component ordination indicated that the sample clustering was driven more by the presence/absence of bacterial OTUs (unweighted) rather than the proportion of microbial community members (weighted). Overall, the microbial community diversities were compositionally distinct with respect to body sites and host species (Fig. 1A). For example, human vaginal samples had the lowest OTU richness and Shannon diversity (at the genus level) among habitats due to a dominance of Lactobacillus (mean abundance of 78.65 ± 3.36%) which was rare in the macaque vagina (0.35 ± 0.07%). Macaque anal microbiota was significantly more diverse when compared to the human stool community (mean Shannon index of 2.42 vs 1.78, p < 0.001). In contrast, the surveyed macaques shown less diversity of the oral microbial community to the humans (1.85 vs 2.65, p < 0.001). From the principal component ordination, we observed that oral microbiota showed the least within-host and between-host variability when compared to anal or vaginal communities (green boxplots in Fig. 1B), implying relatively higher similarity of oral microbiota composition between macaque and human hosts. Using the adonis function in R's package 'vegan' , a permutational multivariate analysis of variance (PERMANOVA) based on unweighted UniFrac distances indicated that approximately 46% of variation in microbial community composition could be attributed to body site (Df = 2, R 2 = 0.2627, pseudo F = 170.8, p < 0.001), host species (Df = 1, R 2 = 0.0864, pseudo F = 112.3, p < 0.001) and the combination of site and host (Df = 2, R 2 = 0.1071, pseudo F = 69.6, p < 0.001), which supported the observations from the principal component ordination that microbial variation was mainly associated with both site and host species (Fig. 1C). Gender was not significantly associated with the composition of oral and anal microbial communities (R 2 = 0.0022, p = 0.167) (Fig. S3).  Macaques  PRJNA411767  V4  116  314  83  32  83  33  83   Humans  PRJNA48489  V3-V5  173  399  70  88  72  96  73   Total  290  713  153  120  155 129 156 Site-by-site genus-level comparison of microbial communities between hosts. At the phylum level, composition of microbial communities associated with the surveyed macaques largely resembled those in the corresponding body sites of their human counterparts. The macaque microbiota was predominantly composed of Firmicutes, Bacteroidetes and Proteobacteria which collectively accounted for 93.5% mean abundance in oral cavity, 97.6% in anus and 78.4% in vagina ( Table 2, Fig. 2A,B). Particularly, Firmicutes accounted for 43.6% of overall 16S sequence reads and were detected in every surveyed macaque and human samples across the body sites (taking into account OTUs with >1% relative abundance in at least one sample). Proteobacteria and Bacteroidetes were commonly present in macaque samples (>98% and >94% prevalence at >1% relative abundance, respectively) but they were less prevalent in human vagina (44% and 37% prevalence at >1% relative abundance, respectively). Besides, Fusobacteria and Actinobacteria were relatively common in the oral cavity and vagina but rare in anus samples. Spirochaetes seems to be unique to macaques and were found in 60% of vagina samples (at >1% relative abundance). Members of other phyla, such as Tenericutes, Verrucomicrobia and Saccharibacteria (TM7) were less prevalent and detected at relatively lower abundances (combined mean abundance <0.5%) ( Fig. 2A). The majority of bacteria belong to three phyla (Firmicutes, Bacteroidetes and Proteobacteria); however, the microbial components at the genus level were highly diverse, composed of distinct taxa shaping the differences of communities by habitats (Table S3, Fig. 2C). Using OTUs summarized at the genus and higher taxonomic levels (QIIME L6), we defined 59 "core" bacterial taxa comprising >1% mean abundance in any of respective habitats (e.g., oral, perianal or vaginal) surveyed in this study (Fig. 3A, Table S3). Each community was predominantly represented by one or a few of bacterial taxa distinguishing from other habitats. For example, Bacteroides was detected in all human stool samples (100% prevalence of >1% relative abundance) with a mean relative abundance of 51.1 ± 1.8%, while Helicobacter was the most dominant anal bacteria in the surveyed macaques, accounting for 30.9 ± 2.9% mean relative abundance, and was detectable in 97% of animals (>1% relative abundance) (Figs 3A and S4). The oral cavity of both macaques and humans mainly harboured Streptococcus and Haemophilus, but the composition of less dominant taxa was different: Gemella was more common in macaques (mean abundances of 6.2% vs 1.0%, p < 0.001), and Prevotella (1.3% vs 13.1%, p < 0.001) in humans. Some taxa could be found in multiple habitats (Fig. 3B, Table S3). For example, Prevotella was present in all surveyed communities, although the relative abundance within each habitat was variable. There were four "core" bacterial taxa shared between macaque oral and vaginal sites (Porphyromonas, Streptococcus, Fusobacterium and Haemophilus), three between anal and vaginal sites (Ruminococcaceae, Selenomonas and Dialister), and one between oral and anal sites (Prevotellamassilia). In contrast, the human microbiota from different communities were less overlapped (Fig. 3B).

Anus.
At the phylum level, human stool communities were dominated by Bacteroidetes (69% mean abundance) and Firmicutes (27%) whereas macaque anal samples had approximately equal relative abundances of Bacteroidetes (27%), Firmicutes (35%) and Proteobacteria (36%) ( Table 2, Fig. 2). A large proportion of genera belonging to the phylum Firmicutes, such as Dialister, Eubacterium, Faecalibacterium and Phascolarctobacterium, were detected in both macaques and humans (Tables S3 and S5, Figs 3A, S4 and S5). In contrast, members of Bacteroidetes were differentially enriched between two hosts. We detected higher relative abundances of Prevotella and Prevotellamassilia in macaques in contrast to the human counterparts where Bacteroides, Alistipes and Parabacteroides were largely enriched. One striking feature of the macaque anal microbiota was its high predominance of proteobacterial OTUs. Helicobacter accounted for 87% of proteobacterial sequences in macaque anal samples and was found in 97% samples (at > 1% relative abundance), but were extremely rare in healthy human stool samples. Using LEfSe, we classified 19 "core" bacterial genera that discriminated macaque and human anal communities, with Helicobacter dominating in macaques (31% mean relative abundance) and Bacteroides (51%) in humans (Fig. 3C). These two bacteria were negatively correlated with other "core" taxa in the relevant host habitats (Fig. S6).
Vagina. The human vaginal microbiota was highly enriched by Lactobacillus (79% mean abundance), a striking contrast to the macaque vagina which was not obviously dominated by any single genus (Figs 2, 3, S4 and S5). The macaque vaginal microbiota was extremely more diverse than their human counterparts (p < 0.001) (Fig. 1A) and was represented by at least 24 discriminative taxa belonging to 6 phyla (

OTU-level segregation of pathogenic microbes in macaques.
Several bacteria detected in macaque samples are potentially designated as known human pathogens, the most notable being Fusobacterium in oral cavity, Helicobacter in anus and Gardnerella in vagina. In order to determine whether the macaque OTUs represented opportunistic "pathogens" define by the Pathosystems Resource Integration Center (PATRIC), we aligned their 16S as well as sequences from the human samples to the species classification. In total, 40 of 461 PATRIC pathogens were detected at >1% prevalence of >0.1% relative abundance in the surveyed human and macaque samples (Table S7). Class A-C pathogens defined by the National Institute of Allergy and Infectious Diseases (NIAID) such as Bacillus anthracis, Clostridium botulinum, Francisella tularensis and Yersinia pestis recorded less than 0.1% relative abundance in any surveyed macaque and human individuals. Most of detectable pathogens were niche-specific and broadly distributed in healthy human subjects, and some taxa could be found in both hosts with low relative abundance. For example, Bacteroides vulgatus was detected in 4 macaque anal samples while this taxon was nearly ubiquitous in humans (prevalence of 94%, 21% and 15% in human stool, oral and vaginal samples at >0.1% relative abundance, respectively). Helicobacter was the most dominant genus in the surveyed macaque anal community, but these OTUs showed <97% similar to H. pylori based on the alignment of the 16S V4 region and were mostly identical to H. fennelliae (NCBI accession number of NG_042880) and H. macacae (NG_042884). In vaginal samples, 22% and 28% of macaque and human subjects (at >0.1% relative  Capnocytophaga gingivalis, Porphyromonas gingivalis, Rothia mucilaginosa, Tannerella forsythia and Treponema denticola. Streptococcus pseudopneumoniae were common in macaque oral cavities (>82% prevalence with >0.1% relative abundance) but nearly absent in healthy human individuals. It is important to note that these inferences are based on the V4 region of the 16S gene, which has limited resolution in determining taxonomy at species level. Ideally, presence of these putative pathogens in macaque samples should be verified with full length 16S rRNA gene sequences.
Functional profile of the macaque microbiota. Since the microbial community compositions of the surveyed macaques and humans were most heavily influenced by body site, we hypothesized that physiological similarities between the same body sites between macaque and human may enrich for functionally similar microbial communities. Hence, we used Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt) to predict functional pathways based on the composition of the microbial communities and produced Kyoto Encyclopaedia of Genes and Genomes (KEGG) Orthology (KO) abundance profiles for each sample. These KO abundance values were then summarized into KO pathways based on their involvement in metabolic pathways as described in the KEGG database. A PERMANOVA performed on variance stabilized-transformed KO profiles indicated that body site had the largest influence on predicted KO abundance followed by host species in descending size of effect (p < 0.001), supporting our hypothesis that body sites selected for functionally similar communities irrespective of host species (albeit in humans and macaques which share many physiological similarities). The hypothesis was also supported by KO profiles between samples showing less variability compared to community composition (Fig. 4). Similar to community composition, KO profiles were not associated with gender in oral cavity and anal sites when vagina samples were excluded from the analysis (p = 0.519). Results of the summarized KO profiles were supported by sparse partial least squares discriminant analysis (sPLSDA) using the first three ordination components that show clustering of samples mainly by body sites (Fig. S7). Next, we attempted to identify the metabolic functions that discriminated between communities of the body sites. When comparing corresponding body sites between macaque and human hosts, there was consistently increased metabolism associated with plant fiber degradation (e.g., butanoate, glycerophospholipid, propanoate and pyruvate) in macaque relative to human, which was supported by the more relatively abundant chloroplast sequences observed in macaque 16S sequences (Fig. S8). This observation suggested that the surveyed macaques primarily consumed fiber-rich plant material in contrast to humans with more diversified diets. Other pathways related to amino acid and lipid biosynthesis/metabolism and energy production such as glycolysis, pentose phosphate pathway and the citrate cycle, however, showed small and varying associations with hosts, suggesting macaques and humans may share highly similar microbial metabolism in the same body sites (Tables S8-S10). We suspect the limited resolution of partial 16S rRNA gene in discriminating bacterial phylotypes, as well as a possible lack of macaque-specific PICRUSt reference microbial genomes may have restricted the utility of this analysis, as relatively higher scores of the weighted Nearest Sequenced Taxon Index (NSTI) were observed in macaque datasets when compared to the human datasets (oral: mean of 0.0484 vs 0.0408, p = 0.7783; anal: 0.0864 vs 0.0799, p = 0.0172; vaginal: 0.0834 vs 0.0240, p < 0.001) (Table S11). There was no significant difference of the composition of oral microbiota between juvenile and adult macaques (data not shown). However, a dramatically decreased anal microbiota diversity was observed in juvenile male macaques compared to their adult counterparts (Shannon index, mean of 1.65 vs 2.79, p = 0.0015) (Fig. 6A), which inversely was not detected between female age groups (2.53 vs 2.48, p = 0.8631). Consistent with the distinct clustering of anal microbial communities between juvenile and adult male macaques (using weighted UniFrac distance, Df = 1, R 2 = 0.1838, pseudo F = 7.0, p = 0.003), the young animals were predominantly colonized with Helicobacter (mean relative abundance of 59% vs 12%) though this bacterium was extremely common in both age groups (93% and 88% prevalence at >1% relative abundance) (Tables S5 and S12).
Similarly, significant differences of the vaginal microbiota diversity (3.21 vs 2.77, p = 0.0028) and community composition (Df = 1, R 2 = 0.0907, pseudo F = 8.1, p = 0.001) between juvenile and adult (female) macaques were observed (Fig. 6B), with 15 and 12 bacteria taxa (>1% mean relative abundance in any of age groups) discriminating juvenile and adult animals, respectively (Table S13). Since these adult animals were part of a contraceptive operation project on the macaques, we hypothesize that the altered estrogen hormone may affect the vaginal microbial composition. 79% (58/73) of adult female macaques had already undergone tubectomy before they were re-captured for the veterinary inspection and sample collection (Fig. 5). Interestingly, PERMANOVA test using a weighted UniFrac distance well supported the separation of vaginal microbiota communities between adult macaques with and without tubectomy (Df = 1, R 2 = 0.0441, pseudo F = 3.3, p = 0.005) (Fig. 6C). We also found a relatively decreased vaginal microbiota diversity in the contracepted animals (2.75 vs 2.81, p = 0.8112), although the statistical test was not significant. When vaginal microbial communities from juvenile macaques, adult without tubectomy, and adult with tubectomy were compared, a number of bacteria taxa showed constant increase (N = 7) or decrease (N = 5) of the relative abundances across groups (>1% mean relative abundance in any of three categorized groups), with the most notable distinctions being the enrichment of Saccharofermentans (0.01%, 4.62%, and 8.68%, p < 0.001) in contracepted adults, and Halomonas (7.33%, 3.05% and 1.43%, p < 0.001) in juvenile females (Tables 3, S13 and S14).

Discussion
In this study, we surveyed the microbiota of paired oral, anal and vaginal swab samples of 116 non-captive healthy macaques and compared their compositions to the communities of the corresponding body sites of their human counterparts. Overall, the macaque-associated microbial communities were more diverse than humans which was consistent with other surveys of primate rectal swabs 25,27,28 , skin 17 and vaginal microbial communities 22 . The observations of higher diverse of microbiota in macaques could be attributed to a mix of evolutionary histories and lifestyles in which humans have altered their exposure to microorganisms through means such as diet, medicine, hygiene and lifestyle. For example, human guts show a predominance of Bacteroides whereas the surveyed macaque anal samples exhibited increased relative abundances of Prevotella. The dominance of Bacteroides in humans is often associated with a diet high in protein and fat 29 , whereas Prevotella is associated with carbohydrate and fibre 30 . Although we did not have information regarding the specific diet of the surveyed macaques, the dominance of Prevotella is in line with a plant-based dietary preference in macaques 31 . This observation was also supported by the increased proportion of fibrolytic genes detected in the surveyed macaques. It was previously inferred that doucs and howler monkeys recruited Bacteroides and Prevotella into their gut microbiota as a result of reduced dietary plant fibre associated with captivity 25 , suggesting that dietary perturbation could be one of major drivers altering the stool microbial composition. Another notable observation in the macaque anal samples was the dominance of proteobacterial taxa, namely Helicobacter. Using serological assays, Blaser's group found that Rhesus macaques have widespread exposure to H. pylori 32 . Surveys of intestinal microbial communities in healthy captive macaques also indicated the presence of Helicobacteraceae OTUs 21,33 , which together with our data suggest that Helicobacter are a feature of macaque gut microbial communities. Interestingly, a higher abundance of Helicobacter was observed in juvenile male macaque anal samples, in line with the colonization in human beings that H. pylori is predominantly acquired during childhood 34 . H. pylori in populations in developed countries is disappearing due in large part to changing hygienic standards and antibiotic treatments 35 . Although Helicobacter colonization is often linked to diseases in humans, such as peptic ulcer disease, gastric cancer, and mucosa-associated lymphoid tissue lymphoma 36 , reverse associations of H. pylori with early-onset asthma 37 and gastrointestinal infections 38 demonstrate also beneficial protection of this late-in-life pathogen in early life. The amplified 16S rRNA Helicobacter OTUs in macaque anal samples in the current work were most identical to H. fennelliae and H. macacae; both of them are categorized as enterohepatic helicobacters and have been linked to diarrhoea, bacteraemia and colitis in macaques 39,40 , which may prove useful in understanding the aetiology and pathogenesis of gastrointestinal diseases in humans. As an example of how the gut microbiota composition differs between humans and macaques, however, the dominance of Helicobacter in macaque guts emphasises the need to be careful when interpreting findings from macaque animal models. For example, Helicobacter pylori are suspected to dampen the effects of a drug used for treating Parkinson's disease in humans 41 . Although of different species, there is a possibility that the Helicobacter in macaques could also influence the outcome of animal trials testing efficacy of medical drugs.
Similar to the anal microbiota, vaginal communities in the surveyed macaques were highly diverse compared to their human counterparts. The finding was consistent with surveys of vaginal microbial communities in a number of non-human mammals, in which the dominance of Lactobacillus in the human vagina appears to be an exception rather than the norm 22,24,42 . A proposed mechanism behind the dominance of Lactobacillus in the human vagina is through increased glycogen availability, which functions as an energy source for lactobacilli, thereby promoting a low pH environment through lactobacilli-driven lactic acid production 43,44 . In contrast, the macaque lower genital tract is characterized by low levels of glycogen and lactic acid 45 , and previous study of short chain fatty acid content in the vagina of rhesus macaques reported that the main acid constituent is acetate 46 . Based on this finding, we infer that vaginal acidity in the surveyed macaques could be maintained by a variety of bacteria such as Fastidiosipila, Murdochiella, Peptoniphilus and Dialister by producing acetic acid to lower pH environment and protect from invasive pathogens. The presence of fermentative bacteria in the macaque vagina could be conserved in NHPs but not humans as the volatile short chain fatty acids produced can be used as scents for communication, social interaction and other olfactory-driven behaviour 47 , which arguably is less important in humans. Alternatively, a shift towards high starch diets of human beings associated with modernisation is probably responsible for lactobacilli dominance through increased vaginal glycogen content, resulting in low pH conditions in their reproductive physiology 24 . The exact reason behind the low microbial diversity in human vaginas, however, are still not well defined. We found reverse association of macaque vaginal microbiota diversity with increasing age and tubectomy contraception. Similarly, studies in pregnant women also demonstrated significantly decreased vaginal microbiota richness and diversity when that of nonpregnant women was compared, supporting potential influence of hormonal changes on the vaginal microbiome 48,49 .
Interestingly, the macaques harboured a group of anaerobic bacteria associated with bacterial vaginosis in woman, such as Gardnerella and Peptostreptococcaceae 50,51 . The role of Prevotella in the vagina is less understood; however, it has been reported that Prevotella can assist the growth of Gardnerella vaginalis and Peptostreptococcus anaerobius by producing key nutrients such as ammonia and amino acids for these pathogens 52,53 . Another   notable observation was the dispersal of periodontal bacteria, such as Fusobacterium and Porphyromonas in macaque vaginal microbial community. Several pathogens within Fusobacterium and Porphyromonas genera have been associated with a variety of oral diseases even cancers in humans 54 . Although a higher diversity of vaginal microbiota may enhance the buffer to protect host, macaques may also suffer bacterial vaginosis or other diseases if these opportunistic "pathogens" dominate the community. In contrast, the lactobacilli dominance, in turn, the low diversity of vaginal microbial community appears to be unique to humans and is hypothesized to benefit women by producing lactic acid to reduce pathogenic invasion 55 . Continuing investigation on non-human vaginal microbial communities may improve our understanding of the "uniqueness" of human vaginal microbiota on both the physiological mechanisms and the broad evolutionary processes. Both macaque oral swabs and human saliva samples shared the same three most predominant taxa: Streptococcus, Haemophilus and Veillonella. The lack of comparable distinction in oral microbial communities between macaque and human relative to the anal or vaginal site could indicate an evolutionary origin that oral microbiota has experienced similar selective pressures since the split of macaques and humans from their most recent common ancestor during million years of evolution 56 . Since the oral cavity functions as the first step in food digestion and is constantly exposed to the environmental selection, the microbiota at this site could be more influenced by generic saccharolytic and/or proteolytic microorganisms that respond to perturbations such as diet patterns, oral hygiene or foreign invasion. For example, the surveyed animals in this study had the chance to obtain food from household waste or tourists, resulting in accompanied oral microbiota responses to diet shift as that of humans. In contrast, host physiological and biological factors, such as body temperature, immunology and salivary content, likely do not exert significant selective forces on the oral microbiota for colonization and diversification compared to flora of the gastrointestinal or genital tracts. This hypothesis could be tested by comparing the oral microbiota in a larger cohort of animal species to determine whether oral microbial communities share more convergent lineages than other body sites. Nevertheless, interplays of oral microbiota with host immunodeficiency, lifestyle and other pathogens may also have detrimental consequences for the homeostasis of community, resulting in differential enrichment of certain bacteria between hosts. For example, there was a marked absence of actinobacterial lineages such as Actinomyces in the macaque oral swabs, which based on human data 57,58 would suggest that they suffer from periodontal disease.
These observations collectively indicate that rhesus macaques can be used as an animal model to study microbiota-associated human diseases; however, experimental outcomes will have to be interpreted with caution as the resident macaque microbiota is dramatically different as their human counterparts. With the growing appreciation for viral-bacterial interactions in the context of human health 59 , precautions should also apply to the study of simian/human immunodeficiency virus in rhesus macaques as they are commonly used as the animal model for research into acquired immune deficiency syndrome 60 .
One of the shortcomings of this study is the lack of metagenomic data to predict the comprehensive metabolisms and functions, although our observation based on the limited reference supports the findings from human and other macaque-associated microbial community surveys. Another challenge in comparing the microbiota between macaques and humans in this study is the data sources with different processing methods, which may present batch effects. In order to minimize study bias, we used a closed-reference approach to cluster 16S reads from different datasets separately, and applied a maximum likelihood phylogenetic algorithm to assign bacteria taxonomy. The rarefaction curve also indicated that the variability was less relevant to batch bias. Indeed, the macaque microbial communities characterized in this study are largely concordant with previous reports from a number of macaque animal groups. The third caveat of the present study is the use of slight different body locations, for example, perianal swabs versus stool samples, to compare anal microbial communities between macaque and human. However, micro-niches from the same body site usually harbour stable microbial communities composed of predominant taxa consistent in healthy individuals 21 . Nevertheless, the present study is to our knowledge the first and largest survey of microbiota on non-captive macaque with samples from paired body sites from the same individuals. Our findings elucidate the diversity and structure of microbiota in an apparently healthy macaque population and defined the "core" indigenous bacterial taxa relevant to body sites and hosts species, which forms a basis for further investigation into the host-microorganism co-evolution and the causality of bacterial pathogens in diseases. The presence of microorganisms related to human pathogens in primate animals suggest that they are a native feature; distinction of the predominant microbiota between macaque and human highlights the need to interpret outcomes of animal model studies with caution.

Methods
Ethics approval and consent to participate. The programme has been approved by the AFCD and pre- After swab collection, animals were vaccinated with parasiticide (Ivomec), antibiotics (Amoxycillin) and a nonsteroidal anti-inflammatory drug (Rimadyl) before release. The programme has been approved by the AFCD and presented to the Animal Welfare Advisory Group (AWAG).
Swab samples were transported to the laboratory in an ice box cooler within 2 hours and stored at −20 °C immediately. For each sample, 200 µl STM were used for DNA extraction using the Qiagen DNA Mini Kit (Qiagen, USA). Purified DNA was eluted in 200 μl elution buffer (pH 8.0) and PCR amplified using primers 515F and 806R, resulting in an approximately 250 bp long amplicon spanning the V4 variable region of the 16S rRNA gene 61 . A pair of dual 12-bp barcodes was introduced to the PCR amplicons from each sample through the forward and reverse primers according to the protocol from the Earth Microbiome Project 62 . Successful amplicons were pooled at approximately equal molar DNA concentrations and sequenced on an Illumina MiSeq (Illumina Inc., USA) at the Weill Cornell Medicine Genomics Resources Core Facility, New York, USA, using paired-end reads. The DNA extraction was processed in a BioSafety Cabinet in a laboratory physically separated from where the PCR amplification was performed to minimize contamination. Negative controls, including STM, DNA extraction, and PCR master mixture were set. All sequence data generated from this survey were deposited in the Sequence Read Archive with accession number PRJNA411767.

Sequence data processing and community composition statistical analyses. Demultiplexed
raw sequences were quality-filtered and clustered into operational taxonomic units (OTUs) using an established bioinformatics pipeline which includes QIIME (MacQIIME 1.9.1-20150604) 63 , PPLACER v1.1.alpha17 64 , USEARCH v9.2.64 65 and in-house developed scripts 66 . Sequences were clustered into OTUs based on a threshold of 97% nucleotide similarity using UPARSE 67 and then assigned taxonomic identities using RDP classifier 68 with a bacterial 16S reference database optimized from SILVA (release 128) 69 , HOMD 70 and NCBI 16S Microbial (ftp:// ftp.ncbi.nlm.nih.gov/blast/db/16SMicrobial.tar.gz). The resulting OTU tables were summarized at the phylum (L2) and genus (L6) levels for alpha and beta diversity analyses using QIIME and R v3.4.0 71 . In order to minimize potential sequence bias due to different lengths and depths, we used a closed-reference approach in QIIME to pick up OTUs for macaque and human bacterial reads separately. Two reference databases dereplicate from V3-V5 reads and V4 reads, including all novel OTUs at 97% dissimilarity, were pre-assigned using UPARSE. These two OTU tables were then merged and summarized at species level and/or above. Singleton reads were removed from the OTU tables.
We used a UniFrac algorithm in 'GUniFrac' R package 72 to calculate unweighted and weighted (alpha = 0.5) pairwise Kantorovich-Rubinstein (KR) distances between samples. The OTU phylogenetic tree was constructed using PPLACER by placing the 16S sequences on a bacterial 16S reference tree to maximize phylogenetic likelihood inferred from a 16S reference alignment. Differences in community composition were assessed using permutational multivariate analysis of variance (PERMANOVA) in the 'vegan' R package and Jensen-Shannon divergence 73 . Principal component ordination analysis (PCA) was used to visualise associations between community composition and experimental factors. In addition, we used a linear discriminant analysis (LDA) for effect size (LEfSe) to determine characteristic microbiota between host species and body sites 74 . Higher LDA scores reflect significantly higher abundances of certain bacteria within the groups of samples. Spearman's rank correlation was used to evaluate co-occurrence of taxa based on relative abundances. Comparisons of characteristics OTUs between sample groups were performed using non-parametric Mann-Whitney Wilcoxon rank-sum test or chi-square test. A two-sided p value of < 0.05 was considered statistically significant.
Functional prediction based on community composition. Functional profiles of the macaque and human microbial communities were predicted using PICRUSt v1.1.1 75 based on community composition and represented as KEGG Orthology (KO) counts 76 . These counts were summarized into KO hierarchies and then normalized for library size and variance stabilization-transformed using 'DESeq. 2' R package 77 . Differences in transformed counts between sample groups were analysed using STAMP 78 . Metabolic pathways that discriminated between host species and/or body sites were identified using sparse partial least squares discriminant analysis (sPLSDA) implemented in the 'mixOmics' R package 79 .

Data Availability
All 16S sequence data generated from this survey were deposited in the Sequence Read Archive with accession number PRJNA411767. The human microbiota 16S data were downloaded from the NIH Human Microbiome Project (PRJNA48489) (https://www.hmpdacc.org/hmp/HMR16S/).