Metagenomic profiles of the early life microbiome of Indonesian inpatient neonates and their influence on clinical characteristics

Determining the initial normal neonatal gut microbiome is challenging. The debate regarding the sterile fetal environment is still ongoing. Therefore, studying and comparing normal and dysbiotic microbiomes requires the elucidation of both the fetal and infant microbiomes. Factors influencing the normal microbiome also include regional and genetic factors specific to different countries. Determining the normal microbiome population in our center and their association with the clinical conditions of infants is helpful as a tool for both the prevention and treatment of related diseases during neonatal care. Here, we employed metagenomic sequencing to characterize meconium and the subsequent early-life gut microbiome of preterm neonates in Jakarta, Indonesia. Microbiome diversity and complexity was higher in the meconium and on day 4 than on day 7. At the genus level, the most abundant genus overall was unidentified Enterobacteriaceae, with meconium samples dominated by Ureaplasma, day 4 fecal samples dominated by Staphylococcus, and day 7 samples dominated by Clostridiales, while at the phylum level the most abundant was Proteobacteria and Firmicutes. Perinatal factors of PROM and mother’s diet influenced the meconium microbiome, while day 4 and day 7 microbiome was associated with bacteremia and early administration of antibiotics. One of our sample sets was derived from triplets, and they had varying diversity despite being triplets. These data are valuable for understanding the formation of a healthy microbiome specific to neonates and devising a strategy to improve both the gut health and related clinical outcomes of the neonate.


Materials and methods
Study design, participants, and fecal samples. This prospective cohort observational study included eight inpatient neonates born from February to June 2020 at Cipto Mangunkusumo Hospital (CMH), Jakarta (Indonesia). Fecal samples, i.e., the meconium and the day 4 and day 7 fecal samples, were collected by CMHtrained laboratory personnel from all neonates.
We excluded neonates with signs of hemolytic anemia caused by ABO incompatibility (based on blood grouping, Coomb's test, and decreased hemoglobin levels), cephalhematoma, or major congenital malformations assessed through clinical and laboratory examinations. Subgroups were then formed for further analysis between variables based on the number of patients with specific variables.
Relevant clinical data recorded for each neonate, such as delivery mode, feeding pattern, gestational age, necrotizing enterocolitis (NEC) status (based on radiologic and clinical findings), and sepsis status (proven through positive blood culture), are detailed in Table 1.
The meconium samples and the day 4 and day 7 fecal samples from the same inpatient neonates born at CMH were collected from diapers and put into sterile tubes, which was performed similarly throughout the study. The fecal samples were stored in a cooler box at 4 °C before being transported to Universitas Indonesia, Depok, on the same day. All samples were then kept in a refrigerator before processing for DNA extraction.
All experimental protocols were approved by the ethics committee of the Faculty of Medicine, Universitas Indonesia. Informed consent was obtained from all subjects' legal guardian(s).
Preparation of DNA for next-generation sequencing. The DNA from meconium samples and fecal samples was prepared on the same day after collection and treated to obtain the optimum yield of high-quality DNA for the NGS process. This step was carried out as elaborated by Rulita et al. 15 .
Amplification of the V3-V4 region of the 16S rRNA (16S) gene using the specific primers forward 515F and reverse 806R and subsequent NGS were carried out by Novogen Co., China.
Bioinformatic data analysis. To analyze the 16S rRNA gene sequences, bioinformatic analysis was performed by Novogen Co., China, as follows.
Sequencing data processing. Paired-end reads were assigned to samples based on their unique barcodes and truncated by trimming the barcode and primer sequences. Paired-end reads were merged using FLASH (version 1.2.7) 16 (for details, see http:// ccb. jhu. edu/ softw are/ FLASH). Quality filtering of the raw tags was performed under specific filtering conditions to obtain high-quality clean tags 17 according to the QIIME (version 1.7.0) 18 (for details, see http:// qiime. org/ scrip ts/ split_ libra ries_ fastq. html) quality control (QC) process.
The tags were compared with the reference database (Gold database, for details, see http:// drive5. com/ uchime/ uchime_ downl oad. html) using the UCHIME algorithm (for details, see http:// www. drive5. com/ usear ch/ manual/ uchime_ algo. html) 19 to detect chimera sequences (for details, see https:// drive5. com/ usear ch/ manual/ chime ras. html); the chimera sequences were then removed 20 . Finally, the effective tags were obtained. OTU clustering and taxonomic annotation. Analysis  To study the microbial community composition, the bioinformatics data were first analyzed by applying a QIIME QC process to obtain clean data so that the results would be more accurate and reliable. The clean data were then processed by clustering based on 97% identity for the effective tags of all samples. According to the OTU clustering results, taxonomic annotation was performed for the representative sequence of each OTU to obtain the corresponding taxon information and taxon-based abundance distribution.
To obtain the phylogenetic relationship of all OTU representative sequences, MUSCLE 24 (version 3.8.31, for details, see http:// www. drive5. com/ muscle/) was used to rapidly compare multiple sequences. OTU abundance information was normalized using a standard sequence number corresponding to that of the sample with the fewest sequences. Subsequent analyses of alpha diversity and beta diversity as higher-level measures were all performed based on these output-normalized data.
In addition, based on the OTU clustering results, we performed multiple sequence alignments and established phylogenetic trees. The differences between subjects' fecal samples or among groups regarding the structure of the microbial community were explained via dimension reduction (principal coordinate analysis (PCoA), principal component analysis (PCA), and NMDS) and the unweighted pair group method with arithmetic mean (UPGMA). Statistical methods such as T-test, MetaStat, LEfSe, analysis of similarity (ANOSIM), and multiple response permutation procedure (MRPP) were used to test the significance of the microbial community structure differences among groups. Moreover, the results were subjected to canonical correlation analysis (CCA)/ redundancy analysis (RDA) to explore the major environmental factors.
Complexity of biodiversity and differences in species complexity. To analyze the complexity of the biodiversity of the microbiome, we determined the alpha diversity for a sample through calculation of several www.nature.com/scientificreports/ indices, including the observed species, Chao1, and Shannon indices. All these indices were calculated with QIIME (version 1.7.0) and displayed with R software (version 2.15.3) 25 .
To evaluate differences among samples in species complexity, we applied beta diversity analysis. Beta diversity based on both weighted and unweighted UniFrac distances was calculated by QIIME software (version 1.7.0).
Furthermore, cluster analysis preceded by PCA was applied to reduce the dimensions of the original variables using the FactoMineR (http:// facto miner. free. fr) package and ggplot2 package in R software (version 2.15.3) 25 . PCoA was performed to obtain principal coordinates and visualize complex, multidimensional data. A distance matrix of weighted or unweighted UniFrac distance among samples obtained in previous analysis was transformed to a new set of orthogonal axes, by which the maximum variation factor was demonstrated by the first principal coordinate, the second maximum variation factor was demonstrated by the second principal coordinate, and so on. PCoA results were displayed by the WGCNA, stat and ggplot2 packages in R software (version 2.15.3) 25 . UPGMA clustering was performed as a type of hierarchical clustering method to interpret the distance matrix using average linkage and was conducted by QIIME software (version 1.7.0).
MetaStat analysis was conducted by R software 25 . The P-value was calculated by the permutation test method, while the q-value was calculated by the Benjamini and Hochberg false discovery rate method 26 . ANOSIM, MRPP and Adonis were performed by R software (Vegan package: the ANOSIM function, MRPP function and adonis function, respectively) 25 . Analysis of molecular variance (AMOVA) was calculated by Mothur using the AMOVA function. T-tests and drawing were conducted by R software 25 . Quality control and limitations. Each step, such as sample testing, PCR, purification, library preparation and sequencing, affect the quality and quantity of data, while the data quality directly affects the subsequent information analysis results. To maintain the accuracy and reliability of sequencing data, QC was performed at each step of the procedure.
In our study, there were only limited samples (24 DNA samples obtained from 8 subjects) for NGS profiling; thus, statistical bias was possible.
Ethical approval. All methods were carried out in accordance with relevant guidelines and regulations.
We confirm that all experimental protocols were approved by the ethics committee of the Faculty of Medicine, Universitas Indonesia.
Informed consent. We confirm that informed consent was obtained from all subjects' legal guardian(s).

Results
Subjects, fecal samples, and clinical characteristics of neonates. All subjects were preterm inpatient neonates born in CMH, Jakarta. All fecal samples were collected from those eight subjects by performing the same procedure at three time points, i.e., meconium (24-48 h) and feces on day 4 and day 7, providing 24 samples. The subjects consisted of an equal number of male and female neonates who were late preterm and very preterm neonates, with gestational ages ranging from 30 to 34 weeks and birth weights ranging from 1080 to 1900 g. All subjects were born via C-section, except one born spontaneously; subject AJM. The frequencies of the feeding patterns breast milk only and mixed feeding were almost equal (5-3, respectively). There were more neonates who did not receive antibiotics than those who did, while during pregnancy, only two out of 8 subjects' mothers received antibiotics. Total serum bilirubin (TSB) levels were all within the normal limit. These characteristics are shown in Table 1.
We also collected the dietary records of the mothers and found that the overall dietary pattern lacked dairy intake. The complete dietary pattern of the mothers is summarized in Supplementary Table 1.

Microbial community composition.
In the process of constructing OTUs for studying the microbial community composition, basic information about different samples was collected, such as effective tag data, lowfrequency tag data and tag annotation data. A summary of this information is shown in Supplementary Fig. 1.
The OTUs composing the microbial communities of fecal samples from the three time-point groups and each subject at the phylum and genus levels are summarized in Fig. 1. As observed in all groups and subjects, the order of abundance at the phylum level was Proteobacteria followed by Firmicutes (Fig. 1A1,A2), which agrees with previous reports [27][28][29] . At the genus level, unidentified Enterobacteriaceae was the most abundant ( Fig. 1B1,B2). However, the second most abundant genus among these OTUs could not be determined, although Staphylococcus showed intriguing abundance in the fecal of one subject, ADS(4). In total, only 4 fecal samples from 2 subjects showed the presence of Staphylococcus, i.e., MIR(M), ADS(M), ADS(4), and ADS(7). Heatmap of 35 genera result gave great supportive data for OTUs analysis result at Genus level as presented in Fig. 1. It is interesting that relative abundance of Ureaplasma was found in CIT meconium, which disappeared in day 4 and day 7 fecal samples, which indicating the influence of maternal factor. Other interesting genus, Lactobacillus, was found in MIR day 4 fecal but was unable to develop as shown in day 7 feces.
Taxonomic annotation at the kingdom through the genus level of all subjects at the three time points is shown with column graphs in Supplementary Figs. 2-7. Taxonomic annotation at the kingdom through the species level for the time points was visualized with phylogenetic trees, as shown in Supplementary Figs. 8-10. Taxonomic annotation for the 50 highest abundance OTUs in every time-point group is summarized in www.nature.com/scientificreports/ Microbiome diversity and abundance in feces of the three time-point groups. The richness and evenness information about the microbiome diversity across three time-point groups, and common and unique OTU information among different groups were obtained by using alpha diversity assessment. This measure allowed us to assess a broader change or difference in the composition of microorganisms for describing the microbiome. Based on alpha diversity assessments data performed, i.e., observed species, Shannon diversity, and chao1 diversity, as well as Faith's Phylogenetic Diversity across three-time points groups (meconium, day 4 and day 7) as presented in Fig. 2, significantly different composition of microorganisms for describing the microbiome between groups was not observed. It demonstrated that the pattern of early life microbiome development exhibited similar species richness and evenness in three-time points groups, although with some exceptions for unhealthy neonates represented by outliers' value. www.nature.com/scientificreports/ www.nature.com/scientificreports/ A rank abundance curve was made to sort the OTUs in the sample by relative abundance (or the number of sequences included) from greatest to least to obtain the corresponding sorting number, which can be used as the ordinate. The observed OTUs, Chao1 index, Shannon index, and Faith's phylogenetic diversity (PD) were generated from the alpha diversity assessment with boxplot visualization, as shown in Fig. 2. There were outliers in the alpha diversity indices observed species ( Fig. 2A) on day 4, Chao1 diversity (Fig. 2C) in M and on day 4, and Faith's PD (Fig. 2D) in M and on day 7.
We produced alpha rarefaction curves to indicate whether the biodiversity of microbiome samples reflects the representativeness of the sequencing data directly and reflects the richness of the microbial community in the samples indirectly. The curve flattened ( Supplementary Fig. 11), indicating that a reliable number of samples was used, which means that only scarce species remained to be sampled.

Microbiome composition changes in feces at the three time points. Beta diversity measures were
assessed to capture changes in microbiome community composition among the groups. The dissimilarity among groups was calculated with gradient analysis and displayed with ordination plots (PCA, PCoA, etc.). Samples with similar microbial community structures tended to cluster, and vice versa. Here, we used the PCoA method with weighted UniFrac and unweighted UniFrac distances and visualized the results as a graph, as shown in Fig. 3.
Furthermore, samples were clustered by UPGMA based on the acquired distance matrix to construct a cluster tree ( Supplementary Figs. 12, 13). Thus, with the UPGMA tree, which considered high-abundance taxa to achieve the underlying driving factors, we were able to determine the complexity differences between day 4 and day 7, which were fewer than those between M and either postnatal group.
To learn more about the development of the microbiome community over the first week in newborn preterm neonates' feces based on their changes and differences, we used ANOSIM. As a nonparametric test, ANOSIM could evaluate whether the variation in the feces microbiome among time-point groups was significantly larger than the variation within groups, i.e., early, day 4, and day 7; this information helped us evaluate the reasonability of the division of groups.
Our results showed a negative R-value, which suggested that intragroup variation was larger than intergroup variation. Therefore, there were no significant differences between early (M) and day 4 (Fig. 4A) or day 4 and day 7 (Fig. 4B), with P = 0.812 and P = 0.455, respectively. Boxplots based on the rank (between-group  www.nature.com/scientificreports/ and within-group) of the ANOSIM results are shown in Fig. 4. The rank was obtained from the sorted distance between groups. A Krona plot displayed the microbiome diversity and proportion in the feces of subject MIR at the three time points, is shown in Fig. 5.
The microbiome diversity and proportion of these triplets are shown as circular representations by Krona displays in Supplementary Fig. 14. The development of the feces microbiome of the triplet subjects from meconium, day 4, and day 7 with their proportion is shown in Fig. 6.

Discussion
This study shows the development and diversity of the gut microbiome in 24 samples derived from the feces samples of eight preterm neonates born at CMH, Jakarta, at 3 different time points in early life. The analysis of gut microbiome development over the first week based on alpha and beta diversity in terms of relative abundance showed that for the 8 subjects providing 24 feces samples at the three time points, i.e., meconium, day 4 feces, and day 7 feces, Proteobacteria and Firmicutes were dominant at the phylum level, while at the genus level, unidentified Enterobacteriaceae were most abundant, as has been reported widely 1,6,7,9,10,[29][30][31][32][33][34] . This result is similar to our previous study on cultivable bacteria from meconium from neonates born in CMH 9 . The microbiomes of meconium, as the first feces in this study, were relatively similar, with one subject showing a high abundance of the genus Ureaplasma and the rest showing a high abundance of Enterobacter.
The meconium microbiome shows different progression and development in preterm neonates, as reported, even among triplets, as revealed in this study 5,11,27,30,31,35,36 . The meconium microbiome from our study was not significantly affected by perinatal factors. The neonatal gut microbiome is known to begin colonization in utero and is affected by genetic and maternal factors during pregnancy, including chorioamnionitis 13,37,38 .
Raw data were processed to obtain clean data prior to OTU analysis of paired-end sequencing reads and then merged by using FLASH, a very fast and accurate analysis tool, which was designed to merge paired-end reads when at least some of the reads overlapped the reads generated from the opposite end of the same DNA fragment, and the splicing sequences were called raw tags. Based on the OTU analysis of the microbial composition ( Fig. 1), one patient (CIT) showed Ureaplasma as the most abundant taxon in the meconium (Fig. 1B2), which can be explained by the fact that the mother had premature rupture of membranes (PROM) over 18 h, which is a risk factor for chorioamnionitis (Table 1). Moreover, OTU analysis of the ADS and SRI samples showed proportional diversity among each abundant genus (Fig. 1B2). These two patients' mothers had leukocytosis as a risk factor for chorioamnionitis but without PROM. This observation could be explained by how the microbiome can be directly affected by chorioamnionitis, specifically by the incidence of PROM, not by other risk factors such as only leukocytosis. It is shown that PROM could directly expose the fetal digestive system to foreign bacteria during systemic infection, whereas the risk of direct contamination is far less [1][2][3][4][5]29,30,39 . This emphasizes the importance of maintaining infection prevention during pregnancy but, more importantly, prompt delivery when indicated for PROM cases.
Diversity of the meconium microbiome reflects the overall quality of the fetus and the mother 4,5,29,38 . In this case, MIR was born the most maturely (at 34 weeks, see Table 1) among the subjects, with an optimum Apgar score and no risk of chorioamnionitis. This background is in accordance with having a diverse meconium microbiome, showing the importance of maintaining both a healthy mother and fetus, which will affect the immediate future of the neonate upon birth, especially regarding feeding tolerance and response to infection 4,9,30,35 .
Subject MIR was the neonate with the best conditions not only prenatally but also postnatally. Veillonella spp. were detected in this subject's microbiome at day 7. Veillonella spp. are part of several normal human microbiome, including the fecal microbiome of healthy adults, and are predominant in the upper digestive tract 40,41 . Variations in the counts of Veillonella spp. have previously been noted depending on diet, pathology, or antimicrobial treatment 42,43 . Veillonella may play a role in gut microbiome development at an early age since it has been shown to appear in neonate feces in the first days of life, with a significantly higher count in bottle-fed than in breast-fed children [42][43][44] .
The pattern of development began with Enterobacter, which was detected initially in meconium, and then changes depending on the environmental factors. Antibiotics were administered due to the high risk of earlyonset neonatal sepsis that occurs in preterm neonates affected by perinatal factors 45,46 . The regimen given is usually based on the microbe map of the particular center. In our study, 6 out of 8 patients were given first-line antibiotics (ampicillin and gentamicin). Only three of these patients developed a non-Enterobacter-dominant microbiome on day 4 or day 7, which shows that we were unable to conclude whether early life administration of first-line antibiotics have a significant effect on the early life gut microbiome from our study. One interesting case was with triplet patients, i.e., JWR1 (first triplet), JWR2 (second triplet) and JWR3 (third triplet) which had varying populations of microbiome. Two of the triplets had non-Enterobacter-dominant microbiomes. From a blood culture of the first triplet (JWR1), Escherichia coli was detected; however, the microbiome was dominated by the phylum Citrobacter. The third triplet (JWR3) had Acinetobacter and Clostridium as the dominant microbiome constituents at day 4 and day 7, respectively, with no blood culture growth. All three of the triplets had feeding intolerance, with the worst occurring in the third triplet (JWR3) (Supplementary Fig. 14). This result differs with a previous study comparing dichorionic triplets which showed a significantly strong similarity of microbiome between triplets 47 . This disparity can be due to the smaller triplet in which reflects a worse state of prematurity including the intestinal permeability, therefore directly affecting the early life gut microbiome.
The direct effect of bacteremia on the gut microbiome is significant, especially in cases of late-onset neonatal sepsis; in addition, neonatal sepsis will detrimentally impact the already premature immune system and contribute indirectly to the abundance of non-commensal bacteria in the gut, such as Acinetobacter, Citrobacter, www.nature.com/scientificreports/ and Clostridium 48,49 . The administration of antibiotics, especially those that target anaerobes, can also eliminate commensal normal flora bacteria and overgrowth of pathogenic bacteria such as Clostridium difficile, thus causing antibiotic-associated diarrhea 50 . This effect usually occurs after 14 days of antibiotic administration. However, in cases of under intensive care and immunosuppression, as well as a premature gut microbiome, gut dysbiosis can occur much earlier 28,30,33,51,52 . Our study showed that the triplets who received the same antibiotics, 2 of the triplets (JWR2 and JWR3) showed varying microbiome populations, most likely due to worsening bacteremia. Another interesting finding is the diversity among the feces microbiome from the three time points. The diversity lessens between each time point; however, the samples shared the same species at certain amounts, in addition, complexity of day 4 and day 7 is less than that of meconium compared to day 4 and day 7. As visualized by a Venn diagram ( Supplementary Fig. 15), the meconium microbiome exhibited the same species richness and evenness in those subjects, which may be considered consistent intestinal homeostasis in preterm neonates. For example, microbiome development in the case of the triplets (JWR1, JWR2 and JWR3), from early life to one week later, showed that the microbiome shared common microbiome population early life (meconium) and at day 4, while that at day 4 shared common microbiome population with that at day 7 and influenced by nutrition intake, as summarized in Fig. 6.
This observation can be explained by the mechanism of microbial maturation and colonization 2,11,13,53 . Prenatally, the gut is heavily and dominantly influenced by the genetic determinants, only after the postnatal period, environmental factors including those that affect the brain development confers the dominant influence 2,29,30,33 . The postnatal developmental phases of the gut and brain are interdependent and occur separately. The developmental interdependency between the gut and brain is due to the complex, bidirectional communication, incorporating neural, endocrine, immune, and metabolic mediators termed GBA 54 . Several ways the GBA works if via both afferent and efferent autonomic pathways with different intestinal targets such as enteric nervous system (ENS), muscle layers and gut mucosa, modulating motility, immunity, permeability, and secretion of mucus; which results in the development and alteration of the gut microbiome 2,29,30,33,54 . During the process on the 4 th day, exposure of the neonate to the environment, intake of medicine and enteral/parenteral nutrition, and influences of insults that affect the innate immune system shifted the microbiome into a less diverse population. By the 7th day, the microbiome further shifted into specific populations, albeit the shift was not as significant as the shift from meconium to feces on the 4th day. There were some differences in colonization, possibly due to the comorbidities of the neonates, such as bacteremia and antibiotic administration. However, the detrimental dysbiosis effect due to antibiotic use usually occurs after administration for longer than 14 days; therefore, in our case, a more likely explanation is based on bacteremia, neonate prematurity, and lack of enteral supply of probiotics (e.g., mother's milk) 3,13,33,38,[55][56][57] . Despite observing differences among the time points, the differences were not statistically significant. Meconium and day 7 feces should display tremendous differences, but the difference had a P-value of only 0.612. This may be due to the small number of samples not being sufficient to perform an optimal statistical analysis.   Fig. 14). The proportion (percentage) of the same microbiome diversity is summarized as a horizontal bar, and the matrix is underneath (below). www.nature.com/scientificreports/ Surprisingly, we did not find any Bifidobacterium or Lactobacillus abundant in our study. Maternal, perinatal, and postnatal factors should influence the normal microbiome. From the food survey of the mothers, we found that all the mothers' dairy consumption was lower than recommended (Supplementary Table 1). Bifidobacterium and Lactobacillus are acquired mainly from intake of dairy products. The abundance among the maternal population of those microbiome constituents greatly influences the fetal microbiome 58 . Even by the 7th day, the microbiome population of these neonates did not reach adequate abundance of Bifidobacterium and Lactobacillus, mainly due to both the lack of dairy consumption and postnatal exposure to probiotics, which are usually acquired from mother's milk. This phenomenon should be further studied to determine whether we can optimize the neonate's innate immune response and early tolerance to feeding by inducing earlier colonization of beneficial gut microbiome constituents using enhanced preterm formula milk or by any other means.
The limitation of our study, first and foremost, is the sample collection method. We were unable to collect fresh meconium samples to prevent contamination. The reason for this is that the method to obtain fresh meconium in preterm neonates requires administration of an anal spool using saline or glycerol. This technique is potentially harmful due to the nature of preterm gastrointestinal function in which peristaltic movement is usually delayed; therefore, collecting the sample directly and not from the diapers is of utmost difficulty if not actively monitored. The results were still within acceptable limits, whereas contamination would yield lower abundance from the DNA sequencing and would still yield a positive result, but the most dominant genus would still not be from the contamination source. Our second limitation is the small number of samples. We originally opted to collect more samples; however, this work serves as a preliminary study on how the microbiome of preterm neonates in our center is established. We deem the results of the metagenomic analysis to be acceptable for use as the basis for future studies.

Conclusion
A myriad of factors influences the development of the neonatal gut microbiome, including maternal, perinatal, and postnatal factors. Indonesian pregnant women have markedly low consumption of dairy products, which results in decreased populations of Bifidobacterium and Lactobacillus. The most abundant genus was unidentified Enterobacteriaceae, with slight variations in specific samples dominated by Clostridiales and Ureaplasma, and Proteobacteria and Firmicutes showed dominance at the phylum level. Perinatal factors showed influence on the meconium microbiome including PROM and mother's diet, in which populations of Bifidobacterium and Lactobacillus was not found, instead Ureaplasma was one of the dominant populations in the patient with mother's history of PROM. Notable associations between early life fecal microbiome and clinical outcomes include administration of early antibiotics and bacteremia, with the ensuing microbiome population to show less diversity. The development of the preterm neonate gut microbiome from meconium to day 4 and day 7 showed a pattern of change in diversity as time progresses, but a similarity with the meconium microbiome was maintained, and with decreasing complexity. These data emphasize the need for specific evaluation and management of dysbiosis and their related clinical outcomes in different populations.

Data availability
The datasets used and/or analysed during the current study is available from the corresponding author on reasonable request. www.nature.com/scientificreports/