Using custom-built primers and nanopore sequencing to evaluate CO-utilizer bacterial and archaeal populations linked to bioH2 production

The microbial community composition of five distinct thermophilic hot springs was effectively described in this work, using broad-coverage nanopore sequencing (ONT MinION sequencer). By examining environmental samples from the same source, but from locations with different temperatures, bioinformatic analysis revealed dramatic changes in microbial diversity and archaeal abundance. More specifically, no archaeal presence was reported with universal bacterial primers, whereas a significant archaea presence and also a wider variety of bacterial species were reported. These results revealed the significance of primer preference for microbiomes in extreme environments. Bioinformatic analysis was performed by aligning the reads to 16S microbial databases for identification using three different alignment methods, Epi2Me (Fastq 16S workflow), Kraken, and an in-house BLAST tool, including comparison at the genus and species levels. As a result, this approach to data analysis had a significant impact on the genera identified, and thus, it is recommended that use of multiple analysis tools to support findings on taxonomic identification using the 16S region until more precise bioinformatics tools become available. This study presents the first compilation of the ONT-based inventory of the hydrogen producers in the designated hot springs in Türkiye.

Our data is presented using a sequential approach, which encompassed the steps of sample gathering and processing.Biohydrogen production capacities and SEM images are also presented to evaluate the sequencing results, and the practical outcomes.
According to taxonomic classification data obtained from all hot springs, bacteria (95%) with 6,376,218 reads, and archaea with 85,786 reads (1%) were classified as superkingdom in the taxonomic classification.
Three different tools were used to compare the samples analyzed using archaea-specific primers.Overall, the analysis of Doğanbey samples indicated that the choice of primer pair has a significant impact on the range identified microorganisms and their abundances.Interestingly, when analyzed using Kraken (version number,citation), upstream samples showed no significant change with use of archaea primers at domain or phylum levels (Fig. 1).Crassaminicella, Paenibacillus and Pseudothermotoga at genus level were reported.Epi2Me (v3.6.2) tool also showed similar results.Using universal primers, Nitrosopumilus was the only archaeal genus detected at Doğanbey upstream, with no sign of archaea in the other locations.However, within all Doğanbey samples, the use of the archaea primers resulted in increased abundance of this genus, as well as the discovery of other archaea species.Newly detected archaea genera were Methanothermobacter for the upstream, and for the mid and downstream regions, Nitrososphaera, Archaeoglobus, Methanospirillum (Fig. 2).The in-house BLAST tool was used as an alternative for identification of genera (Figs. 3, 4).Using this tool obtained a similar microorganism profile as found by using Kraken and Epi2Me.In all samples analyzed using universal primers, reads were much more aligned to Novoshingobium species than in those analysed using archaea-specific primers.Moreover, using archaea-specific primers, the archaea genus Haloferax, rather than Nitrososphaera was identified as the dominant genus.
In-house BLAST classifications were compared to Kraken and Epi2me platforms, and indicated that these three different platforms were compatible, and result in very similar performances of bacteria as far as genus level classification.However, in species level classification, there are few differences, concerning most abundant species in the samples.For example, in Doğanbey downstream sample (E in Figs. 1, 4), the most abundant species, according to Kraken classification, is Oleiphilus balneatrix, and according to in-house BLAST classification, it is Novosphingobium aromaticivorans.
Based on the species obtained as a result of 16S rDNA metagenomic sequencing using universal primers depicted in Fig. 5, Doğanbey hot spring hosts dominantly Novospingobium species belonging in Proteobacteria genus, which is frequently encountered in thermal water resources 18 .Novospingobium sp., which live in milder temperature conditions (30-45 °C), also dominate the Çeşme hot spring community 19 .This was an expected finding considering the temperature of the hot spring was measured as 38-43 °C.
Among the 5 hot spring water communities, the archaea population was most evident in the Doğanbey thermal spring, from which more than 80 k reads were obtained.Archaea population in the hot spring was determined as a result of 16S analysis, and the three most common archaeal species were Nitrosopumilus ureiphilus, Nitrosopumilus maritimus, Nitrososphaera viennensis.
Only 4% of the reads could not be classified.Novosphingobium species capable of degrading aromatic compounds were found to be dominant (17%), but these bacterial species are moderately thermophilic (37-45 °C) 20 and aerobic, and therefore not considered as one of the species of interest in terms of H 2 production within the scope of the study.
Figure 6 demonstrates the diversity of the bacterial and archaeal specimens at the genus level, highlighting the variation of distinct organisms in three different hot springs (Bergama, Çeşme, Doğanbey) using both isolated directly from the spring water and cultivated isolates amplified with universal bacterial 16S rDNA primers.
Previously, reports were made of the presence of carboxydotrophic hydrogenogenic organisms in different hot springs of Izmir province: these were Methanosarcina acetivorans 24 , Thermincola ferriacetica 25 , Moorella thermoacetica 26 , Thermosinus carboxydivorans Nor1 and Thermosinus carboxydivorans [27][28][29] , Carboxydothermus pertinax 30 , Carboxydocella ferrireducens and Carboxydocella thermautotrophica and Carboxydocella spp. 31,32, Caldanaerobacter caldanaerobacter subterraneus and Caldanaerobacter thermoanaerobacter sp._RH0804 [33][34][35] with low abundances, with the most dominant being M. stamsii by % 0.254945 presence 26 .Consequently, in our www.nature.com/scientificreports/study, the reported low abundances of these microorganisms were in agreement with the known rarity of carbon monoxide (CO)-oxidizing and H 2 -producing microorganisms in nature (< 0.1%), and these are usually found in dormant state, associated with little to zero growth [36][37][38] .In order to test their capacity to convert CO into H 2 , the www.nature.com/scientificreports/enrichment of these CO-utilizing consortia was experimented under carboxydotrophic growth-specific media and CO gas as the substrate, suitable growth conditions were supplied for thermophilic hot spring microorganisms to cultivate under lab conditions, increase their abundances in the culture and produce hydrogen using necessary enzymes in their metabolisms.A significant increase in the genus of Moorella was achieved, with their abundance reaching up to 5% in cultivated samples, Caldanoerobacter species up to 0.6% however Carboxydocella and Carboxythermus genuses reached only 0.02% and 0.01% in abundance, respectively (Fig. 6).A comparative analysis of Doğanbey isolates cultured on different gas substrates showed a decrease in the number of species in the enriched mixed culture in lab conditions, and a significant difference between growth in syngas (see "Methods" section) and 100% CO.Caloramator sp., the species of greatest interest for hydrogen production in culture, is unable to grow in syngasfed culture medium (less than 5% in abundance) and thus allowing the dominant growth of Novosphingobium (up to 60%), but grows dominantly in 100% CO gas-fed culture medium (up to 50%), with the remaining culture consortia consisting mainly of Novosphingobium aromaticivorans (38%) and Moorella glycerini (6%) (Fig. 6).Environmental and chemical growth conditions were adjusted for the thermophilic hot spring microorganisms to cultivate them under lab conditions.These conditions were applied in order to increase their abundances in the culture and simultaneously produce hydrogen with the help of the relevant enzymes in their metabolisms.Microorganisms were grown until hydrogen production was detected representing the end of their logarithmic phase as shown in our previous study 39 .The comparative results of taxonomic profiling in direct water isolations and in cultivated samples demonstrated that bacterial species belong to the Firmicutes genus Anoxybacillus and Caloramator alongside other thermophilic carboxydotrophic hydrogenogens (Moorella stamsii, Tepidimicrobium ferriphilum, Thermodesulfovibrio, Geobacillus, Thermosediminibacter) [40][41][42] have been dominantly observed, and their abundances increased after being taken into anaerobic and thermophilic culture medium and feeding www.nature.com/scientificreports/with 100% CO gas.Hydrogen production and CO utilizing activities of the cultivated samples were reported in our previous study 39 and the results indicated that the presence of thermophiles together with anaerobes in the microbial community constructed an adequate mixed culture consortium for efficient hydrogen production.

Scanning electron microscopy (SEM) imaging of cultivated hot spring isolates
SEM imaging of 3 different H 2 -producing hot spring microbial communities showed that these communities consist of bacilli and cocci-shaped bacteria of various sizes and morphological features (Fig. 7).A dense population of a single bacilli-shaped bacteria 3.814 µm long and 362.8 nm wide was observed in Doğanbey.This single thermophilic and anaerobic microorganism showed significant morphological similarity to the genera of Thermoanaerobacter, Carboxydocella, and Bacillus associated with CO conversion to hydrogen and organic acids 43,44 .Çeşme samples exhibited mainly cocci-shaped bacterium with sizes ranging from 1.60 to 2.127 µm in length and 307.6 to 620.1 nm in width.In Bergama samples, both bacilli and cocci-shaped bacterium were observed simultaneously; the Bacilli-shaped bacteria were 1.850 µm in length and the cocci-shaped bacterium were 344.3 to 455.3 nm in width.A recent publication by the authors reported the biohydrogen production capacities of the isolated microorganisms from the hot springs of Izmir province; the highest was found in Bergama, with a yield of 0.18 ΔH 2 (mmol)/ΔCO (mmol), followed by Doğanbey (0.13 ΔH 2 (mmol)/ΔCO (mmol)) and Çeşme (0.12 ΔH 2 (mmol)/ ΔCO (mmol)) 39 .

Discussion
The abundances of microorganisms were determined by nanopore sequencing, and the findings were compared in accordance with the comparative bioinformatic analysis methods, sampling sites, sequencing primer selections, and culturing techniques.www.nature.com/scientificreports/Following the quality check and barcoding, the reads were aligned to 16S microbial databases for identification using three different alignment tools, Epi2Me (Fastq 16S workflow), Kraken and in-house BLAST tool.A comparison of the differences at genus and species levels shows that there is agreement between these tools at genus level classification, but this agreement disappears at species level in all cases.While Kraken identifies Oleiphilus species among samples in mid-and low-course samples, the other tools would align these reads to Marinobacterium.It is important to note that both belong to the same Oceanospirillales order.On the other hand, unlike the other two tools, in-house BLAST showed a significant proportion of reads (up to 15%) aligning to Haloferax sulfurifontis in samples studied with Archaea-specific primers.Such differences in the three analysis tools we have employed suggests that the way the data is analyzed has a significant impact on the species level identification, however, shows a significant compatibility in identified genus of hot spring microbial communities.These differences in species level identifications might be due to the fact that the 16S region where classification is based, is too short a stretch of DNA, resulting in significant alignment score fluctuations by the algorithms, but it is also very likely to be due to the differences in the available databases they utilize.For that reason, until more precise tools are developed, use of more than one analysis tool is beneficial to support findings on species level taxonomic identification using the 16S region.Cumulative contribution of the scientific community to excel sequencing analysis of microbiota may eventually result in one of these or many other tools to be commonly accepted, however, it should be remembered that the methods and databases utilized by different tools may be more sensitive over the others in particular conditions or sample sources.For this reason, it may not be possible to claim any tool to be superior to others and the use of multiple tools will remain the optimal approach.The problem with such an approach is how to evaluate the disagreements.While different approaches may be utilized to overcome this, the choice is still a question of statistics and beyond the scope of this work.
Besides the choice of tool, there were significant differences between the results obtained using different primer pairs.Accordingly, the archaea-specific primers classified both archaea and bacteria, while universal bacterial primers showed fewer reads aligned to any archaeal species.It is crucial to note that to generate the archaea-specific primers, only archaea genomes were used to identify a consensus sequence, and primers were selected at highly conserved regions.Despite this, the bacterial sequences made up the greater portion of the reads, even with archaea-specific primers.This may indicate that, despite the PCR bias for archaeal species, the abundance of bacteria may be well above the archaea species in the sample and make up more than the abundance observed by the archaeal species.Supporting this, universal primers resulted in a marginal number of reads aligned to the archaea.Read numbers are a result of PCR amplification thus not represent the reads obtained directly from microorganisms and this PCR bias could always effect the abundance of microorganisms frequency.For this study to minimize the PCR bias we have implemented PCR with minimum cycles to obtain target DNA concentration.
Despite a general agreement between the studies at the genus level and above, there is no agreement at the species level.Even in pure culture samples, there are conflicts regarding the classification at the species level, for example the species of Novosphingobium and Anoxybacillus are detected and in every sample, but there is a distinct bias towards Novosphingobium.It is argued that microbial genomes from natural environments exhibit trait biases resulting from phylogeny-based approaches, and lacking whole-genome sequences of uncultured bacteria, thus the available reference genome datasets may exhibit biases towards more abundant organisms 45 .In pure cultures from Doğanbey hot spring, there is a bias towards this species in bioinformatic analyzes, especially in Epi2Me workflows.Considering that the Novosphingobium species live in moderately thermophilic environments 18,20 , this bias is explained by the frequent classification of this species (about 15%) in the Doğanbey hot spring samples, where water temperatures reach 75 °C, and where the growth of Novosphingobium, a mesophilic species, is usually very limited 20 .
Figure 8 shows a Krona plot analysis of the comparative classification for different parts of the hot spring stream with temperatures decreasing with distance from the source.This revealed that sequencing with universal bacterial primers, archaea and bacteria species was the most common in terms of microbial variety in the most thermophilic part of the source (upstream), then, as the temperature decreased downstream, the archaea disappeared, the bacterial diversity of species decreased and Novosphingobium species became the more dominant (up to 50%).However, this finding runs counter to investigations reported in the literature, where an increase in temperature results in decreasing microbial variety, due to mainly temperature stress and low adaptation rates of the microorganisms to the stressful thermophilic conditions 46,47 .This suggests that universal bacterial primers did not represent an accurate phylogenetic model of the hot spring in terms of microbial variety.On the other hand, using custom designed primers in this study, the phylogenetic model of the three parts of the hot spring suggested a reasonably accurate representation in terms of microbial diversity, showing a decline in microbial diversity with increasing temperatures (Fig. 8).Even though the custom primers were designed using archaeal genomes, these still demonstrate a better representation in terms of the detection and dissociation of bacterial species.
Custom-made primers worked well on direct DNA isolation from water samples (uncultured); however, it was not possible to classify archaeal presence with universal bacterial primers.The amount of archaea increases in the midstream, and at the beginning, downstream distancing from the hot spring source.This can be explained by the lower number of reads in the upstream samples, while the sensitivity may also have decreased.These primers detected 4% of archaea at the most highly thermophilic part of the stream, 28% at the middle part, and 38% at the most highly mesophilic part (downstream).With the universal bacterial primers, the presence of archaea was highest at the source (upstream) with 4%, but no archaea were detected at the mid and downstream points (Fig. 9).
The designed archaea primers were sensitive in detecting the presence of archaea only at mild temperatures, while, at higher temperatures, no difference was observed in the classification of archaea between universal primers and custom-designed archaea primers.The use of two different primers in the upstream sample showed  www.nature.com/scientificreports/ that the archaea primers were able to classify a wider variety of species, but the universal primers dominantly classified Anoxybacillus and Novosphingobium species.
A study 48 demonstrated a custom-primer design for archaea, which included a wider variety of species that live in psychrophilic (4 °C from freshwater samples) to mesophilic (37 °C from bioreactor samples) and temperatures and involved alignment of 8500 of sequences.This study reported at least 38% higher coverage of archaea compared to more commonly used universal primers.Another study 49 , involved distinctive environmental conditions (1 to 4 °C and high pressure) and aimed for higher detection of microbial communities living in niche conditions, similar to the this current study.While as a difference, that study claimed that short read sequencing and V3-V4 hypervariable region of 16S gene in terms of taxonomic assignment of oceanic consortia, and the better performance of the archaeal primer designed in that study in terms of diversity of archaeal communities, especially on Thaumarchaeota (Crenarchaeota), compared to universal primers, and reported up to 70% coverage for archaea.The current study includes niche growth conditions (thermophilic temperatures 60 °C and above, anaerobic and CO-utilizing) of archaea with hot springs of similar ecological conditions, therefore we acknowledge that our custom-built primers are not more comprehensive in terms of the range of archaea covered, however, these primers offer a more sensitive and robust detection, and are able to capture thermophilic and anaerobic hot spring archaea where they can be found in very small abundances.Another study 50 targeted pig fecal samples using a short read NGS sequencer, Illumina MiSeq, and designed universal prokaryotic primers based on V3-V4 hypervariable regions of 16S gene.In their study, custom designed primers matched to 94.6% of Archaea rRNA gene sequences in the database used as a reference.It was additionally shown, similar to the current study, that, where a primer bias existed, archaeal species were also detected with the bacterial universal primer.Custom designed prokaryotic primers increased performance by 0.8% for Archaea compared to previously described universal primer sequences in the database used in that study, and they successfully lowered the bias with the custom designed prokaryotic primers when compared to commonly used universal prokaryotic primers.Some 16S rDNA primer sets have been reported to include biases, thus it is important to minimize these, and obtain an even coverage of both overall and desired microorganisms for an optimized resolution within primer sets.To achieve this, firstly, regarding 16S primers and their relative performance regarding microbial representation, it would be wise to conduct preliminary investigations with less complex samples or mock microbial communities before implementations on environmental extreme areas, especially for the investigation of archaeal species 51,52 .Imitation restrictions of these unknown areas in extreme conditions require sensitive and comprehensive studies to achieve maximum precision with microbial community profiling.Sequencing platforms directly affect the outcome of the study, as they differ greatly in terms of protocol directions (such as PCR conditions), and differences between MiSeq and HiSeq sequencing platforms were reported with their respective protocols using mock microbial communities 53 .High throughput sequencing platforms (Nanopore and PacBio technologies) present greater depths; however, another study 52 suggested that observed biases using the MiSeq platform between different primers could not be solved by greater depths, but it is possible to resolve biases by the comparative approach, using mock communities and field samples.A study extensively comparing both platform and primer choice effects on 16S sequencing reported that primer choice had a greater biological effect than sequencing platforms and emphasized the importance of experimental methods for achieving accurate representation of abundance in microbial communities 54 .
This current study explored hot spring thermophilic communities with investigational parameters, including the comparison of the community structure in different parts of the hot spring waterflow where water temperature ranged from 38.0 to 77.3 °C.Furthermore, we have compared several bioinformatic pipelines and demonstrated unalike prokaryotic profiles with the utilization of conventional and novel primers (universal vs. custom design) for PCR amplification of the samples.Custom design primers indicated better results in terms of decreasing microbial diversity among increased water temperatures of hot springs while sequencing results with universal primers suggested the opposite.Study results also highlighted that both custom designed and universal prokaryotic primers have a certain bias towards Novosphingobium species, this was indicated by the samples where water temperature was too high for this mesophilic species (30-45 °C) to live and still showed an abundance of Novosphingobium species in bioinformatic analysis.Overall, this study is the crucial to report the effectiveness of different bioinformatic pipelines for nanopore sequencing studies together with representing valuable data with the custom design primers used.
As a conclusion, among the two methologies of using targeted custom-design primers and using different bioinformatic analysis pipelines for the classification of hot spring microorganisms, the primer choice had a critical effect on detection of archaeal species and also identification of the variety of bacterial species.Sequencing results revealed that enriched cultures demonstrated a significant increase of carboxydotrophic hydrogenogenic bacteria in abundance due to the manipulation of operational parameters in their favor.This approach has led to the discovery of a methodology for the detection of hydrogen producers from extreme environments.The current study also highlights the advantages of the nanopore sequencing tool for improving the feasibility of molecular workflows in microbial metagenomic studies and the proposed methodology could also be used in future works to investigate the roles of industrially important bacteria and archaea in similarly extreme environments.

Archaeal primer design for custom-built 16S rDNA primers
Custom-built primers were specifically designed for PCR amplification of 16S rDNA regions of thermophilic archaea that are likely to be isolated in Izmir province.Based on literature investigations, thermophilic and anaerobic archaea species reported in hot spring microbial community studies were identified and selected.Archaea sequences were downloaded from the NCBI database in FASTA format (Table 1).In order to detect potential species, archaea species and their entire genomes were combined on the same text file.As an exemplary, it was decided to select a thermophilic archaeal species isolated from a hot spring Thermococcus celer and its 16s rDNA partial sequence (NCBI Reference Sequence: NR_113295.1)because species growth conditions showed similarity to the investigated hot springs in this study 55 .Archaeal sequence and the FASTA file were downloaded through the NCBI database.
For the alignment of the 16S rDNA sequence (query) and the merged archaeal sequences (subject) via BLAST + (BLASTN 2.10.1+), a Python code was programmed to give only the accession codes of the merged archaeal sequences.The program printed the access codes of the archaea, and codes were also transferred to BLAST+, so that it could load archaea sequences from its own database.The command 'blastn -query ".\Methanocaldococcus 16S.txt" -subject Merged_archaea.fasta-outfmt 7 > output.tsv'was entered to the BLAST + program.Tabular with comment lines was selected as a formatting option to save the results as output.tsvformat in an Excel file.The start and end of the sequences in common to the 16 s sequence, and the archaea sequence in the tsv file were determined as the query start (q.start) and end points (q.end).
A Python program called SeqExtractor was coded in order to detect the archaea name and sequences from the 'merged_archaea.fasta'file, which was initially given in the Python program to detect the 8th (query start) and 9th (query end) rows in the query table in tsv file given by the BLAST program.Query start and end points from the archaea sequences were extracted and printed as alignments.Archaea accession codes and alignment sequences were recorded in a file named '16ssequences.fasta' .The script receives two files, a list of fasta files containing the genomic sequences of selected archaea species and a BLAST output where a reference 16S sequence was aligned to these sequences.The script extracts 16S sequences from each genome and assembles them in a single file in fasta format, ready to be aligned.
These sequences were aligned with the UGENE Multiple Sequence Alignment program (v.33) 56 .Some sequences were found to be inverted, otherwise the sequences were very similar and aligned.16S sequences were also aligned with the ClustalX Multiple Sequence Alignment program (v2.0) 57 .Reverse complementary sequences of reversed sequences were obtained (via were loaded into the ClustalX program together with the sequences, and realigned (Fig. 10).Forward and reverse archaeal primers were determined according to the obtained alignment results (Table 2).
Universal 16S rDNA Bacterial primers were supplied by Nucleus Genetic Inc.In order to obtain the same T m (melting temperature) value of the primers to perform PCR under the same conditions as the primers to be used in bacteria, the T m value of the specific bacterial primers was first found on the website (https:// tmcal culat or.neb.com) and the Archaea primer sequences were shortened accordingly (Table 2).PCR trial conducted with custom designed 16S rDNA primers for archaea showed the success of the amplification of 16S region of the DNA samples isolated from hot springs.

Sampling and cultivation of hot spring isolates
Five different hot springs located in Doğanbey, Çeşme, Dikili Bademli, Dikili Nebiler, and Bergama, Izmir, Türkiye (Table 3) were field-visited to collect samples between 25/05/2021 to 23/08/2021 (Fig. 11).Samples were collected in sterile plastic bottles and transferred to the laboratory in a heat-insulated container.pH, temperature, and oxidation-reduction potential (ORP) measurements were made from each collection point with a portable pH meter (MW 105 Max, Milwaukee, USA).The samples were processed in two different workflows: Table 1.Archaeal species that are likely to be found in thermal waters in and around İzmir, living at a temperature of 50-65 °C, around pH 6-7.

Methanocella conradii NC_017034
Methanothermobacter marburgensis NC_014408 www.nature.com/scientificreports/ (1) Direct DNA isolation of hot spring samples was performed (5 L of volume) according to the manufacturer's instructions with Thermo Fisher GeneJET Genomic DNA Purification Kit (Thermo Fisher Scientific, USA) in triplicate, without any enrichment.Isolated DNA was stored at − 20 °C for further 16S rDNA analysis.(2) The collected samples were transferred into sterile 50 mL falcon tubes and centrifuged at 1398×g for 10 min (Hanil Science Industrial, South Korea).The enrichment media content including macronutrients, micronutrients, and vitamins, was prepared as indicated in Ref. 32 .pH of the nutrient medium was fixed to 6.8 with 0.5 M HCl and 54 mL of prepared basal medium was transferred to glass bottles with a total volume of 100 mL.The bottles were closed with gas-tight rubber stoppers and metal caps, and autoclaved at 121 °C and 15 min.O 2 in the headspace of the closed bottles was removed by purging with N 2 gas.Following sterilization, 0.1 mL of sterile 10× vitamin solution was added to sealed vials containing 54 mL of sterile anaerobic thermophilic medium, and 6 mL (10%) collected pellet was inoculated.Bottles were fed with two different gaseous substrates: syngas (H 2 (5%), O 2 (5%), CO (10%), CH 4 (5%), CO 2 (20%) and N 2 (40%)) and 100% CO gas for 30 s.The bottles were inoculated and placed in an incubator at a temperature appropriate to the isolates' isolation temperatures (45, 55, 60 or 65 °C).Following serial cultures grown to

Figure 1 .
Figure 1.Representation of species level classification comparing archaeal distribution among samples collected from 3 different parts of the Doğanbey hot spring and amplified with two different 16S rDNA primers (universal and custom-built) using Kraken tool as indicated in the (Supplementary Information) including the Python codes and BLAST data.Samples were collected from upstream (A,B), midstream (C,D) and downstream (E,F) regions, and analyzed using universal (A,C,E) and archaea (B,D,F) primers.

Figure 2 .
Figure 2. Representation of species level classification of samples collected from Doğanbey hot spring using the Epi2Me tool.Samples were collected from upstream (A,B), midstream (C,D) and downstream (E,F) regions, and analyzed using universal (A,C,E) and archaea (B,D,F) primers.

Figure 3 .
Figure 3. Representation of genus level classification of samples collected from Doğanbey hot spring using the in-house BLAST.Samples were collected from upstream (a,b), midstream (c,d) and downstream (e,f) regions, and analyzed using universal (a,c,e) and archaea (b,d,f) primers.

Figure 4 .Figure 5 .
Figure 4. Representation of species level classification of samples collected from Doğanbey hot spring using the in-house BLAST.Samples were collected from upstream (a,b), midstream (c,d) and downstream (e,f) regions, and analyzed using universal (a,c,e) and archaea (b,d,f) primers.

Figure 6 .Figure 7 .
Figure 6.Percentage representation of microbial community diversity and quantity plots of samples at genus level bacteria and archaea in Bergama, Çeşme, and Doğanbey hot springs from enriched cultures.

Figure 8 .
Figure8.Krona plot analysis of three different points of decreasing temperatures of the hot spring: upstream, midstream and downstream, demonstrating significant loss of both archaeal presence and microbial variety using universal 16S rDNA bacterial primers.

Figure 10 .
Figure 10.Alignment results on ClustalX for the archaea species likely to be found in the İzmir hot springs.

Table 3 .
Summary of the 16S rDNA metagenomic investigation of the classification of 5 distinct hot springs in Izmir area.Isolated cells describe the uncultured microorganisms directly analyzed from the water samples, and cultured cells describe the microorganisms grown under CO and enriched media.