Supplementary Figure 2: k-mer identity vs. percent nucleotide identity. | Nature Biotechnology


From: Ultrafast search of all deposited bacterial and viral genomic data


We ran 1,000 simulations in which, on each iteration, we introduced one more random SNP into a 1,000-bp sequence and calculated the k-mer similarity between the original sequence and the sequence carrying the introduced SNPs. The plot shows the mean k-mer similarity observed at each percent nucleotide identity. The grey area shows three times the standard deviation around this mean.
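The simulation described in this caption could be sketched roughly as follows. This is an illustrative reconstruction, not the authors' code. The caption does not state several details, so they are assumptions here: the k-mer size (`k=31`), the definition of k-mer similarity (the fraction of the original sequence's distinct k-mers still present in the mutated sequence), the choice to place SNPs at distinct positions, and the cap on SNPs per run (`max_snps`). All function names are hypothetical.

```python
import random
import statistics

BASES = "ACGT"

def kmers(seq, k):
    """Return the set of distinct k-mers in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def kmer_similarity(original, mutated, k):
    """Assumed definition: share of the original's distinct k-mers found in mutated."""
    ref = kmers(original, k)
    return len(ref & kmers(mutated, k)) / len(ref)

def simulate_once(length, k, max_snps, rng):
    """Add SNPs one at a time to a random sequence; record similarity after each."""
    original = "".join(rng.choice(BASES) for _ in range(length))
    mutated = list(original)
    # Assumption: SNPs hit distinct positions, so n SNPs give exactly n mismatches.
    positions = rng.sample(range(length), max_snps)
    similarities = []
    for pos in positions:
        mutated[pos] = rng.choice([b for b in BASES if b != original[pos]])
        similarities.append(kmer_similarity(original, "".join(mutated), k))
    return similarities

def summarize(n_sims=1000, length=1000, k=31, max_snps=100, seed=0):
    """Return (percent identity, mean similarity, standard deviation) per SNP count."""
    rng = random.Random(seed)
    runs = [simulate_once(length, k, max_snps, rng) for _ in range(n_sims)]
    rows = []
    for n in range(1, max_snps + 1):
        values = [run[n - 1] for run in runs]
        identity = 100.0 * (length - n) / length
        rows.append((identity, statistics.mean(values), statistics.stdev(values)))
    return rows
```

Calling `summarize()` with the defaults mirrors the 1,000 runs on a 1,000-bp sequence but can take a while in pure Python; a smaller call such as `summarize(n_sims=20, max_snps=50)` is enough to see the trend. The grey band in the figure would then correspond to the mean plus or minus three times the returned standard deviation, if the band is read as symmetric.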
