Introduction

What is a novel mitochondrial DNA (mtDNA) mutation? At first sight the answer to this question appears to be utmost trivial: an observed mtDNA mutation or polymorphism is novel if it has not been observed before; that is, it has not been reported in the literature before or cannot be found in other publicly available sources. This, however, is not the manner in which the novelty of mtDNA mutations is perceived and treated in practice by the working human geneticist. Novelty is operationally defined by exactly two mouse clicks: with the first one the geneticist accesses http://www.mitomap.org/cgi-bin/mitomap/search.pl (click); with the second one he/she enters the targeted position number in the slot of “Search for:” (click): “please scroll down”. And there, on the screen, is the answer. The results of applying this search method means that if you find the observed mtDNA mutation cited, you can copy-and-paste the citation into your paper; it is not novel. Conversely, with the answer “sorry, your query did not match any items in MITOMAP database for the selected category”, you designate the new mutation as “novel”. If, in addition, this mutation, which either affects a ribosomal or transfer RNA gene or is coding and nonsynonymous, is not observed in some small convenience sample declared as “controls”, then a case can be made for disease-association or pathogenicity. The seeming pathogenic role is then often ‘garnished’ with the inferred consequences of the tertiary structure of the gene-product affected by that mutation.

Although this practice is not really supported by the more sophisticated strategies of weighing evidence for or against pathogenic status (Mitchell et al. 2006), it is nevertheless the standardly accepted protocol in almost all medical investigations. It is probably not well known nor greatly appreciated that MITOMAP is a database of rather limited scope and, consequently, quite problematic for two reasons: (1) it actually exhibits and classifies only a non-representative subset of the published mtDNA mutations, and (2) the mutations listed and sorted into different slots are mined from the articles without critical editing; that is, obviously flawed results would also find their way into the database (for a pertinent case, see Brandstätter et al. 2005). The user who innocently takes MITOMAP as the gold standard in the field can thus be seduced into making inadvertent interpretations of his/her mtDNA findings.

Methods

In a typical disease study a sequencing of the entire mitochondrial genome would reveal 5–50 mutations in the coding region relative to the revised Cambridge reference sequence (rCRS). These are then compared with public databases, such as the mtDNA database MITOMAP (Brandon et al. 2005; http://www.mitomap.org/), In addition to this database, as edited 27 February 2006, there is the Uppsala mtDB database (Ingman and Gyllensten 2006; http://www.genpat.uu.se/mtDB/) maintained by Max Ingman, as edited 14 December 2005. This latter database hosts a large subset of complete mtDNA sequences or entire coding-region sequences that result from systematic sequencing efforts published mainly in the field of population genetics, with reference to the originally published sequences stored in GenBank.

For supplementary control-region information, one may use the SWGDAM database (http://www.fbi.gov/hq/lab/fsc/backissu/april2002/miller1.htm). This forensic mtDNA database lists more than 4800 partial control-region sequences (covering the two hypervariable segments); for some caveats, however, see Bandelt et al. (2004a, b). For some early references concerning control-region sequences, the Mitochondrial DNA Concordance database (Miller et al. 1996; http://www.bioanth.cam.ac.uk/mtDNA/index.html) is quite useful. Note that this database was last updated 30 April 1998, and no new sequences are currently being added to it. Sequence data recorded in the vast literature available on the mtDNA genome (mainly in forensic and human population genetic journals) have been used as well; this information includes approximately 40,000 profiles for the first hypervariable region (HVS-I) and approximately 15,000 profiles for the second hypervariable segment (HVS-II).

Views of the basal parts of the mtDNA phylogeny can be found in Herrnstadt et al. (2002), and updated haplogroup nomenclature and trees are given in publications by Kong et al. (2003, 2006) for East Asian mtDNA, by Palanichamy et al. (2004) for West Eurasian mtDNA, and by Kivisild et al. (2006) and Torroni et al. (2006) for African mtDNA. Estimates of mtDNA trees can also be downloaded from the mtDNA webpage of Ian Logan (http://www.ilbg18230.pwp.blueyonder.co.uk/mtdna.htm), who has currently (as of 11 May 2006) examined nearly 2500 complete mtDNA sequences – haplogroup by haplogroup – using the up-to-date haplogroup nomenclature. One should bear the caveat in mind that any tree estimate whatsoever may bear some typos or oversights and, even when correctly represented, necessarily hinges upon the current knowledge of the mtDNA pool and an understanding of the mutational process. As such, tree estimates may slightly change when more information becomes available.

Complete mtDNA sequences obtained in medical studies are scattered over the literature, and there are only a few sources available that put some of them into context (see Macaulay et al. 1999; Kivisild et al. 2002). In order to retrieve the primary sources, it is helpful to use general-purpose search engines provided in the Internet. We have taken Google and Google Scholar by way of example. Note that different country-specific Google versions may return slightly different search results.

Results

About refractory anemia with ring sideroblasts and the ‘novelty’ of T9137C

Let us take an example from a recent study (Babušiaková et al. 2004), which highlights a number of ‘novel’ mtDNA mutations, such as the transition at 9137 relative to rCRS. MITOMAP does not offer any information about this site. The user adhering strictly to MITOMAP would therefore infer that this mutation is new – that is, it has never been observed before. Now, entering the query “human mtDNA 9137” into a search engine, such as Google (Deutschland), returns more than 100 results. The first entry is the reference to a publication by Maca-Meyer et al. (2001), where the 9137 transition was indeed recorded in a haplogroup HV lineage. The other hits are “cold”, typically referring to page or telephone numbers. The same query in Google Scholar delivers yet another hit, one of Kong et al. (2004), where this mutation occurs in a table, with reference to the haplogroup F2a lineage from Kong et al. (2003). Both of these two complete mtDNA sequences from haplogroups HV and F2a, respectively, can also be retrieved from the mtDB database after entering the query for “9137 C”.

When we enter “human mtDNA T9137C” into Google, we obtain the single reference to Máximo et al. (2002), which lists T9137C as a “somatic” mutation. The same query in Google Scholar delivers a second hit, namely De Joanna et al. (2000), who wrote that they “... found a new, homoplasmic T9137C mutation in the patient, but not in 50 control subjects: the mutation converts a moderately conserved isoleucine by tryptophan but its pathogenic role is unclear”. Indeed, the role of T9137C is still unclear, mainly because this mutation is rediscovered again and again in different contexts without any systematic analysis and screening of its natural occurrences in the mtDNA phylogeny.

Internet search for G6261A

To give another example, consider the transition at 6261, which is also mentioned by Babušiaková et al. (2004). MITOMAP lists this mutation by referring only to the paper of Petros et al. (2005), which may evoke the impression that this mutation has something to do with prostate cancer. In reality, this mutation occurs in many parts of the mtDNA phylogeny and had already been observed prior to 2005 in the coding-region data of Herrnstadt et al. (2002, 2003) and in the complete mtDNA data of Tanaka et al. (2004) – altogether in at least six different haplogroups C, D1, D4e, H3, L3e3, and T2). More recently, Kivisild et al. (2006) also observed this mutation in the Indian haplogroup M4b (sample no. AS10) and in a haplogroup T2 sequence (sample no. EU79). These references are all provided by the mtDB database. Entering “mtDNA 6261” into Google Scholar yields the reference to Howell et al. (2003) as the first entry, and we are thus led to the complete mtDNA sequence of their VIC 14 LHON pedigree. This mtDNA lineage belongs to haplogroup H3 and is closely related to coding-region sequence no. 426 of Herrnstadt et al. (2002, 2003). The same query entered into Google gives the additional reference to Appendix S1 of Loogväli et al. (2004), where this haplogroup H3 lineage no. 426 is displayed in the haplogroup H mtDNA phylogeny. Moreover, with Google we find an early reference to Fliss et al. (2000), where this mutation was also listed.

As in the case of T9137C, we find another set of references when we initiate a Google search with “mtDNA G6261A” instead of “mtDNA 6261”. Such a search has a high success rate because no page or telephone numbers or range numbers are detected and listed; on the other hand, such a search is bound to miss a number of relevant references that do not adhere to the usual notation of mtDNA mutations in medical genetics. The five hits offered by this new query are all relevant without exception: the first points to Petros et al. (2005) and the fourth and fifth to the same context of prostate cancer with reference to the survey article by Wallace (2005).

The second item of the previous search result leads us to a haplogroup L3e tree, which is embedded in the mtDNA webpage of Ian Logan. His haplotype lists and trees can be most useful for the working human geneticist, and one may wonder why such necessary bookkeeping work is done by an amateur genealogist rather than by medical laboratories that regularly publish findings about ‘pathogenic’ mtDNA mutations.

The third item from the Google results for “mtDNA G6261A” concerns a conference contribution (Arbuzova et al. 2001), where the mutations G6261A together with C5187T, C7873T, C10822T, A10972G, and A14977G were found in a Down syndrome patient. This listing includes only those mutations that were determined not to have been described prior to the time of the conference contribution. Interestingly, there is a large overlap with the five private transitions at 5187, 6261, 7873, 10822, and 11914 in the haplogroup T2 lineage no. 430 of Herrnstadt et al. (2002, 2003). Since G11914A is a mutation that has been known about for a long time (e.g. Tanaka and Ozawa 1994), we may well assume that this mutation was also present in that Down syndrome patient. Recently, Kivisild et al. (2006) have reported a haplogroup T2 lineage with private transitions at 783, 6261, 9524, 10822, 11914, 12346, and 15479. The Internet user is led to this and a subsequent reference through the initial Google search “mtDNA 6261”. Fraumene et al. (2005) have analyzed 66 complete mtDNA sequences from a Sardinian village (Ogliastra). Three of these belong to a particular branch of haplogroup T2 bearing transitions at sites 146, 279, 5187, 6261, 7679, 7873, 10822, 15784, and 16292 [see Fraumene et al. (2006) for the subsequent full publication]. Interestingly, the transition at 279 within haplogroup T2 seems to be confined to this branch, as indicated by the presence of 146 and/or 16292, which could be inferred from searching the SWGDAM mtDNA database. The sequence motif with transitions at 16126, 16292, and 16294 in the HVS-I segment appears to be quite frequent in other parts of Sardinia as well (e.g. Fraumene et al. 2003). It would appear that this is a new basal branch of haplogroup T2 that is partially characterized by G6261A and C16292T. Should we then, following Petros et al. (2005), expect an elevated frequency of prostrate cancer in Sardinia? It would be worth finding out whether carriers of this particular T2 subhaplogroup indeed bear a much higher cancer risk than non-carriers.

Revisiting the ‘prostate cancer’ mutation A6663G

In contrast to the highly recurrent G6261A, another mutation, A6663G (highlighted as novel by Babušiaková et al. 2004), which is also suspected to be connected with prostate cancer (Petros et al. 2005; Wallace 2005), seems to be an extremely rare event. In MITOMAP, the mutation A6663G appears in three categories – under “Coding Region Sequence Polymorphisms” (with reference to Scaglia et al. 2003), “Reported Mitochondrial DNA Base Substitution Diseases: Coding and Control Region Point Mutations” (referring to Petros et al. 2005), and “Unpublished MtDNA Polymorphisms” (Polyak and Vogelstein 1999).

Performing the same Internet search strategy as above, we come across ten occurrences of this mutation (Parfait et al. 1997; Polyak et al. 1998; Torroni et al. 2001; Herrnstadt et al. 2002, 2003; Howell et al. 2004; Kivisild et al. 2006), all pointing to the same particular subhaplogroup of haplogroup L2a1 (sensu Kivisild et al. 2006). Note that the mutation screening of Parfait et al. (1997) was fairly incomplete at the time: at the very least the five COX transitions at 7175, 7274, 7771 and 8206, 9221, which are characteristic for haplogroups L2a and L2, should have been listed for the L2a lineages of their Patients 5 and 15. This sub-haplogroup is characterized by G3010A and A6663G in the coding region and possibly by further transitions at 16193, 16213, 16239, and 513 in the control region. The corresponding samples are from Burkina Faso and from North or Central America. This specific mutation motif in HVS-I has also been found elsewhere; for example, in two mtDNAs from Guinea-Bissau (Rosa et al. 2004). One could certainly expect some population substructure in regard to this minor subhaplogroup. However, Petros et al. (2005) and Wallace (2005) failed to appreciate these particular circumstances. Before interpreting the (rare) presence of A6663G in patients with prostate cancer, it would have been necessary to carry out large-scale screening of males bearing this particular haplogroup L2a mtDNA type.

Misscored mutations

After going through the mutations declared to be ‘novel’ by Babušiaková et al. (2004) – one by one – we eventually find that all but one have already been reported elsewhere (Maca-Meyer et al. 2001; Herrnstadt et al. 2002, 2003; Achilli et al. 2004; Coble et al. 2004). It therefore seems highly unlikely that this single, truly novel substitution, G15084T (in heteroplasmic state), detected by Babušiaková et al. (2004) can sustain their claim of an “accumulation of homoplasmic mtDNA point mutations” in patients of refractory anemia with ring sideroblasts. Mutations at 1838, 11250, and 11457 would also be novel mutations according to MITOMAP. However, the polymorphisms “G1838A”, “A11250G”, and “A11457G” reported by Babušiaková et al. (2004) bear incorrect rCRS nucleotides. Therefore, one can safely regard these mutations as well as “G16154A” as typos (see our Table 1). On the other hand, the documented mtDNA variation is obviously incomplete, with one to seven mutations missing (Table 1); as such, it is not clear whether some mutation of potential disease status may have been overlooked.

Table 1 Mutations missed or misreported in the study of Babušiaková et al. (2004)

Mutations that seem to be novel can also be generated by inadvertent scoring or through clerical errors. For instance, the deletion 248del was declared to be an “unknown” mutation by Carew et al. (2003). Deletion of one A at positions 248–249, however, is normally scored as 249del, following the forensic convention for nomenclature of homopolymeric stretches (Carracedo et al. 2000). Consequently, 249del is listed among the known control-region mutations in MITOMAP. In the early literature, this deletion was often scored as 248del, as is testified, for example, by a Google search using the entry “mtDNA 248d”. Note that the current Uppsala mtDB database also employs the latter (outdated) convention, albeit not without problems: four haplogroup C sequences from Mishmar et al. (2003) bearing 249del are recorded – less parsimoniously – with 247del and A248G, which constitutes an artifact probably produced by some automatic alignment program.

Wong et al. (2002) considered the double deletion '286–291 del AA' as novel (see their Table 3); however, this deletion is diagnostic of haplogroup C1 and is, of course, well known: it is generally referred to as '290delA 291delA' (Carracedo et al. 2000) or '290–291del' or '290–291d'. A similar case is the variation in the short tandem CA repeat in region 514–523 (or alternatively interpreted as AC repeats in 515–524). In MITOMAP, the deletion or insertion of the dinucleotide repeat can be found scored in at least ten different ways as 514C > CAC, 514delC, 515delA, 515delAC, 516delC, 520C > CACACAC, 520delC, 520C > CAC, 521delA, or 522delC, respectively. Despite this variety of possible scorings, an inadvertent base shift could then easily produce a seemingly novel mutation: indeed, Li et al. (2004) scored one deletion of the dinucleotide repeat in this region wrongly as 515delCA and claimed, by referring to MITOMAP, that this was a previously unobserved mutational event.

Phantom mutations

Unusual ensembles of reported mutations may be due to artifacts generated in the course of the electrophoresis (Bandelt et al. 2002; Brandstätter et al. 2005). Quite often, such phantom mutations are predominantly transversions (Bandelt et al. 2005b; Bandelt and Kivisild 2006). For example, two-thirds of the mutations provided by Abu-Amero et al. (2006) in their Tables 2 and 3 are transversions. It is then instructive to inspect these mutations more closely, which were detected in 26 medullary thyroid carcinomas but not found in “119 normal population controls and 284 individuals with various mitochondrial disorders” according to the authors; novelty was decided by referring to the databases MITOMAP and mtDB. However, three (C4960T, A8836G, and C15247G) of the ‘novel’ mutations were first published by Maca-Meyer et al. (2001), three (T11204C, C11665T, and T15672C) by Herrnstadt et al. (2002, 2003), two (T5093C and T15674C) by Achilli et al. (2004), and one (G5979A) by Tanaka et al. (2004). These mutations (with additional references) can all be retrieved from the mtDB database. The deletion of C at 16188 is not new either, as can be seen from a query to the SWGDAM database. Additional Internet searches do not provide further hits, except for the 11708 mutation mentioned by Guan et al. (1998), which, however, constitutes an obvious typo. Thus, of the many mutations listed in Tables 2 and 3 of Abu-Amero et al. (2006), only 26 have apparently not been observed before. The excess of transversions (23 transversions versus only three transitions) is now most dramatic. The fact that 20 of those transversions could only be captured by one amplified sequence fragment each (of a length exceeding 700 bp) makes it plausible that some phantom mutations could have slipped into the sequencing results because good readability of both strands might not have been guaranteed over the whole range.

It is most problematic when the potential pathogenic status of an observed mutation so strongly depends on the apparent novelty of the mutation, especially when the database search was incomplete. For example, the three mutations C4960T, C8472T, and A8836G detected by Abu-Amero et al. (2006) belong to the ancestral mutation motif of haplogroup N1b according to Palanichamy et al. (2004). One should therefore be skeptical about the claim that A8836G affects protein function with “high confidence”. Only a systematic analysis (including cybrid analysis) of a large number of mtDNA lineages from haplogroup N1b could substantiate such a claim.

New polymorphisms in retrospect

Jerónimo et al. (2001) reported the spectacular case of a patient seemingly bearing 18 somatic mutations in a tumor, which, with retrospective analysis, can be explained as a clear case of contamination or sample mix-up (Salas et al. 2005). In passing, Jerónimo et al. (2001) noted 23 “new polymorphisms” in 16 patients with prostate cancer, which were deemed to be novel germline mutations at the time. Among the 23 coding-region mutations listed, as many as 16 are now known to characterize certain haplogroups and another three occur as private mutations in the normal population (as can be inferred from the mtDB database), so that only four mutations (A2335G, A12490G, A12777G, and C13623T) have hitherto not been observed elsewhere (according to our search procedure). It is not clear whether the control region has been properly aligned to rCRS since the reported rCRS nucleotides at two positions (16094 and 16409) do not match the real rCRS nucleotides. Therefore, neither the spectrum of the “new” germline mutations nor the single somatic mutations in two tumors would indicate anything unusual that could support any particular role of mtDNA in prostate cancer.

The discovery of novel (homoplasmic) mtDNA variants in several mitochondrial dysfunctions is the leitmotif of the article by Crimi et al. (2002). However, more than 60% of the mutations reported by this group had already been reported elsewhere before their paper was submitted (in May 2002); see Table 2. Most of these variants appeared in the 560 coding-region sequences of Herrnstadt et al. (2002), published in January 2002. One can then not quite agree with Crimi et al.’s claim that their “work can therefore help complete the already ample mtDNA polymorphism existent database”, especially as the complete genomes – with all variants included – were not reported. This is particularly unfortunate in the case of the mtDNA of Patient 3, since this lineage belongs to the rare U2d haplogroup, for which only one (nearly) complete mtDNA sequence is available in the literature to date; see Palanichamy et al. (2004).

Table 2 Efficiency of database search in the medical literature

The article by Crimi et al. (2002) is in no way exceptional in this regard. We have investigated the published record for ‘novel’ mutations from a further eight articles; see Table 2. For instance, Wong et al. (2002) performed a “comprehensive scanning of the entire mitochondrial genome for mutations” by analyzing 179 unrelated patients but did not report the observed haplotypes. Most (approx. 71%) of the “new” variants listed in that article are well-known polymorphisms in normal individuals (and approx. 43% were known at the time of publication), many of which are diagnostic of particular branches of the mtDNA phylogeny.

The tale goes on

Tagging the term novelty to a mutation found (by chance) in some patient for whatever disease may be under study at that moment seems to enhance the chances for publication and is therefore applied to 'spice up' a scientific contribution. A search of PubMed (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi) for “novel” OR “new” AND “mtDNA” AND “mutation” resulted in a list of 30 publications with these key words in the title and altogether 315 entries where these words appear in the title or abstract. In MITOMAP, a focused search for an (mtDNA) article bearing “novel” in its title even delivers 124 entries, most of which point to ‘novel’ mtDNA mutations. For example, Qu et al. (2006) included ‘novelty’ of the mutation A4435G in the title of their paper – even though this mutation has been known about for a couple of years (Herrnstadt et al. 2002, 2003; Tanaka et al. 2004): querying the mtDB database reveals five occurrences in mtDNA sequences belonging to different haplogroups (D1, D5a2, J2a, and M7a2). Similarly, the claim of Zhadanov et al. (2005) of having found a novel ND6 mutation not previously reported (G14279A) is unjustified, since this mutation had already been found by Herrnstadt et al. (2002, 2003) in an mtDNA sample (no. 83). MITOMAP does not recognize this earlier appearance of G14279A and only cites Zhadanov et al. (2005).

As a matter of coincidence, both Zsurka et al. (2004) and Cardaioli et al. (2005) found the same ‘novel’ heteroplasmic tRNALeu(CUN) mutation, G12276A, albeit in different disease contexts. This may well constitute a pathogenic mutation, which, however, has not yet found its way into MITOMAP (as edited 25 January 2006), so it will probably be rediscovered again and again as a ‘novel’ mutation in future studies. The advertisement “MITOMAP also maintains a compendium of all known pathogenic mtDNA mutations” (Brandon et al. 2005) should certainly be taken with a pinch of salt.

The abstract of the paper by Ballana et al. (2006) announces the ‘novelty’ of two mutations, T1243C and T1291C, in patients with hearing loss. The latter mutation has been found in a family of Cuban origin, but no haplogroup information is available from that study. Nonetheless we can conjecture that this mtDNA lineage belongs to a specific branch of the African haplogroup L1c1. Within this haplogroup, T1291C together with a further dozen mutations seem to define a sub-haplogroup, as can be inferred from two specific L1c1 lineages – one from the USA (recorded by Herrnstadt et al. 2002, 2003) and one directly sampled in Africa (Kivisild et al. 2006). An independent occurrence of T1291C is reported within the South Asian haplogroup R6 (Palanichamy et al. 2004). Thus, T1291C is rather unlikely to be a disease-causing mutation despite the claim of Ballana et al. (2006). Given the fact that this mutation is essentially confined to a minor L1c1 sub-haplogroup, it cannot come as a surprise that it would be absent in most control groups, especially if the control group is from Spain [see p 951 of Ballana et al. (2006)], where the mtDNA input from sub-Saharan Africa is very low.

The first mention of the mutation T1243C in a disease context (aminoglycoside-induced deafness) was probably that of Bacino et al. (1995). This reference can be obtained via http://www.inchem.org/documents/jecfa/jecmono/v51je02.htm, to which one is led to by a Google (or Google Scholar) search with query “human mtDNA T1243C”. Since this first mention, the ‘novelty’ of this mutation has been a recurring, never-ending theme (Schwartz et al. 1999; Jones et al. 2001; Taylor et al. 2001; Vives-Bauza et al. 2002; Zhu et al. 2005; Ballana et al. 2006). The variant T1243C is, in fact, ubiquitous in West Eurasia: namely, the mtDB database harbors as many as 56 entries, 53 of which are from West and South Eurasia, all belonging to haplogroup W; the three sporadic occurrences are from Africa (in haplogroups L0k1 and L0d1). It remains to be asked why a mutational motif of a well-known haplogroup is rediscovered over and over again by medical geneticists, who then declare one of its mutations, T1243C, as novel. The answer lies in the two mouse clicks leading to MITOMAP: this mutation is recorded there only in connection with the pancreatic cancer cell lines of Jones et al. (2001) and has never been recognized as a (benign) polymorphism by MITOMAP. The same fate is shared by the mutation C11674T that characterizes the super-haplogroup N2 in which haplogroup W is nested (Palanichamy et al. 2004); the latter is also listed as an “unpublished mtDNA polymorphism” connected with mental retardation (Mihailova 2006).

Conclusion

Internet searches involving only one to a few dozen clicks of the mouse will provide the searcher with a fairly complete picture of the published information on every single mtDNA mutation of interest, although for a few mutations the early (pre-Internet) literature would have to be consulted, especially in the case of control-region variants. All mtDNA lineages bearing the targeted mutation should then be located in the phylogenetic trees offered by any one (or more) website(s) or pertinent publication(s). This is the necessary homework that has to be done by the human geneticist before he/she can attempt to interpret the mtDNA mutation under investigation. Moreover, it is of crucial importance whether a mutation is highly recurrent and typically found as a private mutation or whether it constitutes an extremely rare event but defines only one or two minor basal haplogroups, as this would demand different strategies with respect to finding the appropriate control samples. There is no way to by-pass or short-cut these steps. Many of the premature claims in a number of publications concerning the pathogenicity or disease-association of certain mutations could have been avoided if, from the very outset, efficient search strategies and some degree of phylogenetic analysis had been performed (Bandelt et al. 2005a, c; Salas et al. 2005). This also entails that comprehensive sequencing results should be published – and not just a list of those mutations deemed to be ‘novel’ (Vega et al. 2004).

We provide here a first-aid algorithm for tracing mutations in the published record and subsequently hunting for their phylogenetic relevance. First, consult the cumulative databases MITOMAP and mtDB and look up the highlighted publications (if any). Then, enter the targeted mutation into several search engines by testing alternative designations, such as "T9137C", "9137T > C"’, "9137C", or "9137", in connection with "human mtDNA" or "mitochondrial disease". In the next step the occurrences of the mutation in the mtDNA phylogeny have to be sorted out: a quick consult of Logan’s website may assist the searcher in focusing subsequent searches. When the respective haplogroups, in which the targeted mutation thrives, and their (sub) continental mtDNA origin have been determined, one can then enter the key words "complete mtDNA", "human mtDNA", "mtDNA phylogeny" (or "mtDNA tree"), and, for the geographic focus, "West Eurasian" (or "African", "East Asian", etc.). This would also help in obtaining an up-to-date view on the corresponding parts of the mtDNA phylogeny and the refined nomenclature.

The fact that a freshly detected mutation appears to be ‘novel’ relative to the published record has absolutely no significance within the framework of its potential pathogenicity or disease-association – unless further compelling evidence is provided. For a homoplasmic mutation, the additional criteria “nonsynonymous” and “evolutionarily conserved” are too soft to be of real relevance. A hitherto unobserved mutation could simply be part of the mutational motif of an infrequent haplogroup (Yao et al. 2006). On the other hand, a frequent mutation could very well be mildly deleterious, acting as a secondary disease-mutation in a complex multi-causal relationship together with other mtDNA mutations on a particular nuclear background. One should therefore recall the statement that “it is difficult enough to identify frankly pathogenic mtDNA mutations, and it is obvious that proving haplogroup-associated polymorphisms act as risk factors will be commensurately more challenging” (Herrnstadt and Howell 2004).