HUGO Gene Nomenclature Committee (HGNC) recommendations for the designation of gene fusions

Bruford, Elspeth A.; Antonescu, Cristina R.; Carroll, Andrew J.; Chinnaiyan, Arul; Cree, Ian A.; Cross, Nicholas C. P.; Dalgleish, Raymond; Gale, Robert Peter; Harrison, Christine J.; Hastings, Rosalind J.; Huret, Jean-Loup; Johansson, Bertil; Le Beau, Michelle; Mecucci, Cristina; Mertens, Fredrik; Verhaak, Roel; Mitelman, Felix

doi:10.1038/s41375-021-01436-6

Download PDF

Consensus Statement
Open access
Published: 06 October 2021

HUGO Gene Nomenclature Committee (HGNC) recommendations for the designation of gene fusions

Leukemia volume 35, pages 3040–3043 (2021)Cite this article

28k Accesses
43 Citations
133 Altmetric
Metrics details

Subjects

Abstract

Gene fusions have been discussed in the scientific literature since they were first detected in cancer cells in the early 1980s. There is currently no standardized way to denote the genes involved in fusions, but in the majority of publications the gene symbols in question are listed either separated by a hyphen (-) or by a forward slash (/). Both types of designation suffer from important shortcomings. HGNC has worked with the scientific community to determine a new, instantly recognizable and unique separator—a double colon (::)—to be used in the description of fusion genes, and advocates its usage in all databases and articles describing gene fusions.

Brief historical background of gene fusions

Technical developments at the end of the 1970s enabled the identification of genes in the breakpoints of chromosome rearrangements, which in the early 1980s led to the discovery and characterization of gene fusions in neoplasia. While the products of translocation events are referred to by several terms, including fusion genes, hybrid genes and chimeric genes, here we largely choose to use the term fusion genes as this is most widely used in this context. Analyses of the recurrent balanced translocations in Burkitt lymphoma (BL) and chronic myeloid leukemia (CML) proved particularly pivotal. The picture to emerge was that reciprocal translocations exert their effects by one of two alternative mechanisms: deregulation, usually resulting in the overexpression of a seemingly normal gene in one of the breakpoints, or the creation of a hybrid, chimeric gene through fusion of parts of two genes, one in each breakpoint [1] (see Fig. 1).

**Fig. 1: The chromosomal basis of gene fusions.**

BL provided the first conclusive evidence for the deregulation mechanism. This tumor type was found to harbor one of three translocations: t(8;14)(q24;q32), t(2;8)(p11;q24) or t(8;22)(q24;q11). In all three, the breakpoints in chromosome 8 were found to be within or adjacent to the MYC oncogene (8q24), and the other breakpoint in an immunoglobulin gene, encoding the heavy chain (IGH in 14q32) or the kappa (IGK in 2p11) or lambda (IGL in 22q11) light chains [2,3,4]. As a consequence of these translocations, the MYC gene becomes transcriptionally deregulated, often overexpressed, owing to the influence of regulatory elements of the immunoglobulin genes.

The alternative mechanism, the creation of a hybrid gene, was documented at the same time in CML with the demonstration that the Philadelphia chromosome, i.e., the derivative chromosome 22 resulting from the recurrent reciprocal translocation t(9;22)(q34;q11), juxtaposed the 5′ part of the BCR gene at 22q11 with the 3′ part of the ABL1 tyrosine kinase-encoding gene from 9q34. This leads to an in-frame fusion of parts of the two genes and results in an abnormal protein, which displays increased tyrosine kinase activity [5,6,7,8] (see Fig. 1).

These and similar molecular insights into how cancer-specific chromosomal abnormalities act pathogenetically sparked an enormous interest in cytogenetics as a powerful means to pinpoint the locations of genes important in tumorigenesis, and an impressive amount of information has been accumulated through these efforts. Almost 1000 gene fusions have been found by genomic characterization of breakpoints in cytogenetically identified aberrations, balanced as well as unbalanced, in various leukemias, lymphomas, and solid tumors. The accumulated data have shown that the consequences of practically all gene fusions are in principle the same as those originally elucidated in BL and CML, i.e., deregulation of a seemingly normal gene or the creation of a hybrid gene. However, not all gene fusions have translational consequences, and some could result in gene inactivation [1, 9, 10].

The advent of massively parallel sequencing (MPS) has recently provided a radically new means to identify fusions at the DNA or RNA levels without any prior information on the cytogenetic features of the neoplastic cells. The results of such unbiased gene fusion detection efforts during the last decade have dramatically changed the gene fusion landscape. More than 30,000 gene fusions, the great majority involving previously unsuspected genes, have now been identified through deep sequencing in a wide variety of neoplasms and these are reported in a number of online resources, e.g., https://mitelmandatabase.isb-cgc.org/, http://atlasgeneticsoncology.org/, https://cancer.sanger.ac.uk/cosmic, https://ccsm.uth.edu/FusionGDB/, http://www.kobic.re.kr/chimerdb/, https://tumorfusions.org/. A major challenge will be to verify by functional studies which of the alleged gene fusions are pathogenetically important in carcinogenesis, and which are either secondary progressional changes or non-consequential “noise” abnormalities, e.g., by-products of the genetic instability that characterizes many cancer cells. It is important to note that while gene fusions are a hallmark of neoplasia, they can also occur in heritable disorders such as the formation of the Lepore and anti-Lepore haemoglobins from the HBD and HBB genes [11].

There has never been a generally recommended, standardized way to denote gene fusions. Instead, multiple notations have been used with varying popularity over time, though the most common designation is SYMBOL-SYMBOL followed by SYMBOL/SYMBOL, both of which we critique below.

Problems with the nomenclature used to describe gene fusions

The SYMBOL-SYMBOL notation, e.g., BCR-ABL1, to denote fusion genes has three important shortcomings:

(1) The HGNC has approved the use of the hyphen separator, in collaboration with all contributing genome annotation groups involved in the Consensus Coding Sequence (CCDS) Project [12], for denoting readthrough transcripts, e.g., INS-IGF2.

(2) A hyphen is often used in the literature to denote members of a complex, e.g., MRE11-NBN, MRE11-RAD50-NBN.

(3) There are also specific groups of approved gene symbols containing hyphens as separators within the symbol, e.g., TRX-CAT1-2.

Hence, it is difficult to search specifically for gene fusions in databases and in the literature using the hyphen symbol.

The SYMBOL/SYMBOL notation, e.g., BCR/ABL1, has at least four major disadvantages:

(1) The forward slash is an accepted symbol in the established cytogenetic International System for Human Cytogenomic Nomenclature (ISCN) to denote different clones, both constitutionally (mosaicism) and in cancer cells; the Human Genome Variation Society (HGVS) guidelines (https://varnomen.hgvs.org/recommendations/general/) also use a forward slash to indicate mosaicism [13].

(2) The forward slash is often used in the literature in place of “either/or”, e.g., BRCA1/2, and to denote involvement of alternative genes in a fusion, e.g., “SS18-SSX1/SSX2”.

(3) Pathway and complex descriptions use this character, e.g., RAS/RAF/MAPK.

(4) Commercial dual fusion fluorescence in situ hybridization translocation probes (CE marked) also use a forward slash to indicate the two probe sets used, e.g., BCR/ABL1.

In view of the considerations listed above, and hence the clear need for a standardized and unique way to denote gene fusions, the HGNC concluded that an alternative needed to be sought to replace the use of either a hyphen (-) or a forward slash (/).

Recommended new nomenclature to describe gene fusions

After careful deliberation, and consultations with experts in the field, HGNC recommends that a new separator—a double colon (::)—be used in describing gene fusions, e.g., BCR::ABL1. The double colon (::) has several important advantages:

First, it follows the long-standing recommendation of the internationally accepted ISCN cytogenetic nomenclature in which a single colon (:) is used to indicate a chromosome break and a double colon (::) to denote break and reunion [14]. The:: separator thus nicely reflects the principal mode of origin of most fusion genes. We are aware that fusion transcripts may occasionally originate at the RNA level through cis- or trans-splicing without a genomic breakage and reunion correlate, but we deem it unnecessary to create a different nomenclature system for such events, especially since the HGVS already recommends using the double colon to describe RNA fusion transcripts (https://varnomen.hgvs.org/recommendations/general/).

Secondly, it is instantly recognizable and creates a unique symbol in the existing gene nomenclature, and hence is easily searchable in databases and in the literature.

Thirdly, different gene fusions found in different single cells or in separate clones within the same tumor will be easily recognizable, i.e., SYMBOL::SYMBOL/SYMBOL::SYMBOL.

Specific recommendations when describing gene fusions

In line with established HGNC recommendations [15], genes involved in fusions should be designated by their HGNC approved gene symbols written in italics, whereas proteins are not italicized, e.g., BCR::ABL1 denotes a fusion of the BCR and ABL1 genes, while BCR::ABL1 designates the corresponding protein product. By convention, fusion transcripts identified at the RNA level, e.g., by RNA-seq, are designated as genes, i.e., in italics. The double colon (::) should be used for all types of gene fusions, i.e., both those giving rise to a hybrid, chimeric gene (BCR::ABL1) and those where regulatory elements from one gene deregulate a partner gene (IGH::MYC).

In accordance with established practice in designating gene fusions, the 5′ partner gene should always be listed first in the description of a fusion gene, i.e., before the double colon, irrespective of chromosomal location or the orientation of the gene. Thus, in the BCR::ABL1 fusion gene—the outcome of the translocation t(9;22)(q34.1;q11.2)—the BCR gene in chromosome 22 is the 5′ gene, the ABL1 gene from chromosome 9 is the 3′ gene.

In tables in scientific articles and in databases presenting gene fusions, the two genes are often designated either as “Gene A” and “Gene B” or “Gene 1” and “Gene 2”. Thus, it is not explicitly stated that “Gene A” or “Gene 1” represents the 5′ gene although this is usually the case. To avoid ambiguous interpretations, HGNC recommends that 5′ and 3′ genes be clearly indicated in tables showing gene fusions. In fusions giving rise to a deregulated gene the regulatory or enhancer element should be listed first, whenever known.

If one of the genes in a fusion is unknown this may be indicated by either a question mark (?) or by the chromosomal band where the breakpoint is located, following ISCN convention. If one of the breakpoints lies in an intergenic region and the genomic coordinate of that breakpoint is known, this can be denoted in an abbreviated format for publication as chr#:g.coordinate number. For example, ABL1::? denotes a fusion between ABL1 and an unknown gene, 6q25::ABL1 a fusion between an unknown gene located in chromosome band 6q25 and ABL1, and ABL1::chr11.g:1850000 a fusion between ABL1 and a breakpoint at nucleotide 1,850,000 on chromosome 11. In the first and third examples the unknown gene and intergenic region are the 3′ partners, and in the second example the unknown gene is the 5′ partner. Full ISCN [14] (https://iscn.karger.com/) or HGVS (https://varnomen.hgvs.org/recommendations/DNA/variant/complex/) nomenclature should be used for formal reporting.

Note that HGNC always advocate listing a stable gene ID, ideally an HGNC ID, when referencing genes in publications. We do not recommend that the IDs be included in the fusion notation, but rather in the accompanying text, e.g., a gene fusion involving BCR(HGNC:1014) and ABL1(HGNC:76) is denoted as BCR::ABL1.

Conclusions

There has long been a need for a unique, standardized and easily recognizable way to symbolize gene fusion events consistently, both in the literature and in databases. Following consultation with experts in the field of gene fusions, HGNC recommends the use of the separator “::”, a double colon, between approved gene symbols, to designate the genes involved in gene fusion events, e.g., BCR::ABL1. This recommendation is further endorsed by all the authors of this manuscript, by the HGVS and the ISCN, and by the following resources: the WHO Classification of Tumors, COSMIC, OMIM, Atlas of Genetics and Cytogenetics in Oncology and Haematology, Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer, and the Tumor Fusion Gene Data Portal. We urge all readers to use and publicize this form of notation for describing gene fusions in all future communications to avoid confusion. We recognize that this is a newly established recommendation that could be further developed in due course, and welcome feedback from the community (via the HGNC website, www.genenames.org).

References

Mertens F, Johansson B, Fioretos T, Mitelman F. The emerging complexity of gene fusions in cancer. Nat Rev Cancer. 2015;15:371–81.
Article CAS Google Scholar
Dalla-Favera R, Bregni M, Erikson J, Patterson D, Gallo RC, Croce CM. Human c-myc onc gene is located on the region of chromosome 8 that is translocated in Burkitt lymphoma cells. Proc Natl Acad Sci USA. 1982;79:7824–7.
Article CAS Google Scholar
Taub R, Kirsch I, Morton C, Lenoir G, Swan D, Tronick S, et al. Translocation of the c-myc gene into the immunoglobulin heavy chain locus in human Burkitt lymphoma and murine plasmacytoma cells. Proc Natl Acad Sci USA. 1982;79:7837–41.
Article CAS Google Scholar
Erikson J, Nishikura K, ar-Rushdi A, Finan J, Emanuel B, Lenoir G, et al. Translocation of an immunoglobulin κ locus to a region 3’ of an unrearranged c-myc oncogene enhances c-myc transcription. Proc Natl Acad Sci USA. 1983;80:7581–5.
Article CAS Google Scholar
Heisterkamp N, Stam K, Groffen J, de Klein A, Grosveld G. Structural organization of the bcr gene and its role in the Ph’ translocation. Nature. 1985;315:758–61.
Article CAS Google Scholar
Shtivelman E, Lifshitz B, Gale RP, Canaani E. Fused transcript of abl and bcr genes in chronic myelogenous leukaemia. Nature. 1985;315:550–4.
Article CAS Google Scholar
Ben-Neriah Y, Daley GQ, Mes-Masson AM, Witte ON, Baltimore D. The chronic myelogenous leukemia-specific P210 protein is the product of the bcr/abl hybrid gene. Science. 1986;233:212–4.
Article CAS Google Scholar
Fainstein E, Marcelle C, Rosner A, Canaani E, Gale RP, Dreazen O, et al. A new fused transcript in Philadelphia chromosome positive acute lymphocytic leukaemia. Nature. 1987;330:386–8.
Article CAS Google Scholar
Kumar-Sinha C, Kalyana-Sundaram S, Chinnaiyan AM. Landscape of gene fusions in epithelial cancers: seq and ye shall find. Genome Med. 2015;7:129.
Article Google Scholar
Rowley JD, Le Beau MM, Rabbitts TH, editors. Chromosomal Translocations and Genome Rearrangements in Cancer. New York: Springer; 2015.
Efremov GD. Hemoglobins Lepore and anti-Lepore. Hemoglobin. 1978;2:197–233.
Article CAS Google Scholar
Pujar S, O’Leary NA, Farrell CM, Loveland JE, Mudge JM, Wallin C, et al. Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation. Nucleic Acids Res. 2018;46:D221–228. D1
Article CAS Google Scholar
den Dunnen JT, Dalgleish R, Maglott DR, Hart RK, Greenblatt MS, McGowan-Jordan J, et al. HGVS recommendations for the description of sequence variants: 2016 Update. Hum Mutat. 2016;37:564–9.
Article Google Scholar
McGowan-Jordan J, Hastings R, Moore S, editors. An International System for Human Cytogenomic Nomenclature. Cytogenet Genome Res. 2020;160:341–503.
Bruford EA, Braschi B, Denny P, Jones TEM, Seal RL, Tweedie S. Guidelines for human gene nomenclature. Nat Genet. 2020;52:754–8.
Article CAS Google Scholar

Download references

Acknowledgements

EAB is currently funded by the National Human Genome Research Institute (NHGRI) grant U24HG003345 and Wellcome Trust grant 208349/Z/17/Z. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health, nor the views of the authors’ employers and associated institutions. Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer/World Health Organization. We thank our colleagues from the College of American Pathologists, and Dr Zbyslaw Sondka from COSMIC, for useful discussions.

Author information

Authors and Affiliations

HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Elspeth A. Bruford
Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK
Elspeth A. Bruford
Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Cristina R. Antonescu
Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
Andrew J. Carroll
University of Michigan Medical School, Ann Arbor, MI, USA
Arul Chinnaiyan
Howard Hughes Medical Institute, Chevy Chase, MD, USA
Arul Chinnaiyan
International Agency for Research on Cancer, World Health Organization, Lyon, France
Ian A. Cree
Faculty of Medicine, University of Southampton, Southampton, UK
Nicholas C. P. Cross
Wessex Regional Genetics Laboratory, Salisbury NHS Foundation Trust, Salisbury, Wiltshire, UK
Nicholas C. P. Cross
Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
Raymond Dalgleish
Centre for Haematology Research, Department of Immunology and Inflammation, Imperial College London, London, UK
Robert Peter Gale
Translational and Clinical Research Institute, Newcastle University Centre for Cancer, Newcastle upon Tyne, UK
Christine J. Harrison
The Women’s Centre, John Radcliffe Hospital, Oxford University Hospitals Foundation Trust, Oxford, UK
Rosalind J. Hastings
10 rue des Treilles, Masseuil, Quinçay, France
Jean-Loup Huret
Division of Clinical Genetics, Department of Laboratory Medicine, Lund University, Lund, Sweden
Bertil Johansson, Fredrik Mertens & Felix Mitelman
Comprehensive Cancer Center, University of Chicago, Chicago, IL, USA
Michelle Le Beau
Department of Medicine and Surgery, University of Perugia, Perugia, Italy
Cristina Mecucci
Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
Roel Verhaak
Department of Neurosurgery, Amsterdam University Medical Center, Amsterdam, Netherlands
Roel Verhaak

Authors

Elspeth A. Bruford
View author publications
You can also search for this author in PubMed Google Scholar
Cristina R. Antonescu
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Carroll
View author publications
You can also search for this author in PubMed Google Scholar
Arul Chinnaiyan
View author publications
You can also search for this author in PubMed Google Scholar
Ian A. Cree
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas C. P. Cross
View author publications
You can also search for this author in PubMed Google Scholar
Raymond Dalgleish
View author publications
You can also search for this author in PubMed Google Scholar
Robert Peter Gale
View author publications
You can also search for this author in PubMed Google Scholar
Christine J. Harrison
View author publications
You can also search for this author in PubMed Google Scholar
Rosalind J. Hastings
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Loup Huret
View author publications
You can also search for this author in PubMed Google Scholar
Bertil Johansson
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Le Beau
View author publications
You can also search for this author in PubMed Google Scholar
Cristina Mecucci
View author publications
You can also search for this author in PubMed Google Scholar
Fredrik Mertens
View author publications
You can also search for this author in PubMed Google Scholar
Roel Verhaak
View author publications
You can also search for this author in PubMed Google Scholar
Felix Mitelman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

RPG, NCPC and EAB instigated the initial discussions. EAB coordinated the work, FM and EB drafted the paper, and JLH created the figure. All authors commented on and edited previous drafts, and approved the final paper.

Corresponding author

Correspondence to Elspeth A. Bruford.

Ethics declarations

Competing interests

MLB is a member of the Board of Directors of the American Cancer Society and a member of the ACS Cancer Action Network; RV is a co-founder of, and has received research funding from, Boundless Bio.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bruford, E.A., Antonescu, C.R., Carroll, A.J. et al. HUGO Gene Nomenclature Committee (HGNC) recommendations for the designation of gene fusions. Leukemia 35, 3040–3043 (2021). https://doi.org/10.1038/s41375-021-01436-6

Download citation

Received: 08 September 2021
Revised: 10 September 2021
Accepted: 21 September 2021
Published: 06 October 2021
Issue Date: November 2021
DOI: https://doi.org/10.1038/s41375-021-01436-6

This article is cited by

Präzisionsmedizin in der Onkologie
- Elisabeth Mack
Die Innere Medizin (2024)
Tri©DB: an integrated platform of knowledgebase and reporting system for cancer precision medicine
- Wei Jiang
- Peng-Ying Wang
- Yun-Juan Bao
Journal of Translational Medicine (2023)
Identification of apoptosis-related key genes and the associated regulation mechanism in thoracic aortic aneurysm
- Qi Ma
- Long Hu
- Huimin Zeng
BMC Cardiovascular Disorders (2023)
The 5th edition of the WHO classification of haematolymphoid tumors: comments from the Groupe Francophone de Cytogénétique Hématologique (GFCH)
- Florence Nguyen-Khac
- Audrey Bidet
- Christine Lefebvre
Leukemia (2023)
Survival with chronic myeloid leukaemia after failing milestones
- Michael Lauseker
- Rüdiger Hehlmann
- Susanne Saußele
Leukemia (2023)

HUGO Gene Nomenclature Committee (HGNC) recommendations for the designation of gene fusions

Subjects

Abstract

Brief historical background of gene fusions

Problems with the nomenclature used to describe gene fusions

Recommended new nomenclature to describe gene fusions

Specific recommendations when describing gene fusions

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

This article is cited by

Präzisionsmedizin in der Onkologie

Tri©DB: an integrated platform of knowledgebase and reporting system for cancer precision medicine

Identification of apoptosis-related key genes and the associated regulation mechanism in thoracic aortic aneurysm

The 5th edition of the WHO classification of haematolymphoid tumors: comments from the Groupe Francophone de Cytogénétique Hématologique (GFCH)

Survival with chronic myeloid leukaemia after failing milestones

HGNC nomenclature for fusion genes

Search

Quick links

Subjects

Abstract

Brief historical background of gene fusions

Problems with the nomenclature used to describe gene fusions

Recommended new nomenclature to describe gene fusions

Specific recommendations when describing gene fusions

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links