Abstract
The DNA sequence of human chromosome 21 (HSA21)1 has opened the route for a systematic molecular characterization of all of its genes. Trisomy 21 is associated with Down's syndrome, the most common genetic cause of mental retardation in humans. The phenotype includes various organ dysmorphies, stereotypic craniofacial anomalies and brain malformations2. Molecular analysis of congenital aneuploidies poses a particular challenge because the aneuploid region contains many protein-coding genes whose function is unknown. One essential step towards understanding their function is to analyse mRNA expression patterns at key stages of organism development. Seminal works in flies, frogs and mice showed that genes whose expression is restricted spatially and/or temporally are often linked with specific ontogenic processes. Here we describe expression profiles of mouse orthologues to HSA21 genes by a combination of large-scale mRNA in situ hybridization at critical stages of embryonic and brain development and in silico (computed) mining of expressed sequence tags. This chromosome-scale expression annotation associates many of the genes tested with a potential biological role and suggests candidates for the pathogenesis of Down's syndrome.
Main
The current HSA21 gene catalogue1 (http://chr21.molgen.mpg.de) contains 238 entries for which we have identified 168 cognate mouse orthologues (see Methods in the Supplementary Information). We isolated 187 mouse cDNA clones matching 158 unique genes (referred to as mmu21 genes; see Supplementary Table 1) whose orthology was confirmed on the basis of the known synteny between HSA21 and segments of mouse chromosomes MMU16, MMU10 and MMU17 (http://www.informatics.jax.org/) (see Supplementary Information; this database is also available at http://chr21.molgen.mpg.de/hsa21/).
To identify potential candidates with a role in patterning and organ development, we have explored the expression of the 158 mmu21 genes by systematic whole-mount in situ hybridization3 (WISH) at mid-gestation (embryonic day 9.5; E9.5), a stage covering a wide range of embryonic processes. We also analysed a subset of clones at two other stages (Supplementary Table 1). We found 111 of 158 genes expressed at E9.5; 78 genes showed widespread expression and 33 genes a restricted pattern (49% and 21%, respectively, of the genes examined at E9.5). In addition, 12 widespread genes also defined particular embryonic structures (for example, Prkcbp2; Supplementary Table 1). Among the mmu21 genes conserved in Saccharomyces cerevisiae (Y), Caenorhabditis elegans (C) and Drosophila melanogaster (D) (CDY in Supplementary Table 1), 60% were widely expressed and 17% showed a restricted pattern. In contrast, 43% of the genes conserved only in multicellular organisms (CD) were widespread and 30% were patterned. The functional classification of the 45 pattern genes shows a bias for cell–cell communication and signal transduction molecules. Only six of the mmu21 genes were previously analysed by whole mounts4,5,6,7,8,9, including Ets2, a gene that causes skeletal defects in transgenic mice10. Our analysis at E9.5 identified a number of interesting patterns (Fig. 1). Kiaa0184 is a new gene with a striking expression profile in the central and peripheral nervous systems, marking subsets of cells in the midbrain and hindbrain, basal plate margin cells of the spinal cord, and cranial as well as dorsal root ganglia. Samsn1, containing SH3 and SAM domains, was found associated with blood vessel formation. In addition, we uncovered new potential roles for previously characterized genes. Tsga2, a component of junctional membrane complexes reported to be testis-specific11, is restricted to rhombomeres at E9.5 and might take part in early events of hindbrain patterning. Informative patterns might be associated with organ-specific traits possibly found in Down's syndrome, although their function remains to be demonstrated. Igsf5 and Tff3 are candidates for male sterility. Igsf5 is present in the mesonephros required for seminiferous tubule formation. Tff3, a secretory protein promoting cell migration in intestinal epithelium12,13, was detected in a small group of cells possibly representing primordial germ cells. Sh3bgr is strikingly expressed in heart, correlating with the specific cardiac defects in Down's syndrome. Genes (such as Slc19a1, Clic6 and Lss) expressed in the branchial apparatus might have a role in the stereotypic facial abnormalities. Adamts5, a disintegrin metalloprotease, is specifically expressed in the maxillary process of the branchial apparatus, and transiently (E8.75–9.5) in the ventral midbrain, making it a candidate for facial abnormalities and mental retardation in Down's syndrome. Of the 28 genes expressed in the brain and/or head mesenchyme at E9.5, 20 show a regionalized expression in the maturing brain at postnatal day 2 (P2) (Supplementary Table 1); examples are Clic6 and Tsga2, which are restricted to the choroid plexus (Fig. 2).
We have investigated the expression of mmu21 genes in neonatal brain, the most important organ affected in Down's syndrome. At P2, the maturation of brain structures involves critical events, including a major shift from neurogenesis to gliogenesis and the elaboration of cytoarchitecture and connectivity. Under our experimental conditions and scoring criteria, 60% of the mmu21 genes were found expressed by in situ hybridization (ISH) in the P2 mouse brain (Supplementary Table 1). Of the clones expressed in brain, 42% gave widespread signals, half of these also displaying regionally enhanced expression (for example, Pfkl; Supplementary Table 1); 58% of the expressed clones defined only patterns in discrete brain territories, as illustrated in Fig. 2. Figure 3 summarizes the distribution of the mmu21 transcriptome in 11 mouse brain regions.
Overall, there was a prominent expression of mmu21 genes in post-mitotic cells (Fig. 2), whereas only 19 genes were found expressed in the mitotically active ventricular/subventricular zones (Supplementary Table 1). The largest fraction of the mmu21 transcripts expressed at P2 was detected in three major laminar structures of the brain: neocortex (41%), hippocampus (25%) and cerebellum (25%). These structures develop through partly common mechanisms14,15 and share the expression of a significant number of genes (Fig. 3). We observed a bias towards neocortical expression, although it remains unclear whether such predominance is also found for the genes encoded by other chromosomes16,17. Transcripts prominently expressed in this structure represent a pool of candidates for the cognitive defects of Down's syndrome. Dscam, Synj1 and Tiam1 have already been described as being involved in synaptic function, axonal guidance, cell migration and neurite outgrowth18,19,20. The hippocampus is necessary for the integrity of memory, and impaired spatial memory in Down's syndrome21 might be associated with some of the hippocampal genes, such as the semaphorin-like oncogene Pttg1ip (ref. 22). A plot of the brain expression domains along the chromosome (Fig. 3a) did not show evidence for the existence of local expression clusters. We found four genes expressed in the meninges (Wrb, Col18a1, Col6a1 and Col6a2), a tissue enveloping the central nervous system. The three collagen genes are physically close on 21q22.3 and might represent a functionally regulated gene cluster. Because many of the mmu21 genes are expressed in the dorsal brain at P2, our data would be compatible with the hypothesis that cumulative gene dosage effects might underlie brain malfunction in trisomy 21 (refs 23–25).
We obtained a wider overview of mmu21 gene expression by extracting in silico expression data from sequenced mouse cDNA libraries representing a large range of tissues and developmental stages. Screening of 632 libraries from non-cancerous tissues totalling 2,253,129 mouse expressed sequence tags (ESTs) identified 5,454 ESTs (on average 34 EST hits per mmu21 gene) displayed in Supplementary Table 1 (see Methods in the Supplementary Information), allowing a detailed analysis of mmu21 gene expression. Although EST mining shows limitations for measuring differential gene expression levels26,27,28,29, we can exploit EST counts to reveal patterns of genes with correlated expression profiles by using a correlation-based clustering method30. Figure 4 shows the expression profiles for the mmu21 genes, many of which are weakly or moderately expressed in more than one tissue. Interestingly, 19 genes seemed prominently expressed in single tissues (red boxes in Fig. 4), suggesting that they might have a crucial role in a tissue-specific mechanism (for example, Tff (refs 12, 13)). Gene clusters identified here provide information on possible roles of these genes in common biological mechanisms. For instance, five genes (Runx1, Aire, Ifngr2, Itgb2 and Ubash3a) from cluster no. 8 showing thymus-specific expression are linked to immunity or haematopoiesis.
We sought to compare ISH data with expression profiles generated by other approaches. Genes that were scored as not expressed (or not interpretable) on brain sections were tested by polymerase chain reaction with reverse transcription (RT–PCR) on whole P2 brain. As expected, RT–PCR was more sensitive: almost 50% of these ISH-negative genes were found expressed in whole P2 brain by this method (Supplementary Table 1). It is likely that most of those are either expressed ubiquitously at very low levels or are present within a few cells that were absent from the tested sections, which cannot cover the whole brain even though median sagittal sections span a maximum of structures. ISH and RT–PCR are complementary, because ISH aims at identifying patterned genes that are often highly expressed in a given territory, whereas RT–PCR lacks topographical resolution. In addition, the efficiency of riboprobes depends in part on their sequence composition or on the occurrence of splice variants that might show differential expression. It was difficult to correlate our expression map of the P2 brain with the EST information, owing to the heterogeneity in size, tissue region and developmental stages of the original material for the different brain EST libraries analysed. Of the genes found expressed by section ISH, 60% did identify brain ESTs (interrogating 251,426 brain ESTs pooled for all brain regions and ages ranging from neonatal to adult, excluding embryo; see Supplementary Information); 65% of the ISH-/PCR+ genes hit brain ESTs, which originated from stages older than P2 with a few exceptions, for example Ifnrg2 and Cryzl1 at neonatal stage. Of the genes scoring ISH-/PCR- at P2, 37% hit brain ESTs originating mostly from adult stages, except Cbs, which is present in the neonatal cortex. Finally, 16 genes are not, or are very weakly, expressed in neonatal brain because they scored negative by all three methods (Cldn8, Kcne1, Runx1, Sim2, Wdr9, C21orf11, Mx2, Mx1, C21orf25, Tff3, Tff2, Ubash3a, Wdr4, Kiaa0958, Ftcd and Hsf2bp).
As expected, the correlation between data from WISH and EST mining was very good because both techniques are able to detect transcripts in most or all embryonic tissues (620,471 ESTs ranging from stage 2 cells to E19.5). This correlation was quantified statistically with Fisher's exact test to refute the hypothesis of statistical independence of the two techniques. For each gene we denoted whether it had positive or negative expression in both methods. Out of 158 genes tested, 123 showed an agreement either with expression (100) or lack of expression (23) in both techniques, whereas in 35 cases we found a disagreement (P = 2.0496 × 10-7). Of the 47 genes not detected at E9.5 by WISH, 23 are unlikely to be expressed at early stages of development because they do not hit any embryonic EST either. One-quarter are found only expressed in libraries originating from later stages, starting from E12.5–E14.5. Ten genes hit ESTs encompassing stage E9.5 (Tmprss2, Slc37a1, Cbs, Tmem1, Dnmt3l, C21orf19, Ifnar2, Cldn8, Usp16 and Usp25) but were not detected, possibly because of very low expression levels in E9.5 embryos. Overall, most of the transcripts detected at E9.5 by WISH did hit ESTs and also showed sustained expression until stages E14.5–E15.5 or later. Nevertheless, embryonic EST libraries are often derived from the whole fetus and therefore lack organ information. Where information is available, EST mining is correlated with the WISH results (for example, Sh3bgr in heart, or Kiaa0184 and Pcbp3 in brain).
We observed local clusters of gene expression in several chromosomal regions. Significance was judged by computing upper-tail P values (<0.01 in all cases) for observed numbers of expressed genes in groups of neighbouring genes of given sizes in accordance with a hypergeometric distribution. WISH data suggest co-expression from Pdxk to Mcm3ap in head mesenchyme and extra-embryonic component, and from Tmprss2 to Abcg1 in the nose. EST mining data give evidence for a significant group from H2bfs to Pfkl co-expressed in haematopoietic tissues, kidney, liver and mammary gland. In addition, a large silenced region from Kcne2 to Cryaa was identified in brain, ovary, retina and testis.
Our analysis of the pattern of gene expression for HSA21 is based on complementary techniques, combining the thorough coverage of developmental stages by EST mining with the high anatomical resolution of WISH and brain ISH. We have identified new expression patterns for a large number of genes, including many with highly localized expression, and generated information on the possible functions for a large fraction of the HSA21 genes. This study provides tools and insights for the further analysis of developmental processes and identifies a pool of gene candidates for phenotypes specific to Down's syndrome and other pathologies linked to HSA21.
References
The chromosome 21 mapping and sequencing consortium The DNA sequence of human chromosome 21. Nature 405, 311–319 (2000)
Epstein, C. J., et al. The Metabolic and Molecular Bases of Inherited Disease, 7th edn (ed. Scriver, C. R.) 749–794 (McGraw-Hill, New York, 1995)
Neidhardt, L. et al. Large-scale screen for genes controlling mammalian embryogenesis, using high-throughput gene expression analysis in mouse embryos. Mech. Dev. 98, 77–94 (2000)
Maroulakou, I. G., Papas, T. S. & Green, J. E. Differential expression of ets-1 and ets-2 proto-oncogenes during murine embryogenesis. Oncogene 9, 1551–1565 (1994)
Lin, B. et al. Prostate-localized and androgen-regulated expression of the membrane-bound serine protease TMPRSS2. Cancer Res. 59, 4180–4184 (1999)
Egeo, A. et al. Developmental expression of the SH3BGR gene, mapping to the Down syndrome heart critical region. Mech. Dev. 90, 313–316 (2000)
Epstein, D. J. et al. Members of the bHLH-PAS family regulate Shh transcription in forebrain regions of the mouse CNS. Development 127, 4701–4709 (2000)
Orti, R. et al. Characterization of a novel gene, C21orf6, mapping to a critical region of chromosome 21q22.1 involved in the monosomy 21 phenotype and of its murine ortholog, orf5. Genomics 64, 203–210 (2000)
Casas, C. et al. Dscr1, a novel endogenous inhibitor of calcineurin signaling, is expressed in the primitive ventricle of the heart and during neurogenesis. Mech. Dev. 101, 289–292 (2001)
Sumarsono, S. H. et al. Down's syndrome-like skeletal abnormalities in Ets2 transgenic mice. Nature 379, 534–537 (1996)
Taketo, M. M. et al. Mapping of eight testis-specific genes to mouse chromosomes. Genomics 46, 138–142 (1997)
Mashimo, H., Wu, D. C., Podolsky, D. K. & Fishman, M. C. Impaired defense of intestinal mucosa in mice lacking intestinal trefoil factor. Science 274, 262–265 (1996)
Otto, W. R. & Patel, K. Trefoil factor family (TFF)-domain peptides in the mouse: embryonic gastrointestinal expression and wounding response. Anat. Embryol. 199, 499–508 (1999)
Altman, J. & Bayer, S. A. Atlas of Prenatal Rat Brain Development (CRC Press, Boca Raton, 1995)
Dahmane, N. et al. The Sonic Hedgehog–Gli pathway regulates dorsal brain growth and tumorigenesis. Development 128, 5201–5212 (2001)
Sandberg, R. et al. Regional and strain-specific gene expression mapping in the adult mouse brain. Proc. Natl Acad. Sci. USA 97, 11038–11043 (2000)
Miki, R. et al. Delineating developmental and metabolic pathways in vivo by expression profiling using the RIKEN set of 18,816 full-length enriched mouse cDNA arrays. Proc. Natl Acad. Sci. USA 98, 2199–2204 (2001)
Leeuwen, F. N. et al. The guanine nucleotide exchange factor Tiam1 affects neuronal morphology; opposing roles for the small GTPases Rac and Rho. J. Cell Biol. 139, 797–807 (1997)
Yamakawa, K. et al. DSCAM: a novel member of the immunoglobulin superfamily maps in a Down syndrome region and is involved in the development of the nervous system. Hum. Mol. Genet. 7, 227–237 (1998)
Saito, T. et al. Mutation analysis of SYNJ1: a possible candidate gene for chromosome 21q22-linked bipolar disorder. Mol. Psychiat. 6, 387–395 (2001)
Sylvester, P. E. The hippocampus in Down's syndrome. J. Ment. Defic. Res. 27, 227–236 (1983)
Pei, L. Identification of c-myc as a down-stream target for pituitary tumour- transforming gene. J. Biol. Chem. 276, 8484–8491 (2001)
Delabar, J. M. et al. Molecular mapping of twenty-four features of Down syndrome on chromosome 21. Eur. J. Hum. Genet. 1, 114–124 (1993)
Korenberg, J. R. et al. Down syndrome phenotypes: the consequences of chromosomal imbalance. Proc. Natl Acad. Sci. USA 91, 4997–5001 (1994)
Kahlem, P. & Yaspo, M. L. Human chromosome 21 sequence: impact for the molecular genetics of Down syndrome. Gene Funct. Dis. 1, 175–183 (2000)
Schmitt, A. O. et al. Exhaustive mining of EST libraries for genes differentially expressed in normal and tumour tissues. Nucleic Acids Res. 27, 4251–4260 (1999)
Vingron, M. & Hoheisel, J. Computational aspects of expression data. J. Mol. Med. 77, 3–7 (1999)
Scheurle, D. et al. Cancer gene discovery using digital differential display. Cancer Res 60, 4037–4043 (2000)
Krause, A., Haas, S. A., Coward, E. & Vingron, M. SYSTERS, GeneNest, SpliceNest: exploring sequence space from genome to protein. Nucleic Acids Res. 30, 299–300 (2002)
Ewing, R. M. et al. Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression. Genome Res. 9, 950–959 (1999)
Acknowledgements
We thank M. Sultan, H.-J. Warnatz, G. Teltow, B. Eppens, D. Balzereit, C. Bail, V. Palma, P. Sánchez, M. Beyna, J. Mullor and M. Rallu for their contribution; S. Antonarakis for providing clones; P. Avner for the mouse somatic cell hybrids; and the Resource Center of the German Genome Project (RZPD, Berlin) for providing clones and filters. Y.G., N.D., L.N., P.K. and A.B.K. contributed equally. This work was supported by grants from the Hirscht Trust and the NIH-NINDS and NCI to A.R.A.; from an ATIPE grant to N.D.; from the German Human Genome Project to B.G.H.; from the National Genome Forschung Netz (NGFN), the Ministry of Education and Research (BMBF) and the Max Planck Society to M.-L.Y.
Author information
Authors and Affiliations
Consortia
Ethics declarations
Competing interests
The authors declare that they have no competing financial interests.
Supplementary information
41586_2002_BFnature01270_MOESM2_ESM.pdf
List and description of the EST libraries used for the comparison of the ISH and WISH data with the EST mining (PDF 46 kb)
41586_2002_BFnature01270_MOESM3_ESM.zip
Browser allowing to navigate in the matrix presented in Figure 4 and providing access to gene names and EST library descriptions displayed in the bottom line (ZIP 183 MB)
41586_2002_BFnature01270_MOESM4_ESM.pdf
Description of the 190 EST libraries represented in Figure 4, indicating their size, tissue origin and their reference UNILIB numbers in the EMBL database (PDF 46 kb)
41586_2002_BFnature01270_MOESM5_ESM.pdf
Dendograms (UPGMA trees) of the 159 mmu21 genes and of the 190 EST libraries, respectively, used to re-arrange the expression matrix shown in Fig. 4 (PDF 261 kb)
41586_2002_BFnature01270_MOESM7_ESM.pdf
Sequences of oligonucleotide primers used for RT-PCR studies and for mapping the mmu21 genes on mouse chromosomes. Primers are given with reference to clone numbers and gene names. For mapping, in silico information is also indicated together with synteny between HSA21 and MMU16, 10 and 17 (PDF 88 kb)
Rights and permissions
About this article
Cite this article
The HSA21 expression map initiative. A gene expression map of human chromosome 21 orthologues in the mouse. Nature 420, 586–590 (2002). https://doi.org/10.1038/nature01270
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nature01270
This article is cited by
-
HACS1 signaling adaptor protein recognizes a motif in the paired immunoglobulin receptor B cytoplasmic domain
Communications Biology (2020)
-
Meta-analysis of heterogeneous Down Syndrome data reveals consistent genome-wide dosage effects related to neurological processes
BMC Genomics (2011)
-
Apoptosis screening of human chromosome 21 proteins reveals novel cell death regulators
Molecular Biology Reports (2010)
-
A systematic screen for genes expressed in definitive endoderm by Serial Analysis of Gene Expression (SAGE)
BMC Developmental Biology (2007)
-
An additional human chromosome 21 causes suppression of neural fate of pluripotent mouse embryonic stem cells in a teratoma model
BMC Developmental Biology (2007)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.