Analysis | Published:

Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture

Nature Genetics volume 43, pages 10591065 (2011) | Download Citation


Hi-C experiments measure the probability of physical proximity between pairs of chromosomal loci on a genomic scale. We report on several systematic biases that substantially affect the Hi-C experimental procedure, including the distance between restriction sites, the GC content of trimmed ligation junctions and sequence uniqueness. To address these biases, we introduce an integrated probabilistic background model and develop algorithms to estimate its parameters and renormalize Hi-C data. Analysis of corrected human lymphoblast contact maps provides genome-wide evidence for interchromosomal aggregation of active chromatin marks, including DNase-hypersensitive sites and transcriptionally active foci. We observe extensive long-range (up to 400 kb) cis interactions at active promoters and derive asymmetric contact profiles next to transcription start sites and CTCF binding sites. Clusters of interacting chromosomal domains suggest physical separation of centromere-proximal and centromere-distal regions. These results provide a computational basis for the inference of chromosomal architectures from Hi-C experiments.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.


  1. 1.

    et al. An oestrogen-receptor-alpha-bound human chromatin interactome. Nature 462, 58–64 (2009).

  2. 2.

    et al. Polycomb-dependent regulatory contacts between distant Hox loci in Drosophila. Cell 144, 214–226 (2011).

  3. 3.

    et al. Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells. Nat. Genet. 42, 53–61 (2010).

  4. 4.

    & Nuclear organization of the genome and the potential for gene regulation. Nature 447, 413–417 (2007).

  5. 5.

    & Mapping cis- and trans- chromatin interaction networks using chromosome conformation capture (3C). Methods Mol. Biol. 464, 105–121 (2009).

  6. 6.

    et al. Quantitative analysis of chromosome conformation capture assays (3C-qPCR). Nat. Protoc. 2, 1722–1733 (2007).

  7. 7.

    , , & Capturing chromosome conformation. Science 295, 1306–1311 (2002).

  8. 8.

    et al. Nuclear organization of active and inactive chromatin domains uncovered by chromosome conformation capture-on-chip (4C). Nat. Genet. 38, 1348–1354 (2006).

  9. 9.

    et al. High-resolution identification of balanced and complex chromosomal rearrangements by 4C technology. Nat. Methods 6, 837–842 (2009).

  10. 10.

    et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).

  11. 11.

    The three 'C' s of chromosome conformation capture: controls, controls, controls. Nat. Methods 3, 17–21 (2006).

  12. 12.

    , , & Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res. 36, e105 (2008).

  13. 13.

    et al. Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Genome Biol. 12, R18 (2011).

  14. 14.

    & Transcription factories are nuclear subcompartments that remain in the absence of transcription. Genes Dev. 22, 20–25 (2008).

  15. 15.

    , , & The role of transcription factories in large-scale structure and dynamics of interphase chromatin. Semin. Cell Dev. Biol. 18, 691–697 (2007).

  16. 16.

    et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49 (2011).

  17. 17.

    et al. Diverse gene reprogramming events occur in the same spatial clusters of distal regulatory elements. Genome Res. 21, 697–706 (2011).

  18. 18.

    & CTCF: master weaver of the genome. Cell 137, 1194–1211 (2009).

  19. 19.

    , , & The insulator binding protein CTCF positions 20 nucleosomes around its binding sites across the human genome. PLoS Genet. 4, e1000138 (2008).

  20. 20.

    et al. CTCF-mediated functional chromatin interactome in pluripotent cells. Nat. Genet. 43, 630–638 (2011).

  21. 21.

    et al. Domain organization of human chromosomes revealed by mapping of nuclear lamina interactions. Nature 453, 948–951 (2008).

  22. 22.

    et al. Molecular maps of the reorganization of genome–nuclear lamina interactions during differentiation. Mol. Cell 38, 603–613 (2010).

  23. 23.

    et al. A three-dimensional model of the yeast genome. Nature 465, 363–367 (2010).

  24. 24.

    , & Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 18, 1851–1858 (2008).

  25. 25.

    et al. Discovery of functional noncoding elements by digital analysis of chromatin structure. Proc. Natl. Acad. Sci. USA 101, 16837–16842 (2004).

  26. 26.

    et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448, 553–560 (2007).

Download references


We thank W. de Laat for discussions and members of the Tanay group for critical reading of the manuscript. Research at A.T.'s laboratory was supported by the Israeli Science Foundation and by the EPIGENESYS FP7 program of the European Commission.

Author information


  1. Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel.

    • Eitan Yaffe
    •  & Amos Tanay


  1. Search for Eitan Yaffe in:

  2. Search for Amos Tanay in:


E.Y. and A.T. conceived and performed the analysis. E.Y and A.T wrote the article.

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Amos Tanay.

Supplementary information

PDF files

  1. 1.

    Supplementary Text and Figures

    Supplementary Figures 1–8 and Supplementary Table 1

About this article

Publication history





Further reading