Article | Open | Published:

A protein-truncating R179X variant in RNF186 confers protection against ulcerative colitis

Nature Communications volume 7, Article number: 12342 (2016) | Download Citation

  • An Erratum to this article was published on 13 September 2016

This article has been updated

Abstract

Protein-truncating variants protective against human disease provide in vivo validation of therapeutic targets. Here we used targeted sequencing to conduct a search for protein-truncating variants conferring protection against inflammatory bowel disease exploiting knowledge of common variants associated with the same disease. Through replication genotyping and imputation we found that a predicted protein-truncating variant (rs36095412, p.R179X, genotyped in 11,148 ulcerative colitis patients and 295,446 controls, MAF=up to 0.78%) in RNF186, a single-exon ring finger E3 ligase with strong colonic expression, protects against ulcerative colitis (overall P=6.89 × 10−7, odds ratio=0.30). We further demonstrate that the truncated protein exhibits reduced expression and altered subcellular localization, suggesting the protective mechanism may reside in the loss of an interaction or function via mislocalization and/or loss of an essential transmembrane domain.

Introduction

A total of 200 loci have been unequivocally implicated in the two common forms of inflammatory bowel diseases (IBDs): Crohn’s disease (CD) and ulcerative colitis (UC)1,2. For these findings, like most genome-wide association study (GWAS) results, it has proven challenging to infer the functional consequences of common variant associations3 beyond cases where protein-altering variants have been directly implicated. Protein-truncating variants (PTVs), also commonly referred to as loss-of-function variants4 as they often result in a non-functional or unstable gene product, are generally the strongest acting genetic variants in medical genetics and, as one functional copy of the gene is removed, may often provide insight into what is achievable pharmacologically via inhibition of the product of the gene5. Thus, identifying PTVs that are demonstrated to lead to loss of gene function and confer protection from disease hold particular promise for identifying therapeutic targets6,7,8,9.

Here we conduct targeted sequencing of the exons of 759 protein-coding genes in regions harbouring common variants associated to IBD in 917 healthy controls, 887 individuals with UC (cases) and 1,204 individuals with CD (cases) to identify predicted PTVs that may confer protection to disease. Through replication genotyping and imputation of a PTV in RNF186 (p.R179X) in 11,148 UC patients and 295,446 controls we find a significant protective association to UC. By combining RNA allele-specific expression, protein expression and immunoflourescene imaging, we find that the truncated protein exhibits reduced expression and altered subcellular localization suggesting the protective mechanism may reside in the loss of an interaction or function via mislocalization and/or loss of an essential transmembrane domain.

Results

Screen sequencing

We conducted targeted sequencing of the exons of 759 protein-coding genes in regions harbouring common variants associated to IBD10,11 in 917 healthy controls, 887 individuals with UC (cases) and 1,204 individuals with CD (cases) from the NIDDK IBD Genetics Consortium (North American clinical samples of European descent). We jointly analysed these data with sequencing data from the same genes taken from an exome-sequencing data set of Finnish individuals: 508 with UC; 238 with CD; and 8,124 Finnish reference samples sequenced within Sequencing Initiative Suomi (SISu) project ( www.sisuproject.fi)12. Across this targeted gene set, we discovered 77 PTVs found in 2 or more individuals (Supplementary Table 1), and used a Cochran–Mantel–Haenszel (CMH) χ2-test to scan for protective variants with two strata corresponding to the two cohorts. The test for association was run based on the phenotype (CD, UC or IBD) indicated by the common variant association in the region13 (that is, truncating variants in a gene associated only to CD such as at NOD2 would be tested for CD versus control association). We identified three putatively protective PTVs with a P value <0.05: (1) a previously published low-frequency variant in CARD9 (c.IVS11+1G>C) located on the donor site of exon 11, which disrupts splicing (P=0.04)7,14; (2) a frameshift indel in ABCA7 (P=0.02); and (3) a premature stop gain variant (p.R179X) in RNF186 with signal of association (P=0.02) to UC. As the CARD9 result was a well-established protective association, and ABCA7 contained four PTVs that in aggregate did not appear protective (combined odds ratio (OR)=0.51, P=0.21), we focused specifically on follow-up work to confirm or refute the association of the RNF186 nonsense variant (the only PTV detected in RNF186 in either sequence data set).

Replication

Replication genotype data obtained in 8,300 UC patients and 21,662 controls from the United States, Canada, the United Kingdom, Sweden, Belgium, Germany, Netherlands and Italy provided strong support that the premature stop-gain allele, p.R179X, confers protection against UC (P=0.0028, OR=0.36 (95% confidence interval (CI)=0.19–0.71)). Cluster plots from all genotyping assays were manually inspected to ensure consistent high quality across all experimental modalities used to assess this variant (Supplementary Figs 1 and 2).

Further evidence of replication was seen in whole-genome sequence data followed by imputation collected by deCODE Genetics15,16, in which a set of 1,453 Icelandic patients with UC were compared with a very large population sample (n=264,744) and a consistent strong protection (P=5.0 × 10−4, OR=0.30 (95% CI=0.15–0.59), imputation information of 0.99; overall replication P=8.69 × 10−6, OR=0.33 (0.20–0.55)) was observed between the truncating allele and the disease (Table 1 and Methods). Of note, this observation is advantaged by the property that R179X has a roughly fourfold higher frequency in Iceland (minor allele frequency (MAF)=0.78%) than in other European populations such that the Icelandic group, despite a moderate contribution in absolute number of cases (close to 1/6), have around half of the contribution in term of effective sample size and power.

Table 1: Association of p.R179X in RNF186 with ulcerative colitis.

Access to imputation data in 150,000 individuals from the Icelandic group enabled us to identify rare loss-of-function homozygotes17. We found eight individuals homozygous for the 179X allele, the oldest reached the age of 70 (still alive) and one of the eight died (age 62), consistent with Hardy–Weinberg expectation (n=9.1). There was no significant association of the homozygous genotype with a decreased lifespan or fertility (number of children). Given the lower frequency in other populations, no homozygous individual was expected and/or detected in the Exome Aggregation Consortium (ExAC) nor in the remaining set of individuals in this study. Together this indicates that having two copies with a stop gained in RNF186 is compatible with life, reproduction and ageing—and importantly does not highlight severe medical consequences that would be of obvious concern in developing a therapeutic to mimic the effect of this allele.

The combined significance across all samples tested is P=6.89 × 10−7 (OR=0.30 (95% CI=0.19–0.50))—considering we advanced only one variant to follow-up study, the replication P value of 8.7 × 10−6 is unequivocally significant and would have been significant even if 5,000 variants were put through this follow-up, let alone all 77 that were discovered in the sequencing screen. No other PTV in RNF186 was discovered and tested in our sequencing. The gene is relatively small (227 amino acids) and R179X is by far the most common detected PTV in ExAC, more than 10 times more common than the sum of all other PTVs in the gene.

RNF186 is located at 1p36 a locus implicated in a UC GWAS that identified at least two non-coding independent association signals separated by recombination hotspots (rs4654903 and rs3806308; r2=0.001, MAF=45.5% and 47%, respectively) that did not implicate any one of the three genes (RNF186OTUD3PLA2G2E) in the region18. Recently, a low-frequency coding variant in RNF186 (rs41264113, p.A64T, MAF=0.8%) was found to confer increased risk to UC (OR=1.49 (1.17–1.90))14. In the discovery and replication component of this study p.R179X was found to lie on the haplotype background of the non-reference allele for rs4654903, and very little correlation was observed with rs3806308 or the low-frequency coding variant p.A64T (Supplementary Fig. 3). Naturally, the protective signal at p.R179X remains unchanged when corrected for the background allele rs4654903 on the haplotype it arose on (Supplementary Table 2). Furthermore, we did not observe any evidence of association to CD (P=0.94; Supplementary Table 3), consistent with the common variant associations in this region, which are strong and specific to UC only.

Transcript and protein expression

RNF186, a single-exon protein-coding gene, encodes the ring finger E3 ligase, which localizes at the endoplasmic reticulum and regulates endoplasmic reticulum stress-mediated apoptosis in a caspase-dependent manner19,20. To understand the functional consequences of p.R179X we integrated transcriptome and protein expression level data. First, we examined the gene expression profile of RNF186 (encoded by a single transcript isoform: ENST00000375121) across multiple tissues in the Genotype Tissue Expression (GTEx) project and identify that its highest expression is in the transverse colon (median reads per kilobase of transcript per million mapped reads (RPKM)=17.32, n=61) with only three other tissues having an RPKM level above 1: (i) pancreas; (ii) kidney cortex; and (iii) the terminal ileum (Supplementary Fig. 4A)21. RNF186 protein was observed at ‘medium’ expression levels for tissues in the gastrointestinal tract (stomach, duodenum, small intestine, appendix and colon) in the Human Protein Atlas22 (Supplementary Fig. 4B).

Impact on transcript allele expression

We integrated allele-specific expression (ASE) data for individual carriers of p.R179X in the GTEx project. Within GTEx we identified two individual carriers containing ASE data for p.R179X in transverse colon and sigmoid colon. The carriers had consistent patterns of no ASE effects (Supplementary Figs 5 and 6) suggesting that nonsense-mediated decay does not degrade the aberrant transcripts containing the truncating alleles and that additional functional follow-up would be necessary to determine the molecular impact of p.R179X on RNF186 function (ref. 5). The gene contains one exon and is intronless, and there is prior expectation that those genes do not undergo nonsense-mediated decay since this presence is reported to require the presence of at least one intron.

Impact on protein allele expression

Given that R179X messenger RNA was detected at levels similar to the reference allele, we sought to quantify protein expression. Accordingly, we transiently transfected 293T cells with RNF186 expression constructs containing an epitope tag for detection by western blot with anti-V5 antibodies. As expected, we found that the reference allele of RNF186 was efficiently expressed at the protein level, whereas the truncated protective allele R179X was expressed at reduced levels and the missense risk allele A64T was expressed at higher levels relative to the reference allele (Fig. 1). Notably, the reference allele encodes two transmembrane domains and lacks an N-terminal signal peptide, which supports a model in which RNF186 N and C termini are present on the cytoplasmic side of membrane structures. In contrast, R179X lacks the second transmembrane domain, and must therefore position its N and C termini on opposite sides of the membrane. Collectively, reduced expression and altered membrane topology predict that R179X truncation impairs RNF186 function.

Figure 1: R179X impact on protein allele expression and cellular localization.
Figure 1

(a) Schematic diagram of the RNF186 protein with a zinc finger RING-type and two helical transmembrane (TM1 and TM2) domains, and the A64T and R179X variants shown. (b) 293T cells were transfected with the indicated expression constructs and analysed by western blot for expression of Rnf186 and the R179X variant. The RNF186 protein with a premature stop at amino-acid position 179 is expressed, but at reduced levels. (c) 293T cells were transfected with the indicated expression constructs and analysed by immunofluorescence to demonstrate altered subcellular localization of the R179X variant.

Impact on cellular localization

To determine if R179X truncation alters subcellular localization, we over-expressed RNF186 and variant in 293T cells for immunofluorescence imaging. While RNF186 localized to compact intracellular membrane structures, R179X appeared more diffuse, with preferential plasma membrane localization. To directly compare RNF186 and R179X localization in the same cell, we cotransfected 293T cells with Flag-tagged RNF186 and V5-tagged R179X. After staining with anti-V5 and anti-Flag antibodies, we observed very little overlap of RNF186 and R179X localization (Fig. 1). Importantly, these findings suggest that mislocalization of R179X impairs RNF186 function or otherwise alters association with interacting proteins and subsequent ubiquitination of putative substrates.

Discussion

This study strengthens the direct evidence for the involvement of RNF186 in UC risk in which a powerful allelic series, including common non-coding alleles and risk increasing p.A64T is available for further experimentation. Further supporting the medical relevance of this truncating variant, the same rare allele of the same variant coding p.R179X has been reported in Iceland to have a genome wide significant association with a modest increase in serum creatinine level (effect=0.13 s.d., P=5.7 × 10−10) and a modest increase in risk of chronic kidney disease16. In the context of the few other established protective variants in IBD, including the coding IL23R variants (p.V362I, MAF=1.27%, OR=0.72 (0.63–0.83); p.G149R, MAF=0.45%, OR=0.60 (0.45–.0.79)) and the splice disrupting CARD9 variant (c.IVS11+1G>C, MAF=0.58%, OR=0.29 (0.22–0.37))7, R179X exerts a comparable protective effect. Because of the strong protective effect associated with the RNF186 PTV, studies of RNF186 inhibition and the specific action of this variant protein should be useful in understanding the mechanism by which protection to UC disease occurs and whether this reveals a promising therapeutic opportunity similar to that which has been realized from the example of PCSK9 and cardiovascular disease.

Methods

Ethics statement

All patients and control subjects provided informed consent. Recruitment protocols and consent forms were approved by Institutional Review Boards at each participating institutions (Protocol Title: The Broad Institute Study of Inflammatory Bowel Disease Genetics; Protocol Number: 2013P002634). All DNA samples and data in this study were denominalized.

Cohort descriptions

For all cohorts, UC was diagnosed according to accepted clinical, endoscopic, radiological and histological findings.

Genotyping of the Belgian cohort was performed at the Laboratory for Genetics and Genomic Medicine of Inflammatory ( www.medgeni.org) of the Université de Montréal. Belgian patients were all recruited at the IBD unit of the University Hospital Leuven, Belgium; control samples are all unrelated, and without family history of IBD or other immune-related disorders.

NIDDK IBD Genetics Consortium (IBDGC) samples were recruited by the centres included in the NIDDK IBDGC: Cedars Sinai, Johns Hopkins University, University of Chicago and Yale, University of Montreal, University of Pittsburgh and University of Toronto. Additional samples were obtained from the Queensland Institute for Medical Research, Emory University and the University of Utah. Medical history was collected with standardized NIDDK IBDGC phenotype forms. Healthy controls are defined as those with no personal or family history of IBD. The Prospective Registry in IBD Study at MGH (PRISM) is a referral centre-based, prospective cohort of IBD patients. Enrollment began 1 January 2005. PRISM research protocols were reviewed and approved by the Partners Human Research Committee (#2004-P-001067), and all experiments adhered to the regulations of this review board. The PRISM study data were merged with population controls of European ancestry broadly consented for biomedical studies. These controls included samples from the NIMH repository23, POPRES24, the 1000 Genomes Project25 and controls ascertained for an age-related macular degeneration study26. The Italian samples were collected at the S. Giovanni Rotondo ‘CSS’ (SGRC) Hospital in Italy. The Dutch cohort is composed of UC cases recruited through the Inflammatory Bowel Disease unit of the University Medical Center Groningen (Groningen), the Academic Medical Center (Amsterdam), the Leiden University Medical Center (Leiden) and the Radboud University Medical Center (Nijmegen), and of healthy controls of self-declared European ancestry from volunteers at the University Medical Center (Utrecht).

Subject ascertainment, diagnosis and validation for the UK samples are described elsewhere and are part of the UK Inflammatory Bowel Disease Genetics Consortium (UKIBDGC)27.

German patients were recruited either at the Department of General Internal Medicine of the Christian-Albrechts-University Kiel, the Charité University Hospital Berlin, through local outpatient services, or nationwide with the support of the German Crohn and Colitis Foundation. German healthy control individuals were obtained from the popgen biobank. Genotyping of the German cohort was performed at the Institute for Clinical Molecular Biology.

Finnish patients were recruited from Helsinki University Hospital and described in more detail previously28,29.

Subject ascertainment, diagnosis and validation for a subset of the Swedish samples with UC14 and without UC30 are described elsewhere.

Icelandic population: a total of 1,453 individuals diagnosed with UC was used in the analysis. All the cases were histologically verified, and diagnosed either by 1997 or prospectively during the period 1997–2009 at Landspitali, the National University Hospital of Iceland.

NIDDK NHGRI targeted sequencing

Sample selection

We selected 3,008 samples (1,204 CD, 887 UC and 917 controls) for sequencing composed of North-American samples of European descent from the NIDDK IBD Genetics Consortium repository samples.

Target selection

Target exonic sequences were selected based on the coding exons of 759 genes (2.546 Mb). Genes were selected if they were in regions identified in the GWASs for inflammatory bowel disease in Franke et al. (CD)10 and Anderson et al. (UC)11.

Finland exome sequencing

Finnish individuals were exome sequenced as part of SISu ( www.sisuproject.fi). The SISu project consists of the following population and case–control cohorts: 1000 Genomes Project; ADGEN (Genetic, epigenetic and molecular identification of novel Alzheimer’s disease-related genes and pathways) Study; The Botnia (Diabetes in Western Finland) Study; EUFAM (European Study of Familial Dyslipidemias); The National FINRISK Study; FUSION (Finland–United States Investigation of NIDDM Genetics) Study; Health 2000 Survey; Inflammatory Bowel Disease Study, METSIM (METabolic Syndrome In Men) Study; Migraine Family Study; Oulu Dyslipidemia Families; Northern Finland Intellectual Disability (NFID); and Northern Finland Birth Cohort (NFBC). All samples were sequenced at the Broad Institute of MIT and Harvard, Cambridge, USA, University of Washington in St. Louis, USA and Wellcome Trust Sanger Institute, Cambridge, UK.

Exome sequencing

To produce a harmonized good-quality call set we applied the core variant calling workflow for exome-sequencing data that is composed of two stages that are performed sequentially: pre-processing, from raw sequence reads to analysis-ready reads; and variant discovery, from analysis-ready reads to analysis-ready variants (Supplementary Methods).

Identification of Finnish samples

To obtain genetically well-matched controls for comparison with the Finnish IBD cases we first jointly called Finnish exomes with Swedish exomes to identify genetic Finns from other close Nordic country. Final probability was obtained by dividing the probability of being Finnish divided by the sum of probabilities of being Finnish or Swedish. Training samples in distance calculations were selected for being from Finnish or Swedish cohort as appropriate and clustering on the expected cluster (PC1<0.002 for Finnish samples and PC10 and PC2≤0.01). Samples with 99% Finnish probability (8,124 non-IBD samples, 508 UC, 238 CD and 92 indeterminate colitis (IC)) were then subset and principal component analysis (PCA) with the same parameters was run again to obtain PCAs for Finnish substructure (Supplementary Figs 7 and 8 and Supplementary Notes).

Variant annotation

Variants for the targeted and exome-sequencing data sets were annotated using PLINK/SEQ v0.10 and RefSeq reference transcript set downloaded from https://atgu.mgh.harvard.edu/plinkseq/resources.shtml.

Follow-up genotyping of RNF186

Sequenom

RNF186 p.R179X was assayed using Sequenom MassARRAY iPLEX GOLD chemistry and SpectroCHIPs were analysed in automated mode by a MassArray MALDI-TOF Compact system 2 with a solid phase laser mass spectrometer (Bruker Daltonics Inc.). The variant was called by real-time SpectroCaller algorithm, analysed by SpectroTyper v.4.0 software and clusters were manually reviewed for validation of genotype calls. Reported genetic map positions for the markers were retrieved from the single-nucleotide polymorphism (SNP) database of the National Center for Biotechnology Information (NCBI).

Exome array

The Illumina HumanExome Beadchip array includes 247,870 markers focused on protein-altering variants selected from >12,000 exome and genome sequences representing multiple ethnicities and complex traits. Nonsynonymous variants had to be observed three or more times in at least two studies, splicing and stop-altering variants two or more times in at least two studies. Additional array content includes variants associated with complex traits in previous GWAS, HLA tags, ancestry informative markers, markers for identity-by-descent estimation and random synonymous SNPs. We focused on variant exm26442, which was the only PTV in the targeted sequencing data set that was also in the exome array and had a P value <0.05 in the screening component of the study. Samples in the targeted sequencing data set were excluded from the exome array analysis.

UK sequencing

The UKIBDGC sequenced low-coverage whole genomes of 1,767 UC patients from our nationwide cohort (median depth 2 × ) and compared them with 3,652 population controls from the UK10K project (median depth 7 × ). Samples were jointly called using samtools31, and subjected to two rounds of genotype improvement using BEAGLE32. Genotype count for R179X from exome-sequencing data in 161 additional UK UC patients with severe adverse drug response to common IBD drugs were included.

Replication in Iceland population

The Iceland population data have been extended following the step below15,33.

Sequencing was performed using three different types of Illumina sequencing instruments.

  1. Standard TruSeq DNA library preparation method. Illumina GAIIx and/or HiSeq 2000 sequencers (n=5,582).

  2. TruSeq DNA PCR-free library preparation method. Illumina HiSeq 2500 sequencers (n=2,315).

  3. TruSeq Nano DNA library preparation method. Illumina HiSeq X sequencers (n=556).

Genotyping and imputation methods and the association analysis method in the Icelandic samples were essentially as previously described15 with some modifications that are described here. In short, we sequenced the whole genomes of 8,453 Icelanders using Illumina technology to a mean depth of at least 10 × (median 32 × ). SNPs and indels were identified and their genotypes determined using joint calling with the Genome Analysis Toolkit HaplotypeCaller (GATK version 3.3.0)34. Genotype calls were improved by using information about haplotype sharing, taking advantage of the fact that all the sequenced individuals had also been chip-typed and long range-phased. The sequence variants identified in the 8,453 sequenced Icelanders were imputed into 150,656 Icelanders who had been genotyped with various Illumina SNP chips and their genotypes phased using long-range phasing35,36.

Functional consequence of R179X

Allele specific expression data for R179X carriers

The primary and processed data used to generate the ASE analyses presented in this manuscript are available in the following locations: all primary sequence and clinical data files, and any other protected data, are deposited in and available from the database of Genotypes and Phenotypes ( www.ncbi.nlm.nih.gov/gap) (phs000424.v6.p1). Tissues with at least eight reads of data are presented.

Immunofluorescence

293T cells were plated on glass coverslips and transfected as described above. Cells were then fixed in 4% paraformaldehyde, blocked (3% BSA, 0.1% saponin, in PBS), and stained with primary antibodies diluted 1:250 in blocking buffer. Primary antibodies were M2 mouse anti-Flag (Sigma F3165-1MG) and rabbit anti-V5 (Cell Signaling Technology D3H8Q). The following secondary antibodies were used at a 1:1,000 dilution in blocking buffer: Alexa Fluor594 goat anti-mouse IgG (Life Technologies R37121) and Alexa Fluor488 goat anti-rabbit IgG (Life Technologies A27034). Cells were mounted in Vectashield medium containing 4,6-diamidino-2-phenylindole (Vector Laboratories) and imaged with a Zeiss Axio A1 microscope equipped with × 63/1.25 objective. Image acquisition was performed with the AxioVision (Rel.4.8) software package.

Plasmids

cDNA encoding human RNF186 was obtained from The Genetic Perturbation Platform (GPP, Broad Institute) and cloned by Gibson assembly into the pLX_TRC307 expression construct. Sequences encoding V5 and Flag tags were appended to oligonucleotides for PCR amplification of RNF186.

Biochemistry

293T cells (American Type Culture Collection) were transfected with RNF186 expression constructs by means of Lipofectamine 2000 (Life Technologies) as indicated by the manufacturer. One day after transfection, cells were lysed (1% NP-40 in PBS), resolved by SDS–PAGE, and detected by western blot. Mouse anti-V5 HRP (Sigma V2260-1VL) was diluted 1:5,000 and used in conjunction with chemiluminescent substrate (Pierce SuperSignal West Pico).

Association analysis

Association analysis of PTVs in targeted sequencing data and the exome-sequencing data was performed using the CMH χ2test implemented in R to screen for PTVs with evidence of protective signal of association37. Combined (screen+replication) association analysis was conducted with the CMH χ2test. In the replication cohort a set of 1,453 Icelandic patients with UC were compared with a very large group representing the general population (n=264,744). Logistic regression analysis was applied to the data set to obtain study-specific association statistics.

Data availability

Raw sequence-based counts of PTVs on which all analyses are based are provided in Supplementary Table 1. Final VCF for the targeted sequencing data set is available on request from NIDDK IBDGC (Phil Schumm < pschumm@uchicago.edu> and Mark J. Daly < mjdaly@atgu.mgh.harvard.edu>).

Additional information

How to cite this article: Rivas, M. A. et al. A protein-truncating R179X variant in RNF186 confers protection against ulcerative colitis. Nat. Commun. 7:12342 doi: 10.1038/ncomms12342 (2016).

Change history

  • 13 September 2016

    The HTML version of this Article incorrectly listed the authors of the UK IBD Genetics Consortium and the NIDDK IBD Genetics Consortium individually in the main author list. This has now been corrected in the HTML; the PDF version of the paper was correct from the time of publication.

References

  1. 1.

    et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).

  2. 2.

    et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979–986 (2015).

  3. 3.

    Rare and common variants: twenty arguments. Nat. Rev. Genet. 13, 135–145 (2011).

  4. 4.

    et al. A systematic survey of loss-of-function variants in human protein-coding genes. Science 335, 823–828 (2012).

  5. 5.

    et al. Human genomics. Effect of predicted protein-truncating genetic variants on the human transcriptome. Science 348, 666–669 (2015).

  6. 6.

    , & Validating therapeutic targets through human genetics. Nat. Rev. Drug Discov. 12, 581–594 (2013).

  7. 7.

    et al. Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat. Genet. 43, 1066–1073 (2011).

  8. 8.

    et al. Loss-of-function mutations in SLC30A8 protect against type 2 diabetes. Nat. Genet. 46, 357–363 (2014).

  9. 9.

    et al. Loss-of-function variants in ABCA7 confer risk of Alzheimer’s disease. Nat. Genet. 47, 445–447 (2015).

  10. 10.

    et al. Genome-wide meta-analysis increases to 71 the number of confirmed Crohn’s disease susceptibility loci. Nat. Genet. 42, 1118–1125 (2010).

  11. 11.

    et al. Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47. Nat. Genet. 43, 246–252 (2011).

  12. 12.

    et al. Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet. 10, e1004494 (2014).

  13. 13.

    et al. Association mapping of inflammatory bowel disease loci to single variant resolution. Preprint at (2015).

  14. 14.

    et al. Deep resequencing of GWAS loci identifies rare variants in CARD9, IL23R and RNF186 that are associated with ulcerative colitis. PLoS Genet. 9, e1003723 (2013).

  15. 15.

    et al. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 47, 435–444 (2015).

  16. 16.

    et al. Rare mutations associating with serum creatinine and chronic kidney disease. Hum. Mol. Genet. 23, 6935–6943 (2014).

  17. 17.

    et al. Identification of a large set of rare complete human knockouts. Nat. Genet. 47, 448–452 (2015).

  18. 18.

    et al. Genome-wide association identifies multiple ulcerative colitis susceptibility loci. Nat. Genet. 42, 332–337 (2010).

  19. 19.

    , , , & A novel RING finger E3 ligase RNF186 regulate ER stress-mediated apoptosis through interaction with BNip1. Cell Signal. 25, 2320–2333 (2013).

  20. 20.

    et al. Altered endoplasmic reticulum stress affects translation in inactive colon tissue from patients with ulcerative colitis. Gastroenterology 141, 1024–1035 (2011).

  21. 21.

    Consortium, G.T. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).

  22. 22.

    et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).

  23. 23.

    et al. Whole-genome association study of bipolar disorder. Mol. Psychiatry 13, 558–569 (2008).

  24. 24.

    et al. HLA-B*5701 genotype is a major determinant of drug-induced liver injury due to flucloxacillin. Nat. Genet. 41, 816–819 (2009).

  25. 25.

    Genomes Project, C.. et al. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010).

  26. 26.

    et al. Rare variants in CFI, C3 and C9 are associated with high risk of advanced age-related macular degeneration. Nat. Genet. 45, 1366–1370 (2013).

  27. 27.

    Consortium, U.I.G.. et al. Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat. Genet. 41, 1330–1334 (2009).

  28. 28.

    et al. Familial and sporadic inflammatory bowel disease: comparison of clinical features and serological markers in a genetically homogeneous population. Scand. J. Gastroenterol. 37, 692–698 (2002).

  29. 29.

    et al. CARD15/NOD2 gene variants are associated with familially occurring and complicated forms of Crohn's disease. Gut 52, 558–562 (2003).

  30. 30.

    et al. Copy number variation in schizophrenia in Sweden. Mol. Psychiatry 19, 762–773 (2014).

  31. 31.

    et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

  32. 32.

    & A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am. J. Hum. Genet. 84, 210–223 (2009).

  33. 33.

    et al. Loss-of-function variants in ATM confer risk of gastric cancer. Nat. Genet. 47, 906–910 (2015).

  34. 34.

    et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).

  35. 35.

    et al. Detection of sharing by descent, long-range phasing and haplotype imputation. Nat. Genet. 40, 1068–1075 (2008).

  36. 36.

    et al. Parental origin of sequence variants associated with complex diseases. Nature 462, 868–874 (2009).

  37. 37.

    Categorical data analysis, xvi 714Wiley (2013).

Download references

Acknowledgements

M.J.D. is supported by grants from the following: the National Institute of Diabetes and Digestive and Kidney Disease (NIDDK) and the National Human Genome Research Institute (NHGRI; DK043351, DK064869 and HG005923); the Crohns and Colitis Foundation (3765); the Leona M. & Harry B. Helmsley Charitable Trust (2015PG-IBD001); and Amgen (2013583217). R.J.X. is supported by grants from Amgen (2013583217) and CCFA (3765). J.D.R. is funded by grants from NIDDK (DK064869 and DK062432). IBD Research at Cedars-Sinai is supported by grant PO1DK046763, and the Cedars-Sinai F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute Research Funds. D.P.B.M. is supported by DK062413, AI067068 and U54DE023789-01; grant 305479 from the European Union; and The Leona M. and Harry B. Helmsley Charitable Trust and the Crohn’s and Colitis Foundation of America. S.R.B. is support by an NIH U01 grant (DK062431). The sequencing of UK patients was funded by a grant from the Medical Research Council, UK (MR/J00314X/1). The UK10K project was funded by the Wellcome Trust (WT091310). C.A.A. is funded by the Wellcome Trust (098051). J.C. is supported by grants from NIH (U01 DK062429, U01 DK062422, R01 DK092235, SUCCESS). R.H.D. is supported by NIH grant U01 DK062420 and the Inflammatory Bowel Disease Genetic Research Chair at the University of Pittsburgh. We thank the Broad Communications team for the feature image.

Author information

Author notes

Affiliations

  1. Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA

    • Manuel A. Rivas
    • , Daniel Graham
    • , Christine Stevens
    • , A. Nicole Desch
    • , Mitja I. Kurki
    • , Monkol Lek
    • , Beryl Cummings
    • , Taru Tukiainen
    • , Daniel G. MacArthur
    • , Benjamin M. Neale
    • , Aarno Palotie
    • , Ramnik J. Xavier
    •  & Mark J. Daly
  2. Analytic and Translational Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts 02114, USA

    • Manuel A. Rivas
    • , Mitja I. Kurki
    • , Monkol Lek
    • , Beryl Cummings
    • , Taru Tukiainen
    • , Daniel G. MacArthur
    • , Benjamin M. Neale
    • , Aarno Palotie
    •  & Mark J. Daly
  3. deCODE Genetics, Amgen Inc., 101 Reykjavik, Iceland

    • Patrick Sulem
    • , Daniel Gudbjartsson
    • , Ingileif Jonsdottir
    • , Unnur Thorsteinsdottir
    •  & Kari Stefansson
  4. Research Center, Montreal Heart Institute, Montréal, Québec, Canada H1T1C8

    • Philippe Goyette
    • , G. Boucher
    •  & John D. Rioux
  5. School of Engineering and Natural Sciences, University of Iceland, 101 Reykjavik, Iceland

    • Daniel Gudbjartsson
  6. Department of Immunology, Landspitali, the National University Hospital of Iceland, 101 Reykjavik, Iceland

    • Ingileif Jonsdottir
  7. Faculty of Medicine, University of Iceland, 101 Reykjavik, Iceland

    • Ingileif Jonsdottir
    • , Unnur Thorsteinsdottir
    •  & Kari Stefansson
  8. Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, 24118 Kiel, Germany

    • Frauke Degenhardt
    • , Sören Mucha
    •  & Andre Franke
  9. F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, USA

    • Dalin Li
    • , Talin Haritunians
    •  & Dermot P. B. McGovern
  10. Inflammatory Bowel Disease Center, Cedars-Sinai Medical Center, Los Angeles, California 90048 USA

    • Dalin Li
    • , Talin Haritunians
    •  & Dermot P. B. McGovern
  11. Department of Biosciences and Nutrition, Karolinska Institutet, 14183 Stockholm, Sweden

    • Mauro D’Amato
  12. BioCruces Health Research Institute and IKERBASQUE, Basque Foundation for Science, 48903 Bilbao, Spain

    • Mauro D’Amato
  13. Unit of Gastroenterology, Istituto di Ricovero e Cura a Carattere Scientifico-Casa Sollievo della Sofferenza (IRCCS-CSS) Hospital, 71013 San Giovanni Rotondo, Italy

    • Vito Annese
  14. Strutture Organizzative Dipartimentali (SOD) Gastroenterologia 2, Azienda Ospedaliero Universitaria (AOU) Careggi, 50134 Florence, Italy

    • Vito Annese
  15. Department of Clinical and Experimental Medicine, Translational Research in GastroIntestinal Disorders (TARGID), Katholieke Universiteit (KU) Leuven, Leuven 3000, Belgium

    • Severine Vermeire
  16. Division of Gastroenterology, University Hospital Gasthuisberg, BE-3000 Leuven, Belgium

    • Severine Vermeire
  17. Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, 9713 GZ Groningen, The Netherlands

    • Rinse K. Weersma
    •  & Marijn C. Visschedijk
  18. Department of Gastroenterology, Faculty of Medicine and Health, Örebro University, SE 701 82 Örebro, Sweden

    • Jonas Halfvarson
  19. Department of Medicine, University of Helsinki, 00100 Helsinki, Finland

    • Paulina Paavola-Sakki
    • , Maarit Lappalainen
    • , Kimmo Kontula
    •  & Martti Färkkilä
  20. Helsinki University Hospital, 00100 Helsinki, Finland

    • Paulina Paavola-Sakki
    • , Maarit Lappalainen
    • , Kimmo Kontula
    •  & Martti Färkkilä
  21. Clinic of Gastroenterology, Helsinki University Hospital, 00100 Helsinki, Finland

    • Paulina Paavola-Sakki
    •  & Martti Färkkilä
  22. Research Programs Unit, Immunobiology, and Department of Medical and Clinical Genetics, University of Helsinki, 00014 Helsinki, Finland

    • Maarit Lappalainen
    •  & Lotta L. E. Koskinen
  23. Department of Transplantation and Liver Surgery, University of Helsinki, 00100 Helsinki, Finland

    • Leena Halme
    • , P. Fleshner
    •  & S. R. Targan
  24. Department of Medical Genetics, Biomedicum Helsinki, University of Helsinki, 00100 Helsinki, Finland

    • Lotta L. E. Koskinen
  25. Gastroenterology Unit, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts 02114, USA

    • Ashwin N. Ananthakrishnan
    •  & Ramnik J. Xavier
  26. Division of Medical Sciences, Harvard Medical School, Boston, Massachusetts 02114, USA

    • Ashwin N. Ananthakrishnan
  27. Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK

    • Yang Luo
    •  & Carl A. Anderson
  28. IBD Pharmacogenetics, Royal Devon and Exeter NHS Trust, Exeter EX2 5DW, UK

    • Graham A. Heap
    • , J. Barrett
    • , K. de Lange
    • , M. Pollard
    • , J. Randall
    •  & D. Rice
  29. Peninsula College of Medicine and Dentistry, Exeter PL6 8BU, UK

    • Tariq Ahmad
  30. Meyerhoff Inflammatory Bowel Disease Center, Department of Medicine, School of Medicine, Johns Hopkins University, Baltimore, Maryland, 21205, USA

    • Steven R. Brant
  31. Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, 21205, USA

    • Steven R. Brant
  32. Division of Gastroenterology, Hepatology and Nutrition, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania 15261, USA

    • M. H. Wang
    •  & Richard H. Duerr
  33. Department of Human Genetics, University of Pittsburgh Graduate School of Public Health, Pittsburgh, Pennsylvania 15261, USA

    • M. Regueiro
    •  & Richard H. Duerr
  34. Department of Medicine, Inflammatory Bowel Disease Centre, Mount Sinai Hospital, Toronto, Ontario, Canada M5G 1X5

    • Mark S. Silverberg
  35. Department of Genetics, Yale School of Medicine, New Haven, Connecticut 06510, USA

    • R. Milgrom
    • , J. M. Stempak
    •  & Judy H Cho
  36. Institute for Molecular Medicine Finland, University of Helsinki, 00100 Helsinki, Finland

    • Aarno Palotie
  37. Massachusetts General Hospital, Center for Human Genetic Research, Psychiatric and Neurodevelopmental Genetics Unit, Boston, Massachusetts 02114, USA

    • Aarno Palotie
  38. Research Programs Unit, Immunobiology, University of Helsinki, 00100 Helsinki, Finland

    • Päivi Saavalainen
  39. Faculté de Médecine, Université de Montréal, Montréal, Québec, Canada H3T 1J4

    • John D. Rioux
  40. Department of Gastroenterology, Torbay Hospital, Devon, UK

    • C. Edwards
  41. Department of Medicine, St. Mark’s Hospital, Middlesex, UK

    • A. Hart
  42. Nottingham Digestive Disease Centre, Queens Medical Centre, Nottingham, UK

    • C. Hawkey
  43. Wellcome Trust Centre for Human Genetics, University of Oxford, Headington, UK

    • L. Jostins
  44. Christ Church, University of Oxford, Oxford, UK

    • L. Jostins
  45. Gastrointestinal Unit, Wester General Hospital, University of Edinburgh, Edinburgh, UK

    • N. Kennedy
    •  & C. Lees
  46. Newcastle University, Newcastle upon Tyne, UK

    • C. Lamb
    •  & J. Mansfield
  47. Inflammatory Bowel Disease Research Group, Addenbrooke’s Hospital, Cambridge, UK

    • J. Lee
    •  & M. Parkes
  48. Department of Medical and Molecular Genetics, Guy’s Hospital, London, UK

    • C. Mathew
    •  & N. Prescott
  49. Department of Medical and Molecular Genetics, King’s College London School of Medicine, Guy’s Hospital, London, UK

    • C. Mathew
    •  & N. Prescott
  50. Department of Medicine, Ninewells Hospital and Medical School, Dundee, UK

    • C. Mowatt
  51. Genetic Medicine, Manchester Academic Health Science Centre, Manchester, UK

    • W. Newman
  52. The Manchester Centre for Genomic Medicine, University of Manchester, Manchester, UK

    • W. Newman
  53. Centre for Genomic and Experimental Medicine, University of Edinburgh, Edinburgh, UK

    • E. Nimmo
    •  & J. Satsangi
  54. Translational Gastroenterology Unit, John Radcliffe Hospital, University of Oxford, Oxford, UK

    • A. Simmons
  55. Human Immunology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK

    • A. Simmons
  56. Gastroenterology & General Medicine, Norfolk and Norwich University Hospital, Norwich, UK

    • M. Tremelling
  57. Translational Gastroenterology Unit and the Department of Pediatrics, University of Oxford, Oxford, UK

    • H. Uhlig
  58. Pediatric Gastroenterology and Nutrition, Royal Hospital for Sick Children, Edinburgh, UK

    • D. Wilson
  59. Child Life and Health, University of Edinburgh, Edinburgh, UK

    • D. Wilson
  60. Section of Digestive Diseases, Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, USA

    • C. Abraham
    •  & D. Proctor
  61. Department of Gastroenterology and Hepatology, Digestive Disease Institute, Cleveland Clinic, Cleveland, Ohio, USA

    • J.P. Achkar
  62. Department of Pathobiology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA

    • J.P. Achkar
  63. Division of Gastroenterology, Royal Victoria Hospital, Montréal, Québec, Canada

    • A. Bitton
    •  & J. Glas
  64. Inflammatory Bowel Disease Group, Zane Cohen Centre for Digestive Diseases, Mount Sinai Hospital, University of Toronto, Toronto, Ontario, Canada

    • K. Croitoru
    •  & S. Kugathasan
  65. Department of Pediatrics, Emory University School of Medicine, Atlanta, Georgia, USA

  66. Division of Pediatric Gastroenterology, Hepatology and Nutrition, Hospital for Sick Children, Toronto, Ontario, Canada

    • J. V. Limbergen
  67. Department of Public Health Sciences, University of Chicago, Chicago, Illinois, USA

    • P. L. Schumm
  68. Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA

    • Y. Sharma

Consortia

  1. UK IBD Genetics Consortium

  2. NIDDK IBD Genetics Consortium

Authors

  1. Search for Manuel A. Rivas in:

  2. Search for Daniel Graham in:

  3. Search for Patrick Sulem in:

  4. Search for Christine Stevens in:

  5. Search for A. Nicole Desch in:

  6. Search for Philippe Goyette in:

  7. Search for Daniel Gudbjartsson in:

  8. Search for Ingileif Jonsdottir in:

  9. Search for Unnur Thorsteinsdottir in:

  10. Search for Frauke Degenhardt in:

  11. Search for Sören Mucha in:

  12. Search for Mitja I. Kurki in:

  13. Search for Dalin Li in:

  14. Search for Mauro D’Amato in:

  15. Search for Vito Annese in:

  16. Search for Severine Vermeire in:

  17. Search for Rinse K. Weersma in:

  18. Search for Jonas Halfvarson in:

  19. Search for Paulina Paavola-Sakki in:

  20. Search for Maarit Lappalainen in:

  21. Search for Monkol Lek in:

  22. Search for Beryl Cummings in:

  23. Search for Taru Tukiainen in:

  24. Search for Talin Haritunians in:

  25. Search for Leena Halme in:

  26. Search for Lotta L. E. Koskinen in:

  27. Search for Ashwin N. Ananthakrishnan in:

  28. Search for Yang Luo in:

  29. Search for Graham A. Heap in:

  30. Search for Marijn C. Visschedijk in:

  31. Search for Daniel G. MacArthur in:

  32. Search for Benjamin M. Neale in:

  33. Search for Tariq Ahmad in:

  34. Search for Carl A. Anderson in:

  35. Search for Steven R. Brant in:

  36. Search for Richard H. Duerr in:

  37. Search for Mark S. Silverberg in:

  38. Search for Judy H Cho in:

  39. Search for Aarno Palotie in:

  40. Search for Päivi Saavalainen in:

  41. Search for Kimmo Kontula in:

  42. Search for Martti Färkkilä in:

  43. Search for Dermot P. B. McGovern in:

  44. Search for Andre Franke in:

  45. Search for Kari Stefansson in:

  46. Search for John D. Rioux in:

  47. Search for Ramnik J. Xavier in:

  48. Search for Mark J. Daly in:

Contributions

Manuel A Rivas, Mark J Daly, J.D.R, and R.J.X participated in the study design. D.G., A.N.D, and R.J.X provided reagents, designed, and led the functional experiments. C.S. managed the project. P.S., P.G., D.G., I.J., U.T., F.D., S.M., Mitja I. Kurki, D.L, M.A, Vito Annese, S.V, R.K.W, J.H, P.P, M.L, M.L, B.C, T.T, T.H, L.H, L.L.E.K, A.N.A, Yang Luo, G.A.H, M.C.V, D.G.M, B.M.N, Tariq Ahmad, C.A.A, S.R.B, Richard H. Duerr, M.S.S, J.H.C, A.P, P.S, K.K, M.F, D.P.B.M. and Andre Franke provided reagents and tools. Patrick Sulem, D.G, I.J, U.T, and Kari Stefannson generated and analysed the Icelandic replication data. M.A.R led the association analysis. P.G, J.D.R, Mark J. Daly, S.R.B, R.H.D, Mark S. Silverberg, J.H.C, and D.P.B.M are members of the NIDDK IBD Genetics consortium. Carl Anderson is a member of the UK IBD Genetics consortium. All authors commented on the final version of the manuscript. Manuel A. Rivas, D.G., and Mark J Daly wrote the manuscript.

Competing interests

K.S. and his team report personal fees from deCODE Genetics/Amgen outside the submitted work. The remaining authors declare no competing financial interests.

Corresponding authors

Correspondence to Manuel A. Rivas or Mark J. Daly.

Supplementary information

PDF files

  1. 1.

    Supplementary Information

    Supplementary Figures 1-8, Supplementary Tables 1-3, Supplementary Notes 1-3 and Supplementary References

  2. 2.

    Peer Review File

Excel files

  1. 1.

    Supplementary Table 1

    Prioritizing protective protein truncating variants identified in the targeted sequencing data set

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/ncomms12342

Further reading

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.