Introduction

Major depressive disorder (MDD) is a common, complex and recurrent disorder of gene–environment interactions. The estimated heritability may range from 0.36 to 0.66.1, 2 Following up on previous studies of the pathophysiology of MDD and on the prevailing hypotheses for treatment response, we sought to identify genes that influence susceptibility to MDD or treatment response in the central nervous system pathways relevant to stress reactivity and to the pathways of action of antidepressant drugs. Current data point to roles for genes involved in drug transport, serotonin neurotransmission, neurotrophin signaling and response to stress. Promising linkage results have been located on several chromosomes,3 which highlights the multilocus nature of the genetic vulnerability to MDD.

Recently, rapid technological advances have started unraveling the contributions of common (frequency >1%) and rare genetic variants in complex disorders. In a topical review, Bodmer and Bonilla4 have synthesized current views, implications and integration of the competing hypotheses of common disease–common variant and common disease–rare variant. For most common variants, the disease-associated variant is unlikely to be functionally relevant; it may be closely linked to the functional variant, and it will cause a small increase in disease risk (odds ratio smaller than 2, generally between 1.1 and 1.4). In contrast, rare variants generally have functional and large phenotypic effects; in many cases they are missense variants that reflect amino-acid changes relevant to protein–protein interactions. Diverse scenarios may occur in the pathophysiology of common complex disorders: Common variants may be modifiers of genes with rare variant effects, such as recently described for the MC4R gene.5 Moreover, areas near common variants may contain candidate genes in which there are rare variants. The identification of rare variants may significantly affect our understanding of complex disease etiology.

We re-sequenced seven candidate genes of importance in the pathophysiology of MDD.6 Conceptually, we sought a group of genes that reflects a sequence of events relevant to drug action at four levels: (1) entry into the brain, (2) binding to monoaminergic transporters, and (3) distal effects at the transcription level, resulting in (4) changes in neurotrophin and neuropeptide receptors. Specifically, we studied a blood–brain barrier drug transporter pump (ABCB1, also called MDR1), which regulates drug entry into the brain (level 1), the norepinephrine, dopamine and serotonin transporters (SLC6A2, SLC6A3 and SLC6A4) (level 2), an antidepressant-regulated transcription factor (cyclic AMP-responsive element binding protein 1 (CREB1)) (level 3) and two receptors (level 4): neurotrophic tyrosine kinase type 2 receptor (NTRK2), important in synaptic function and neural plasticity, and corticotropin-releasing hormone receptor 1 (CRHR1), which regulates the response to stress at the behavioral, neuroimmune and neuroendocrine (hypothalamic–pituitary–adrenal axis) levels.

Materials and methods

Patients and controls

The study included 272 patients (66% female, 34% male; mean age: 38±10) with MDD and 264 healthy control individuals (60% female, 40% male; mean age: 36±11). MDD was defined as a DSM-IV (Diagnostic and Statistical Manual of Mental Disorders, 4th Edition) diagnosis of current, unipolar major depressive episode and a 21-item Hamilton Depression Rating Scale (HAM-D21) score of ≥18 with item number 1 (depressed mood) rated ≥2. All MDD patients were screened for the pharmacogenetic study of antidepressant treatment response as previously described.7 All MDD patients had comprehensive psychiatric and medical assessments in their primary language, on the basis of diagnostic and rating instruments that had been fully validated in English and in Spanish. Exclusion criteria included active medical illnesses that could be etiologically related to the ongoing depressive episode; current or active suicidal ideation with a plan and strong intent; pregnancy; lactation; current use of medications with significant central nervous system activity that interfere with electroencephalogram (EEG) activity (for example, benzodiazepines) or any other antidepressant treatment within the 2 weeks before enrollment; illicit drug use and/or alcohol abuse in the last 3 months; or current enrollment in psychotherapy. All MDD patients were Mexican-Americans and had at least three grandparents born in Mexico.

All patients had an initial comprehensive psychiatric and medical assessment and, if enrolled in the pharmacogenetic study of antidepressant treatment response, had weekly structured follow-up assessments for 9 weeks. The study consisted of two phases: a 1-week single-blind placebo lead-in phase to minimize the impact of placebo responders followed, if subjects continued to meet the inclusion criteria after phase 1, by random assignment to one of the two treatment groups: fluoxetine 10–40 mg per day or desipramine 50–200 mg per day, administered in a double-blind manner for 8 weeks. Our primary clinical outcome measure was HAM-D21 score and clinical remission on antidepressants was defined as having a final (week 8) HAM-D21 score <8. In addition, the relative response change was also computed as the difference in HAM-D21 score between pre- and post-treatment divided by the pretreatment HAM-D21 score.
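The two outcome definitions above can be written out as a minimal sketch. The function names and the example scores are hypothetical and are used only for illustration; the remission cutoff (final HAM-D21 score <8) and the relative change (pre- minus post-treatment score, divided by the pretreatment score) follow the definitions in the text.

```python
def relative_reduction(baseline: float, final: float) -> float:
    """Relative response change: (pretreatment - post-treatment) / pretreatment."""
    if baseline <= 0:
        raise ValueError("baseline HAM-D21 score must be positive")
    return (baseline - final) / baseline


def is_remitter(final: float, cutoff: float = 8.0) -> bool:
    """Clinical remission: final (week 8) HAM-D21 score strictly below the cutoff."""
    return final < cutoff


# Hypothetical patient: baseline score 24, week-8 score 6.
baseline_score, week8_score = 24, 6
print(relative_reduction(baseline_score, week8_score))  # 0.75
print(is_remitter(week8_score))                         # True
```

A patient whose score falls from 24 to 6 therefore has a relative reduction of 0.75 (75%) and counts as a remitter, whereas a final score of exactly 8 would not meet the remission criterion.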

Age-, gender- and ethnicity-matched healthy control individuals were recruited from the same Mexican-American community in Los Angeles by the same bilingual clinical research team. Controls for our genomic studies were in general good health but were not screened for medical or psychiatric illness.

Genomic DNA collection, amplification and sequencing

At the initial visit, after informed consent was obtained from the participating individuals, blood samples were collected into K2EDTA BD Vacutainer tubes (Becton Dickinson, Franklin Lakes, NJ, USA), and genomic DNA was isolated using Gentra Puregene DNA purification kits (Gentra Systems, Indianapolis, IN, USA). DNA sequencing for the seven genes was carried out in collaboration with the Sanger Institute following the ExoSeq protocol (http://www.sanger.ac.uk/humgen/exoseq/). Briefly, the known protein-coding regions, novel coding sequences and transcripts, exons and their flanking sequence were extracted from the Vega database (http://vega.sanger.ac.uk/index.html). Primers were designed automatically using Primer3 (http://frodo.wi.mit.edu/) to amplify DNA, and primer pairs were checked for uniqueness before ordering and pre-screened to determine the optimum conditions for amplification. After amplification, a sample of the products was visualized on an agarose gel to confirm the size of the PCR product. The remaining PCR product was then cleaned up using two enzymes, Exonuclease 1 and Shrimp Alkaline Phosphatase. Bidirectional sequencing of amplicons was carried out using Big Dye chemistry (Big Dye Terminator, Version 3.1; Applied Biosystems, Foster City, CA, USA). Single nucleotide polymorphisms (SNPs) were called using ExoTrace (http://www.sanger.ac.uk/humgen/exoseq/analysis.shtml), a novel algorithm developed in-house for the detection of heterozygotes in sequence traces, which processes the sense and antisense sequence reads separately and subsequently combines the results to allow SNP scoring. All polymorphisms reported here had a genotyping rate of ≥80% and an average nucleotide call rate of 93%.

Genomic control genotyping

Two approaches were used to test for hidden population stratification in our data. First, 54 independent SNPs across the 22 autosomal chromosomes were selected and evaluated with the STRUCTURE program (http://pritch.bsd.uchicago.edu/software.html)8, 9 in a combined sample built from genotype data downloaded for three HapMap ethnic samples. Three distinct clusters were well identified, with an average of at least 92% of individuals correctly assigned to the given ethnic populations (CEU, CHB+JPT, YRI). This panel of SNPs was then used as a genomic control to test our sample and showed an almost equal proportion assigned to each cluster, given K=2, 3, 4, in both cases and controls. Second, genotype frequencies for each of the 54 unlinked SNPs were compared between cases and controls using the method described by Pritchard and Rosenberg,10 and no significant difference was found on the basis of an overall test statistic (χ2=100.50, d.f.=108, P=0.68). Therefore, no population stratification adjustment was necessary for our association analyses.
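The overall statistic in the second approach can be illustrated with a short sketch. It assumes that the test sums per-SNP Pearson χ2 statistics from 2 × 3 genotype tables (2 d.f. each, so that 54 SNPs give 108 d.f.); this reading is inferred from the reported degrees of freedom, not stated in the text. The function names are hypothetical, and the closed-form tail probability used here holds only for an even number of degrees of freedom.

```python
import math


def genotype_chi2(case_counts, control_counts):
    """Pearson chi-square for a 2x3 table of genotype counts (2 d.f.)."""
    rows = [case_counts, control_counts]
    col_totals = [sum(col) for col in zip(*rows)]
    if min(col_totals) == 0:
        raise ValueError("all three genotype classes must be observed")
    grand_total = sum(col_totals)
    stat = 0.0
    for row in rows:
        row_total = sum(row)
        for observed, col_total in zip(row, col_totals):
            expected = row_total * col_total / grand_total
            stat += (observed - expected) ** 2 / expected
    return stat


def chi2_upper_tail_even_df(x, df):
    """P(X >= x) for chi-square with even df: exp(-x/2) * sum_{k < df/2} (x/2)^k / k!."""
    if df <= 0 or df % 2:
        raise ValueError("closed form requires a positive, even df")
    half = x / 2.0
    term = total = 1.0
    for k in range(1, df // 2):
        term *= half / k
        total += term
    return math.exp(-half) * total


def pooled_stratification_test(tables):
    """Sum per-SNP statistics over unlinked SNPs; tables = [(cases, controls), ...]."""
    stat = sum(genotype_chi2(cases, controls) for cases, controls in tables)
    df = 2 * len(tables)
    return stat, df, chi2_upper_tail_even_df(stat, df)


# Tail probability for the statistic reported in the text.
p_value = chi2_upper_tail_even_df(100.50, 108)
print(round(p_value, 2))
```

Evaluating the tail probability at χ2=100.50 with 108 d.f. gives a value of roughly 0.68, in line with the reported P-value.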

Nucleotide diversity, population differentiation and Hardy–Weinberg equilibrium

Nucleotide diversity (θ) and its standard deviation (S(θ)) were calculated under the assumption of an infinite neutral allele model,11, 12 and all calculations were based on n=946 for all sites, given that the average sample size was 473 individuals across all polymorphisms. Population differentiation was estimated from the pairwise FST values for the dbSNPs (entries in the single nucleotide polymorphism database hosted at the National Center for Biotechnology Information) that were both detected in our Mexican-American sample and reported in the HapMap samples. FST values were calculated as described by Weir,13 Weir and Cockerham,14 and Weir and Hill.15 In order to compare allele frequencies and to treat chromosomes as independent observations, the genotype frequencies must be in Hardy–Weinberg equilibrium (HWE).16 Exact testing of HWE was performed separately for healthy controls and MDD patients using the PLINK program Version 1.00 (http://pngu.mgh.harvard.edu/~purcell/plink/).17 SNPs that were not in HWE in the healthy control group were excluded from the allele-based association analyses of cases and controls.
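As an illustration of the diversity calculation, the sketch below assumes that θ is Watterson's per-site estimator, θ = S / (a_n × L), where S is the number of segregating sites, L the sequence length and a_n = 1 + 1/2 + ... + 1/(n−1) for n chromosomes. This choice of estimator is an assumption consistent with the neutral model named above, not a detail confirmed by the text, and the function name is hypothetical. The standard deviation is not reproduced here.

```python
def watterson_theta(segregating_sites: int, n_chromosomes: int, sequence_length: int) -> float:
    """Per-site Watterson estimate: S / (a_n * L), with a_n = sum_{i=1}^{n-1} 1/i."""
    if n_chromosomes < 2 or sequence_length <= 0:
        raise ValueError("need at least 2 chromosomes and a positive sequence length")
    a_n = sum(1.0 / i for i in range(1, n_chromosomes))
    return segregating_sites / (a_n * sequence_length)


# Figures from this study: 419 variants, n = 946 chromosomes, about 105 kb screened.
theta = watterson_theta(419, 946, 105_000)
print(f"{theta:.5f}")
```

With the study-wide figures (419 variants, 946 chromosomes and approximately 105 kb of sequence), this sketch gives a value close to 0.00054 per site.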

Statistical analysis

Data preparation and descriptive statistics were carried out with SAS software (SAS Version 9.1.3, SAS Institute, Cary, NC, USA). For SNP-based association analyses of case vs control or remitter vs non-remitter, Fisher's exact test (two-tailed) was performed using PLINK to compare allele and genotype distributions between groups. In the allelic association analysis, each polymorphism was tested in controls to confirm conformity with HWE; the odds ratio on the 2 × 2 contingency table of allele counts and its 95% confidence interval were estimated using Woolf's method or, when the frequency in a table cell was zero, by fitting an exact logistic regression model with SAS software.18 In the genotypic association analysis, SNP effects were tested under a codominant model on the 2 × 3 contingency table of genotype counts.
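Woolf's method for the allelic odds ratio can be sketched as follows. The cell labels, the function name and the example counts are hypothetical; the interval is the usual logit-based one, exp(ln OR ± 1.96 × SE), with SE = sqrt(1/a + 1/b + 1/c + 1/d). The sketch refuses tables with an empty cell, which is the situation in which the text falls back on exact logistic regression.

```python
import math


def woolf_odds_ratio(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio and Woolf (logit) confidence interval for a 2x2 table.

    a, b: minor and major allele counts in cases
    c, d: minor and major allele counts in controls
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("Woolf's method requires all four cell counts to be positive")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical allele counts, for illustration only.
odds_ratio, lower, upper = woolf_odds_ratio(20, 10, 10, 20)
print(odds_ratio, lower, upper)
```

Because the interval is symmetric on the log scale, the product of its two limits equals the squared odds ratio, which gives a quick sanity check on any reported interval.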

For the quantitative outcome (relative reduction, in %, in HAM-D21 score between pre- and post-treatment), analyses based on a dominant model were performed separately for the joint sample of patients treated with desipramine or fluoxetine and for each medication-specific sample. General linear regression models were used to examine the association between genotype and relative HAM-D21 score reduction, controlling for age, gender and baseline (pretreatment) HAM-D21 score, using the PLINK program. The Benjamini and Hochberg method was used to control the false discovery rate, and the significance threshold was set at FDR_BH ≤0.05.19
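The Benjamini and Hochberg step-up procedure can be summarized in a minimal sketch. The function name and the example P-values are hypothetical; the analyses in this study were run in PLINK, not with this code.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans marking which hypotheses are rejected at FDR level q."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * q.
    cutoff_rank = 0
    for rank, index in enumerate(order, start=1):
        if p_values[index] <= rank / m * q:
            cutoff_rank = rank
    # Reject every hypothesis ranked at or below that cutoff.
    rejected = [False] * m
    for rank, index in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[index] = True
    return rejected


# Hypothetical nominal P-values for ten SNPs.
nominal = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
print(benjamini_hochberg(nominal))
```

In this hypothetical example only the two smallest P-values pass, although five are below the nominal 0.05 level, which mirrors the pattern of nominal associations that do not survive correction.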

For haplotype-based association analysis, Haploview (Version 4.1, Broad Institute of MIT and Harvard, http://www.broad.mit.edu/mpg/haploview/) was first used to identify the haplotype blocks by applying the Four Gamete Rule,20 based on the SNPs with a minor allele frequency (MAF) ≥0.01 in the combined sample of cases and controls and an HWE exact test P>0.01 in controls. The PLINK program was then used to examine the association of specific haplotypes with depression diagnosis, clinical remission and the quantitative outcome of antidepressant treatment.
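The idea behind the Four Gamete Rule can be shown with a simplified sketch for one pair of biallelic SNPs. It assumes phased two-locus haplotypes as input and a minimum haplotype frequency of 0.01 for a gamete to count as observed; both are assumptions made for illustration (Haploview estimates haplotype frequencies from genotype data), and the function name and example data are hypothetical.

```python
from collections import Counter


def shows_recombination(haplotypes, min_freq=0.01):
    """True if all four two-locus haplotypes occur at frequency >= min_freq.

    haplotypes: iterable of 2-character strings, one allele per SNP (e.g. "AC").
    Under the four-gamete logic, seeing all four gametes implies historical
    recombination between the two SNPs, so they would not share a block.
    """
    counts = Counter(haplotypes)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no haplotypes supplied")
    observed = [h for h, c in counts.items() if c / total >= min_freq]
    return len(observed) >= 4


# Hypothetical phased haplotypes for two SNPs.
three_gametes = ["AC"] * 50 + ["AT"] * 30 + ["GC"] * 20
four_gametes = three_gametes + ["GT"] * 5
print(shows_recombination(three_gametes))  # False
print(shows_recombination(four_gametes))   # True
```

Consecutive SNP pairs for which only three gametes are observed would then be grouped into the same block.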

Results

Identification of sequence variations

A total of 419 single nucleotide sequence variants (Table 1) were identified by re-sequencing 105 kb of exonic sequence and flanking regions in the seven selected genes in an ethnically homogeneous sample of 264 healthy controls and 272 MDD patients. Among the 419 SNPs, 204 (49%) are novel polymorphisms, not previously described, including 86 in introns, 72 in untranslated regions (UTRs), 19 (12 synonymous) in coding regions, 18 in upstream and 9 in downstream regions. Overall, 95% of the novel polymorphisms had a MAF lower than 5%, whereas the corresponding proportion was 57% for dbSNPs (Supplementary Table 1). A similar distribution of MAFs was seen between cases and controls for SNPs in both intronic and exonic regions (Figure 1a). Among the 419 SNPs, the proportion of SNPs with an HWE exact test P-value ≥0.05 was 92% for controls and 91% for MDD cases (Supplementary Table 1).

Table 1 Single nucleotide polymorphisms (SNPs) detected in seven candidate genes for depression in Mexican-Americans
Figure 1

Minor allele frequency (MAF), nucleotide diversity and FST measure in seven candidate genes in Mexican-American major depressive disorder (MDD) patients and controls. Histograms show the total number of single nucleotide polymorphisms (SNPs) detected in intronic (black bar) and exonic (gray bar) regions in the seven genes by MAF in 272 MDD patients and 264 healthy controls (a); the nucleotide diversity in noncoding (black bar) and coding (gray bar) (b) or intronic (black bar) and exonic (gray bar) regions (c) by gene in the combined sample of 272 MDD patients and 264 healthy controls; and the total number of SNPs shared by the Mexican-American (MA) sample and HapMap samples by pairwise FST value (d) and the average FST by gene (e) in MA vs CEU (black bar), MA vs CHB (dark gray bar), MA vs JPT (gray bar) and MA vs YRI (white bar).

Nucleotide diversity was estimated for each gene by correcting for both sample size and length of the screened site (Table 1). Nucleotide diversities were comparable in SLC6A3 (0.00053±0.00012), NTRK2 (0.00051±0.0001) and SLC6A2 (0.00056±0.00012), but were lower in CREB1 (0.00032±0.00009) and ATP-binding cassette subfamily B member 1 (ABCB1) (0.00038±0.00008) and appeared higher in SLC6A4 (0.00078±0.00016) and CRHR1 (0.00109±0.00024). This led to an overall nucleotide diversity of 0.00054 for the seven genes investigated. When nucleotide diversity was estimated separately for coding and noncoding sequence, five of the seven genes (all except SLC6A3 and ABCB1) showed higher nucleotide diversity in noncoding regions than in coding segments (Figure 1b). However, when nucleotide diversity was estimated separately for exonic and intronic sequence, all seven genes showed higher nucleotide diversity in exonic regions than in intronic segments (Figure 1c). This reflects the high nucleotide diversity in untranslated regions (0.00088±0.00017).

Among the 215 dbSNPs detected, 83 were reported in all four HapMap ethnic groups: CEU (Caucasian), YRI (African), CHB (Han Chinese) and JPT (Japanese) in the NCBI database as of 25 June 2008. Pairwise FST values between Mexican Americans (MA) and each HapMap ethnic sample were computed for the 83 shared dbSNPs. Overall, the greatest difference in allele frequencies was found between Mexican Americans and Africans, with the highest mean FST of 0.126, compared with a mean FST of 0.035 in MA vs CEU, 0.033 in MA vs CHB and 0.032 in MA vs JPT (Figure 1d). For the gene-specific mean FST in MA vs YRI, larger values were observed for SLC6A3 (0.208) and SLC6A4 (0.198), and a much lower value for SLC6A2 (0.04) (Figure 1e).

SNP-based genetic association analyses of cases and controls

Single nucleotide polymorphism-based allelic and genotypic association analyses revealed that 16 polymorphisms in five genes were associated with MDD with a nominal P<0.05 (Table 2), including two common 3′ UTR polymorphisms in NTRK2 (rs7020204 and rs2013566) and one rare 5′ UTR polymorphism in SLC6A4 (rs28914831). Among the nine SNPs with a nominal P<0.05 in both allelic and genotypic tests, seven were uncommon polymorphisms with a MAF <0.03 in controls, including one in CREB1 (rs3732076), two in ABCB1 (rs4728697, rs58898486) and four in SLC6A4 (rs7212502, rs28914831, NT_010799.14_3288789 and rs56355214) (Table 2 and Supplementary Table 1). Three SLC6A4 common polymorphisms (rs7224199 and rs3813034 in upstream and rs140701) showed genotypic association, but with a small allelic odds ratio <1.3 and allelic test nominal P>0.05. No associated SNPs remained significant after adjusting for multiple tests with an FDR_BH ≤0.05.

Table 2 Polymorphisms associated with depression in Mexican-Americans

SNP-based genetic association analysis of antidepressant response

In this study, 142 MDD patients enrolled in the pharmacogenetic trial and completed the 8-week antidepressant treatment (68 treated with desipramine and 74 treated with fluoxetine). For the discrete outcome (remission vs non-remission), SNP-based allelic or genotypic association analyses revealed that clinical remission status was associated with several polymorphisms in or near three genes, ABCB1, NTRK2 and SLC6A2 (Table 3). All of the nine associated NTRK2 SNPs were in the 3′ UTR or coding regions except for rs2289658 at a splice site, whereas the two associated SLC6A2 SNPs were in an intron or the upstream region. For the ABCB1 gene, the associated SNPs included two in UTRs, two in introns and one in coding sequence. No associated SNPs remained significant after adjusting for multiple tests with an FDR_BH ≤0.05 in the discrete outcome analysis.

Table 3 Polymorphisms associated with remission after 8-week antidepressant treatment with desipramine or fluoxetine

For the quantitative outcome (relative reduction in HAM-D21 score) after controlling for age, gender and baseline HAM-D21 score, general linear regression analyses revealed that relative reduction of HAM-D21 scores was associated with six NTRK2 SNPs (three in the 3′ UTR, two synonymous and one intronic at a splice site) and one SLC6A3 intronic SNP, rs8179029, in desipramine-treated patients, two SLC6A2 upstream SNPs in fluoxetine-treated patients and one SLC6A3 intronic SNP, rs8179029, for the combined sample, with a nominal P<0.01 (Table 4). Among the associated SNPs, only two NTRK2 synonymous SNPs, rs2289657 and rs56142442, remained statistically significant after correcting for multiple testing with an FDR_BH=0.05 in the sample of patients treated with desipramine. Desipramine-treated patients who were homozygous for the C allele at synonymous SNP rs2289657 or at rs56142442 had higher levels of improvement, with a 27% larger reduction in HAM-D21 scores, compared with those who were not homozygous for the C allele at rs2289657 or rs56142442.

Table 4 Polymorphisms associated with relative reduction of HAM-D21 score after 8-week antidepressant treatment with desipramine or fluoxetine

Haplotype-based analyses

Haplotype analysis identified a total of 17 haplotype blocks in the seven genes using the Four Gamete Rule with the Haploview program, including one block in CREB1, two blocks in each of SLC6A3, SLC6A4 and CRHR1, three blocks in each of ABCB1 and SLC6A2, and four blocks in NTRK2 (Figure 2). For the association analysis of cases and controls, the diagnosis of depression was found to be associated with five haplotypes with a nominal P-value between 0.01 and 0.05 in CREB1, SLC6A3, ABCB1, NTRK2 and SLC6A2. Among the five depression-associated haplotypes, four included at least one SNP showing an association with depression in the single SNP-based analysis (Table 5). For the association analysis of remitters and non-remitters, eight haplotypes were found to be associated with remission status, including two in ABCB1 (ACA in block 1 for desipramine-treated patients and GCGCACACGAGAC in block 2 for fluoxetine-treated patients), two in NTRK2 (TCG and CAG in block 3 for desipramine-treated patients), one in SLC6A2 (GCCAGT in block 4 for desipramine-treated patients) and three in SLC6A4 (TAGC and TAGA in block 1 and ATTGTAACCC in block 2 for the combined sample of desipramine- or fluoxetine-treated patients). Among the eight remission-associated haplotypes, three showed an association with a nominal P≤0.01: TCG in block 3 of NTRK2 and GCCAGT in block 4 of SLC6A2 (P=0.009) for desipramine-treated patients, and TAGC in block 1 of SLC6A4 for fluoxetine-treated patients (P=0.004) (Table 5).

Figure 2

Linkage disequilibrium (LD) pattern in seven genes: cyclic AMP-responsive element binding protein 1 (CREB1) (a), SLC6A3 (b), ATP-binding cassette subfamily B member 1 (ABCB1) (c), neurotrophic tyrosine kinase type 2 receptor (NTRK2) (d), SLC6A2 (e), SLC6A4 (f) and corticotropin-releasing hormone receptor 1 (CRHR1) (g). The standard color scheme of the Haploview program is used to display the level of the logarithm of odds (LOD) and D′. Each box shows the estimated D′, which indicates the LD relationship between each pair of single nucleotide polymorphisms (SNPs); boxes are not labeled if D′=1.00. Regions are shown in bright red, light blue, shades of pink/red and white for D′=1 with LOD≥2, D′=1 with LOD<2, D′<1 with LOD≥2 and D′<1 with LOD<2, respectively. Vertical lines on the long horizontal white bar indicate the relative positions of SNPs in the gene.

Table 5 Haplotypes associated with depression or clinical remission after 8-week antidepressant treatment

For the quantitative outcome analysis of antidepressant treatment, 15 haplotypes were found to be associated with the relative reduction in HAM-D21 score after controlling for age, gender and baseline HAM-D21 score (Table 6). Among the 15 associated haplotypes, 2 in SLC6A3 and 3 in NTRK2 showed an association with a nominal P<0.004 in desipramine-treated patients, and 2 in SLC6A2 showed an association with a nominal P<0.008 in fluoxetine-treated patients. The most significant association was found between the NTRK2 haplotype CAG (rs2289658, rs2289657 and rs2289656) and relative reduction of HAM-D21 score, with a nominal P=0.0002 and an effect size of R²=0.20 (Table 6).

Table 6 Haplotypes associated with relative reduction of HAM-D21 score after 8-week antidepressant treatment

Discussion

In this study, we analyzed the fine structure of seven genes that are relevant to the pathophysiology of MDD or to antidepressant response at four sequential levels: (1) entry into the brain, (2) binding to monoaminergic transporters, and (3) distal effects at the transcription level, resulting in (4) changes in neurotrophin and neuropeptide receptors. We observed new alleles in all seven genes in Mexican-Americans. We described a total of 204 novel SNPs (Table 1), which almost doubled the number of reported SNPs in these genes detected in these individuals (the total of dbSNPs was 215). The number of novel SNPs identified per gene in these Mexican-American subjects ranged from 12 to 57, and in the case of CREB1, the total number of SNPs tripled from 6 to 18. Most of the novel SNPs reported here had a MAF lower than 5% (Supplementary Table 1). Higher nucleotide diversity was found in the exonic regions of these genes, particularly in UTRs (Figure 1c). Only a small number of the novel SNPs were in coding regions (19), and of those <40% (7) were non-synonymous. Analyses of HapMap data on four ethnic groups found different allele frequencies, with the greatest differences between Mexican Americans and Africans (Figure 1d).

Our analyses revealed nominal associations of eight SNPs and four haplotypes with susceptibility for MDD; those SNPs and haplotypes were located in four genes, ABCB1, CREB1, NTRK2 and SLC6A3. In addition, eight SNPs in SLC6A4 and one haplotype in SLC6A2 were also associated with MDD (Tables 2 and 5). However, some of these SNPs were not very common (MAF <0.03 in controls).

Nominal associations with several polymorphisms were also found for treatment response of 142 MDD patients who completed 8-week antidepressant treatment with desipramine or fluoxetine. Discrete outcome analyses (remitters vs non-remitters) showed that SNPs and haplotypes in ABCB1 and NTRK2 were associated with response. Variation in SLC6A2 and one haplotype in SLC6A4 were also associated with remission status. Quantitative outcome analyses showed that SNPs and haplotypes in ABCB1, NTRK2 and SLC6A2 were associated with relative HAM-D21 score reduction, but only two SNPs and one haplotype in NTRK2 remained significant for desipramine treatment after correcting for multiple testing.

Our data show that variations in six out of seven genes were associated with MDD or antidepressant response. Briefly, (i) SNPs in ABCB1 (located at 7q21.1), which is also called multidrug resistance 1, were associated with MDD and antidepressant response. ABCB1 encodes a large transmembrane transporter protein that acts as an active efflux pump transporting a wide range of drugs from the brain to the blood. Polymorphisms in this gene have been reported to predict the response to antidepressant treatment with drugs that are substrates for this transporter.21 (ii) SNPs in the CREB1 gene were associated with MDD. CREB1 (cyclic AMP response element-binding protein 1, located at 2q32.2-q34) encodes a transcription factor that modulates key growth factors important for synaptogenesis and neurogenesis. Sequence variations in the promoter and intronic regions of the CREB1 gene have previously been described as cosegregating with mood disorders in women.22 (iii) SNPs in NTRK2 (located at 9q22.1) were associated with susceptibility to MDD and antidepressant response. Furthermore, two SNPs and one haplotype in NTRK2 continued to be significantly associated with relative reduction of HAM-D21 scores in the desipramine-treated group, after controlling for age, gender and baseline HAM-D21 scores. NTRK2, also known as tyrosine kinase receptor B, and its ligand, brain-derived neurotrophic factor, regulate short- and long-term synaptic functions and neural plasticity. NTRK2 variants have recently been associated with obsessive-compulsive disorder in female patients.23 (iv) SNPs in SLC6A2 (noradrenaline transporter, located at 16q12.2) were associated with remission status and relative reduction of HAM-D21 scores, and one haplotype in this gene (ACCAGA) was associated with MDD. The SLC6A2 gene encodes a transporter that regulates norepinephrine (noradrenaline) homeostasis and the reuptake of norepinephrine into presynaptic nerve terminals.24 SLC6A2 polymorphisms have been reported to be associated with depression25, 26 and response to antidepressants.27, 28 (v) SNPs or haplotypes in SLC6A3 (dopamine transporter or DAT1, located at 5p15.33), which encodes a transporter that is important in dopaminergic neurotransmission, were associated with risk for MDD or relative reduction of HAM-D21 scores. This transporter mediates the active reuptake of synaptic dopamine.29 Variations in this gene have already been implicated in susceptibility to mood disorders30, 31 and antidepressant action.32 Other neuropsychiatric conditions have also been associated with SLC6A3, such as parkinsonism,33 attention-deficit hyperactivity disorder,34, 35, 36 Tourette's syndrome and addictive behavior.37, 38 (vi) SLC6A4 (serotonin transporter, located at 17q11.1-q12) encodes a transporter that mediates antidepressant action and the behavioral effects of cocaine and amphetamines. Sequence variations in SLC6A4 have been extensively studied and may be associated with several neuropsychiatric conditions, including MDD,39, 40, 41 anxiety-related personality traits42 and antidepressant response.43, 44 Our findings support an association of variations in SLC6A4 with MDD risk. Haplotypes in SLC6A4 were also associated with remission status and reduction of HAM-D21 scores.

The analyses presented here did not show that variations in the CRHR1 gene (located at 17q12-q22) are associated with susceptibility to MDD or antidepressant response. It should be noted that the current analyses did not take anxiety levels into consideration. CRHR1 encodes the receptor of CRH, a key stress hormone that regulates the response to stress at the behavioral, immune, autonomic and neuroendocrine levels, through the activation of the hypothalamic–pituitary–adrenal axis. Polymorphisms in CRHR1 were reported to be associated with antidepressant response, but only when anxiety scores are taken into consideration,45, 46 and with seasonal pattern and early onset of first depressive episode.47

In summary, we show that substantial levels of sequence variation, especially variants that are not very common (MAF <5%), are likely to be found in candidate genes in an ethnically defined and understudied group. In this population group, for example, half of the SNPs detected were novel. Therefore, deep sequencing data may be relevant to our understanding of common and complex disorders, such as major depression, particularly in minority populations. Our analyses showed that several sequence variations and haplotypes in six out of seven selected genes were nominally associated with MDD risk and/or antidepressant treatment response and that, after controlling for age, gender and baseline HAM-D21 score, as well as correcting for multiple testing, there was a significant association of antidepressant response with two NTRK2-coding SNPs and one haplotype. Our findings suggest that these variants may be implicated in the pathophysiology of MDD. Mexican Americans are the most rapidly growing population group in the United States, but remain underrepresented in research studies. These results highlight the importance of direct re-sequencing of key candidate genes in ethnic minority groups in order to discover novel genetic variants that cannot be simply inferred from existing databases.

Conflict of interest

The authors declare no conflict of interest.