Introduction

Colorectal cancer (CRC), the most common type of gastrointestinal cancer, is one of the leading causes of mortality worldwide.1 Understanding the mechanisms underlying the growth and progression of CRC is essential for improving treatment. Several pathways have been implicated in colorectal carcinogenesis; for instance, the integrin/FAK/ERK pathway is activated in colon cancer cells.2 Integrins constitute a family of heterodimeric glycoproteins involved in bidirectional cell signaling that participate in the regulation of gene transcription and cell shape, adhesion, migration, differentiation, survival and proliferation.3, 4 Studies also suggest that proton beam irradiation can inhibit metastatic potential, including cell adhesion and migration, by modulating the expression of genes involved in integrin trafficking and integrin-mediated signaling, which are necessary for tumor progression.5, 6 Claudins are cell adhesion molecules that localize to tight junctions. Claudin-7 (Cldn-7, CLDN7) is unique in that it has a stronger basolateral membrane distribution than other claudins, proteins that typically localize to apical tight junctions in the intestinal epithelium. Cldn-7 is highly expressed in the normal colon, and its knockdown in mice results in altered epithelial cell homeostasis and neonatal death. Notably, dysregulation of epithelial homeostasis potentiates oncogenic transformation and tumor growth. However, the role of Cldn-7 in the regulation of colon tumorigenesis remains poorly understood.7 Our recent study provides strong evidence that Cldn-7 performs a non-tight junction-related function in mediating cell-matrix interactions by interacting with integrin α2 at the basolateral membrane of normal intestinal epithelia;8 however, the modulatory effect of Cldn-7 on the integrin/FAK pathway has not been reported to date. In the present study, tissue microarrays and gene chips were analyzed to detect the Cldn-7 expression levels at different stages of colorectal carcinoma, and bioinformatic analysis was performed to identify the relationships between Cldn-7 and proteins of the integrin/FAK signaling pathway. We also examined the expression of putatively related genes at the mRNA or protein level. The purpose of this study was to provide a theoretical basis for the relationship between Cldn-7 and the integrin/FAK signaling pathway in colon tumorigenesis.

Materials and methods

Ethical statement

Colon cancer specimens were obtained with informed consent from Beijing Shijitan Hospital of Capital Medical University with the approval of the Ethical Review Board of Investigation in Human Beings at Beijing Shijitan Hospital of Capital Medical University. This investigation was performed in accordance with the ethical standards and according to the Declaration of Helsinki Principles as well as national and international guidelines.

Human tissue samples and tissue microarray analysis

In the microarray study, we analyzed samples of six paired carcinoma and para-cancerous healthy tissues. Adjacent control samples were collected 5 cm from the cancer tissue from colon carcinoma patients undergoing surgery. The cancer tissue histology was examined independently by two histopathologists who categorized the degree of differentiation as well (n=1), moderate (n=4) or poor (n=1). Consensus was reached in all cases. The carcinoma samples were classified according to the tumor node metastasis (TNM) staging system. The distribution of individuals included in the study according to gender and age is shown in Supplementary Table S1.

A human colon cancer tissue array was purchased from US Biomax (cat. no.: HCol-Ade180CS-01, US Biomax, Inc., Swampscott, MA, USA). The microarrays contained tissue cores of 90 paired colon cancer and para-cancerous samples (90 para-cancerous normal tissue cores; 17 stage 1 colon cancer tissue cores; 13 stage 2A cancer tissue cores; 8 stage 2B cancer tissue cores; 2 stage 2C cancer tissue cores; 27 stage 3B cancer tissue cores; 8 stage 3C cancer tissue cores; 7 stage 4A cancer tissue cores; and 8 stage 4B cancer tissue cores; clinical staging standards based on the AJCC, seventh edition). The information for patients who provided the samples for the tissue array is available at http://www.superchip.com.cn/product/detail_182.aspx.

RNA isolation

Tissue samples were frozen in liquid nitrogen until use. RNA was isolated using the Ambion RNA Isolation kit (cat. no. 1560, Ambion, Applied Biosystems, Foster City, CA, USA), and the quantity and quality of the isolated RNA were assessed using a NanoDrop 2000 system (Thermo Scientific, Boston, MA, USA). The RNA (A260) and protein (A280) concentrations and sample purity (A260/A280 ratio) were also measured. Only high-quality (A260/A280≥2), intact total RNA was examined for samples that showed a classic 18S and 28S ribosomal RNA band pattern based on bioanalyzer results.

Gene expression microarray and statistics

RNA (300 ng) was reverse transcribed into complementary DNA and labeled with biotin-UTP using an RNA amplification kit (cat. no. 1791, Ambion). Complementary RNA was quantified using a NanoDrop 2000, and complementary RNA quality was further evaluated using a bioanalyzer. Biotinylated complementary RNA was hybridized to the HumanHT-12 whole-human genome expression array (47 231 probes; Illumina, San Diego, CA, USA); the microarrays were washed and subsequently scanned using an Illumina BeadStation 500.

The raw signal intensities for the microarray data were processed and analyzed using GenomeStudio (2008, Illumina), and background subtraction and average normalization were performed using the average signal intensities; P-values were then calculated. To examine the integrin/FAK signaling pathway, differential gene expression between CRC tissues and matched para-cancerous control tissues from six cases was investigated using significance analysis of microarrays based on paired Student’s t-tests. Significance analysis of microarray was applied individually to the microarray data sets for each of the 12 tissue sets under consideration. During each significance analysis of microarray test, 100 random iterations were performed to determine the false discovery rate (FDR) for each gene. The FDR was constrained to zero to identify significantly differentially expressed genes (DEGs). Genes absent across all samples were filtered out, and genes displaying a significant interaction (P<0.05) were filtered in. Statistically significant DEGs between cancerous and para-cancerous tissues that passed all of the described filtering criteria (including a foldchange cut-off of >1.5 and an FDR-adjusted P-value of <0.05) were entered into the Gene Functional Annotation Tool available at the DAVID website (https://david.ncifcrf.gov/) using its official gene symbols. The Gene Ontology options GOTERM_BP_ALL and GOTERM_MF_ALL were selected, and a functional annotation chart was generated. To retrieve only significant categories, a maximum P-value of 0.05 was chosen. The microarray data have been submitted to Gene Expression Omnibus (accession number GSE54986; http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE54986).

For functional enrichment analysis of selected genes, gene probe IDs were annotated using Entrez gene symbols (GenomeStudio Plus HumanHT-12_V4_0) and analyzed using DAVID Bioinformatics Resources 6.7 to identify significantly over-represented KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways in the data set; the default settings and Fisher’s exact test were applied. The selected pathways displayed P<0.01. These results produced lists of the genes involved in the enriched pathways and provided a connectivity map of each pathway. A model network based on these pathways was generated using KEGG (http://www.kegg.jp/). Gene expression heatmaps were generated using GenomeStudio.

Quantitative real-time reverse transcription–PCR

The expression levels of selected genes identified by gene expression profiling based on the microarray results were validated by quantitative real-time reverse transcription–PCR (qRT-PCR). For extraction of total RNA from tissues, the samples were homogenized using a TissueLyser II system (Qiagen, Düsseldorf, Germany), and total RNA was isolated using commercial RNeasy Fibrous Tissue kits (cat. no. 74704, Qiagen). The extracted RNA was treated with DNase I and then quantified using a NanoDrop 2000 (Thermo). Complementary DNA synthesis was performed with ~4 μg RNA in a total volume of 20 μl using a SuperScript III first-strand synthesis system (cat. no. 18080-051, Invitrogen, Carlsbad, CA, USA), random primers and Oligo(dT)20 according to the manufacturer’s protocol. The resulting complementary DNA was used as a template for quantitative PCR, which was performed using the IQ5 system (Bio-Rad, Hercules, CA, USA) with primers specific to the CLDN7, ITGB1BP1, PTK2 (FAK), MAPK1 (ERK2), PTK2B (FAK2, PYK2), DIAPH1, GSN (gelsolin), RHOA, MYLK, TLN1 (Talin-1) and MAPK3 genes and the ACTB and GAPDH housekeeping genes and Power SYBR Green PCR Master Mix (cat. no. 4367659, Applied Biosystems, Foster City, CA, USA). To avoid amplification of any contaminating genomic DNA, the primers were designed to span at least one intron. GAPDH and ACTB were used as internal controls for normalization of mRNA expression levels. The amplification efficiencies of all primers were calculated from the standard curve slope. The primer sequences are listed in Supplementary Table S2. The quantitative PCR protocol was as follows: 10 min at 95 °C followed by 40 cycles of denaturation at 95 °C for 15 s and annealing at 60 °C for 1 min. Melting curve analysis was performed to examine the final PCR products. Relative expression values were obtained from Ct values detected using IQ5 analysis software according to the calibration curve method. The data were normalized to a factor calculated as the geometric mean of the relative mRNA concentrations of the two most stable housekeeping genes (GAPDH and ACTB); these analyses were performed using geNORM and NormFinder under our experimental conditions. All samples were quantified in triplicate.

Immunohistochemistry of colon cancer tissue array samples and statistics

The sections used for tissue microarray analysis were dehydrated in a graded ethanol series from 50 to 100% ethanol. The sections were washed with Tris-buffered saline (TBS; 5 min, three times) and incubated in retrieval buffer (10 mm citrate buffer, pH 6.0) in an autoclave at 120 °C for 10 min. The sections were then treated with 3% H2O2 in distilled water for 10 min and rinsed thoroughly in distilled water, washed in TBS, pH 7.6, for 5 min, followed by a 10 min incubation in 2% bovine serum albumin diluted in TBS. Subsequently, the sections were incubated with primary antibodies overnight at 4 °C. The optimized dilutions for antibodies were as follows: Cldn-7 (cat. no. H00001366-D01, Novus, Littleton, CO, USA) 1:500; ERK1/2 (cat. no. 9101S, CST, Boston, MA, USA) 1:800; Gelsolin (cat. no. NB110-10067, Novus) 1:1600; MYLK (cat. no. NBP2-32497, Novus) 1:150; PYK2 (cat. no. NBP1-61259, Novus) 1:6000; and Talin-1 (cat. no. NBP1-87820, Novus) 1:800. The next day, the sections were washed with TBS and incubated with the secondary antibody for 30 min at 37 °C. After an additional three washes for 5 min each in TBS, a streptavidin-biotin system and diaminobenzidine (cat. no. K-4011, DAKO, Glostrup, Denmark) were applied for 10 min. The slides were then rinsed in distilled water, followed by counterstaining with Mayer’s hematoxylin for 90 s, rinsing in water for 5 min, and dehydration. Finally, all specimens were sealed with neutral glue. Images were captured using an Aperio ScanScope CS system (Vista, CA, USA). The relative distributions of the proteins of interest in the immunostained tissue array sections were semiquantitatively analyzed by a trained analyst using Positive Pixel Count (version 8.1, Vista, CA, USA) software. Only normal or tumor areas were selected and analyzed. The protein staining levels on IHC slides were collected as total optical density values per unit of positive area in one field of view. The final quantitative result of each protein level is the average value of six cube images: Positive density=total intensity of positive area/total positive area. Analysis of variance was adopted to statistically analyze the immunohistochemistry results, comparing cancer tissues of different stages with para-cancerous normal tissue. The mean±s.d. values are displayed in the tables. The data were analyzed using SPSS, version 19.0 for Windows (SPSS, Inc., Chicago, IL, USA). P<0.05 was considered to indicate a statistically significant difference.

In silico co-expression array database study

We predicted the association between Cldn-7, Gelsolin and MAPK1/3 using COXPRESdb version 6.0, located at http://coxpresdb.jp/,9 and the STRING database (the Search Tool for the Retrieval of Interacting Genes/Proteins)10 was used to predict protein–protein interaction (PPI) pairs. In addition, further validation was performed via a meta-analysis of data in the TCGA database to identify the association between CLDN7 and integrin/FAK pathway-related genes.11

All expression data for COXPRESdb are based on Affymetrix GeneChips (Affymetrix Inc., Santa Clara, CA, USA) the information for which has been released by NCBI Gene Expression Omnibus. Using the default settings and entering Cldn-7, Gelsolin, MAPK1 and MAPK3 as gene symbols in the NetworkDrawer, co-expressed gene networks were drawn according to the clustering of pathways from the KEGG database. The similarity in the expression between CLDN7 and all other genes in the database was calculated using Pearson’s correlation coefficients; these values were then ranked. In addition, we extracted genes involved in the integrin/FAK signaling pathway from our microarray and used the WGCNA package to construct a weighted gene co-expression network between CLDN7 and the integrin/FAK pathway. The analysis results were visualized using Cytoscape3.3.0 (The Cytoscape Collaboration, National Institute of General Medical Sciences (NIGMS), Bethesda, MD, USA).

The STRING (http://string-db.org/) database is another database of known and predicted protein interactions that provides information regarding both experimental and predicted protein interactions; the database currently covers 9 643 763 proteins from 2031 organisms. The interactions include direct physical and indirect functional associations.12 To examine protein interactions between Cldn-7 and components of the integrin/FAK signaling pathway in colon tumorigenesis, genes were mapped onto the STRING database to build PPI pairs. PPI pairs with reliability scores higher than 0.4 were screened to construct a PPI network of Cldn-7 and integrin/FAK signaling pathway components.

The TCGA database is freely and readily accessible.13 It contained 33 cancer types and subtypes with matched tumor and normal tissues from 11 000 patients, including 10 rare cancers. We downloaded gene expression data on 262 colon cancer samples from the TCGA database (sample sizes of the data sets: batch 28=155; batch 29=7; batch 30=15; batch 36=33; batch 41=48; and batch 45=4) and performed a meta-analysis of the correlation coefficients using a DerSimonian-Laird approach in the metacor package of R.14 The mean correlation (r) values, 95% confidence intervals (CIs) and P-values were calculated. To decrease the risk for bias in the meta-analysis, gene expression data generated using the same platform (Agilent G4502A-07, Agilent Technologies, Santa Clara, CA, USA) were selected. Pearson's correlation coefficients between each pair of genes was performed using SPSS 22. The alpha-level for the main test and for the CIs was 0.05. Significant correlations had an upper or lower limit of the 95% CI of more than 0.1 or less than −0.1.

Results

Differential gene expression patterns between CRCous tissues and matched para-cancerous normal tissues

To identify changes in gene expression in human colon carcinoma tissue, the expression of 18 401 human genes was profiled. Genes displaying an FDR-adjusted P-value of <0.05 and a foldchange in the expression of >1.5 in cancerous tissues compared with matched normal tissues were considered to be DEGs. A total of 790 genes were upregulated and 181 genes downregulated in the cancer tissues. The top 10 significantly upregulated or downregulated genes (P<0.001) are listed in Figure 1 and Supplementary Table S3. Gene Ontology and KEGG analyses identified several regulated networks of upregulated or downregulated genes, including (1) regulation of cell proliferation, (2) response to wounding, (3) defense response and (4) cell adhesion.

Figure 1
figure 1

Hierarchical clustering and Gene Ontology (GO) enrichment analyses of differentially expressed genes between colon cancer and para-cancerous normal tissues. The heatmap was generated using the top 10 up- or downregulated genes obtained from GenomeStudio. The top portion of the heatmap and pathways corresponds to upregulated genes, and the bottom portion corresponds to downregulation genes. A full color version of this figure is available at the Journal of Human Genetics journal online.

Focusing on functional changes with regard to cell adhesion during carcinogenesis, we chose to delineate associations between Cldn-7 and integrin/FAK pathway members in human colon carcinoma. The mRNA levels of CLDN7 and candidate genes associated with the integrin/FAK pathway based on our expression microarray data were analyzed (Table 1). CLDN7 mRNA showed no change in cancer tissue (foldchange=0.63, P=0.14). Among 18 selected integrin/FAK pathway-associated genes, GSN (foldchange=0.5, P=0.031) was differentially downregulated, whereas DIAPH1 and MYL2 were differentially upregulated. The other 15 genes displayed a foldchange >1 but with P>0.05. Furthermore, the mRNA levels of CLDN7, ITGB1BP1, PTK2, MAPK1, PTK2B, DIAPH1, GSN, RHOA, MYLK, TLN1 and MAPK3 were validated by qRT-PCR (Figure 2,Supplementary Table S4), showing a highly significant 3.8-fold reduction in GSN mRNA, a 1.8-fold reduction in MAPK3 mRNA and a 1.5-fold reduction in MAPK1 mRNA in the carcinoma tissues compared with the para-cancerous normal tissues. Although these changes in gene expression are accordance with the microarray results, the observed 2.3-fold reduction in PTK2B mRNA contrasts with the microarray results. In addition, the CLDN7 mRNA level tended to be reduced in the carcinoma tissues (1.48±0.78) compared with the para-cancerous normal tissues (1.84±0.70; P>0.05).

Table 1 Candidate genes (upregulated or downregulated genes) associated with the integrin/FAK pathway
Figure 2
figure 2

Claudin-7 mRNA levels and candidate genes associated with the integrin/FAK pathway in colorectal cancer tissue, as determined by quantitative real-time RT-PCR. Samples from para-cancerous normal tissues (filled circle) and cancerous tissues (filled square) were analyzed for CLDN7, ITGB1BP1, PTK2, MAPK1, PTK2B, DIAPH1, GSN, RHOA, MYLK, TLN1 and MAPK3 mRNA levels relative to ACTB or GAPDH mRNA levels. The horizontal line represents the mean values. P-values were calculated using paired Student’s t-tests. Ca, cancerous tissues; para-Ca, para-cancerous normal tissues. n=6.

Expression and localization of Cldn-7 protein and selection of candidate proteins in CRC samples

To verify that the protein levels changed in accordance with the mRNA levels and to investigate the significance of Cldn-7 and associated proteins in the integrin/FAK pathway during multi-stage colon carcinogenesis, immunohistochemical staining was performed on a multi-stage colon cancer tissue array using antibodies against Cldn-7, MYLK, GSN, MAPK1/3, PTK2 and TLN1.

Tissue sections stained with hematoxylin and eosin were divided into para-cancerous normal mucosa, stage 1, stage 2, stage 3 and stage 4 samples. The expression of three proteins, Cldn-7, GSN and MAPK1/3, clearly correlated with the advanced colon cancer stage (Figure 3), with expression decreasing in the colon cancer tissues at lower stages. Cldn-7 expression was detected around the cell membrane of the basal crypt region and the superficial crypt region in para-cancerous normal colorectal epithelia. Cldn-7 immunoreactivity gradually decreased as the cancer stage increased, as follows: para-cancerous normal, 0.22±0.08 (n=90); stage 1, 0.19±0.06 (n=17); stage 2, 0.13±0.07 (n=23, vs para-cancerous normal, P<0.01); stage 3, 0.13±0.07 (n=35, vs para-cancerous normal, P<0.01); and stage 4, 0.08±0.09 (n=15, vs para-cancerous normal, P<0.01; Table 2). These results are in agreement with previous reports.15, 16 Gelsolin expression was detectable in the cytoplasm and the cytoskeleton of all colon samples examined, but no nuclear labeling was detected (Figure 3). Gelsolin expression was decreased in the cancerous tissues compared with the para-cancerous normal tissues: para-cancerous normal, 0.80±0.08 (n=90); stage 1, 0.75±0.09 (n=17); stage 2, 0.79±0.09 (n=23, vs para-cancerous normal, P<0.01); stage 3, 0.73±0.13 (n=35, vs para-cancerous normal, P<0.01); and stage 4, 0.78±0.15 (n=15, vs para-cancerous normal, P<0.01; Table 2). Semiquantitative analysis of the nuclear expression levels of MAPK1/3 revealed the following: para-cancerous normal, 0.07±0.03; stage 1, 0.04±0.03; stage 2, 0.02±0.02; stage 3, 0.04±0.04; and stage 4, 0.03±0.02. Compared with the para-cancerous normal tissue, all colon cancer tissue stages displayed a P-value of <0.01. The changes in the levels of these three proteins were in accordance with their respective changes at the mRNA level.

Figure 3
figure 3

Immunohistochemical staining for claudin-7, Gelsolin, MAPK1/3, Talin, FAK and MYLK in different-stage colorectal carcinomas and adjacent normal tissue. The first row shows a tissue section stained with hematoxylin and eosin. Immunostaining for claudin-7, gelsolin and MAPK1/3 in samples of different histological grades. Images were obtained using an Aperio ScanScope. Magnification, × 100. A full color version of this figure is available at the Journal of Human Genetics journal online.

Table 2 Statistical evaluation of the expression of Cldn-7 and related proteins in colon tumors via immunohistochemical analysis

The expression of the remaining three proteins, Talin, MYLK and PTK2, did not correlate with the advanced stage of colon cancer (Figure 3). Talin expression was primarily localized to the cytoplasm, with no significant change at either the protein or mRNA level between colon cancer tissues at different stages and para-cancerous normal tissues. MYLK showed cytoplasmic staining, and its protein expression was increased in stage 1 cancerous tissue (P<0.05), in contrast to its mRNA expression. PTK2 was detected at the cell membrane, and its expression at both the mRNA and protein levels displayed a decreasing trend in advanced stage colon cancer tissue but a significant increase (P<0.01) in early-stage colon cancer tissue (Table 2, Figure 3).

In silico analysis of genes and pathways co-expressed with Cldn-7, GSN, MAPK1 and MAPK3 in CRC tissue

To search for a network of co-expressed genes including Cldn-7, GSN, MAPK1 and MAPK3, an in silico analysis was performed using the COXPRESdb version 6.0 database.9 We first investigated genes directly associated with CLDN7 with respect to their network and their KEGG pathways (Figure 4a). Many of these genes were found to be directly connected with CLDN7, including SPINT1, which is involved in the regulation of epithelial integrity and immunity. However, we did not find a direct interaction between CLDN7 and GSN, MAPK1 or MAPK3 in the integrin/FAK pathway (Figure 4b). We identified the principal roles of MAPK1 and MAPK3 in the regulation of cancer progression and cell adhesion pathways (Figure 4c). In addition, we utilized the STRING database to construct a PPI network (Supplementary Figure S1). CLDN7 and DEGs of the integrin/FAK/ERK signaling pathway were integrated into the network. The nodes represent the genes and the lines the corresponding PPI pairs. However, CLDN7 failed to form a PPI pairs with any one gene from the integrin/FAK/ERK signaling pathway. To verify this prediction, we used a Pearson's correlation to evaluate co-expression between Cldn-7 and integrin/FAK/ERK signaling pathway genes from our microarray data. Significant correlations were found between CLDN7 and PTK2 (P=0.020) and between DIAPH1 and RHOA (P=0.002), whereas no correlations were observed for the other pairs (Supplementary Table S5). Mammalian Diaph1 is a downstream effector of the Rho family that mediates Rho action at the actin cytoskeleton;17, 18 therefore, DIAPH1 and RHOA are co-expressed. We also used the WGCNA package, a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis (version 1.47 can be used with Windows R-3.1.3), to analyze CLDN7 and genes of the integrin/FAK pathway identified by our microarray data, but no co-expression between CLDN7 and these genes was found (Supplementary Figure S2). We consider the discrepancies to be caused by the small sample number. Thus, further validation was performed in a larger cohort of patients. Meta-analysis of correlation coefficients was conducted using gene expression data from 262 colon cancer samples from the TCGA database. The meta-analysis approach identified significant correlations between the following gene pairs: CLDN7 and KRT8 (keratin 8; r=0.311; 95% CI, 0.194, 0.420), CLDN7 and ST14 (r=0.333; 95% CI, 0.217, 0.440), MAPK3 and KRT8 (r=0.280; 95% CI, 0.161, 0.391), and MAPK3 and RAB25 (r=0.291; 95% CI, 0.151, 0.420; Table 3). Although the P-values were <0.05, the lower or upper limits of the 95% CI were near zero, suggesting no correlations between Cldn-7 and genes in the integrin/FAK/ERK signaling pathway: CLDN7 and PTK2 (r=−0.212; 95% CI, −0.328, 0.090), CLDN7 and PTK2B (r=0.0194; 95% CI, 0.071, 0.311), CLDN7 and FYN (r=−0.153; 95% CI, −0.272, −0.029), CLDN7 and GSN (r=0.233; 95% CI, −0.060, 0.392), CLDN7 and MAPK3 (r=0.186; 95% CI, 0.001, 0.358) and CLDN7 and PXN (r=0.150; 95% CI, −0.004, 0.297; Table 3).

Figure 4
figure 4

In silico analysis showing the claudin-7 co-expressed gene network. The figure shows an image of the analysis performed using the COXPRESdb version 6.0 http://coxpresdb.jp/. Each gene is represented by a node. The lines between nodes indicate co-expression. The KEGG pathway is shown under the network. (a) In silico analysis showing genes directly associated with CLDN7 with respect to their network and their KEGG pathways. (b) In silico analysis showing the claudin-7, integrin, MAPK3 and PTK2 co-expressed gene network. (c) In silico analysis showing the claudin-7 and integrin/FAK/MAPK pathway member co-expressed gene network. A full color version of this figure is available at the Journal of Human Genetics journal online.

Table 3 Meta-analysis of the correlations between CLDN7 and integrin/FAK signaling pathway genes in TCGA colon cancer microarrays (n=262)

Discussion

On the basis of our microarray data results, DEGs between colorectal cancerous tissues and matched para-cancerous normal tissues were identified using thresholds (foldchange in expression of >1.5 and an FDR-adjusted P-value <0.05). The top 10 upregulated networks are primarily involved in cell proliferation and apoptosis, inflammation and immunity and cell adhesion. The crucial mechanism underlying colon cancer progression is the weakening of the endothelial barrier function in response to injury, and many signaling events can disrupt the intestinal barrier. Focal adhesions supported by the integrin/FAK pathway are the first barrier maintaining endothelial integrity.3 Endothelial FAK, an immediate downstream target of integrin, is activated through complex intracellular cascades, leading to conformational changes in the endothelial cytoskeleton and in cell junctions.19 Cldn-7 serves as another putative barrier based on our previous finding that Cldn-7 knockout mice have severe intestinal defects.8 The claudin family, which are the major structural and functional components of tight junctions, in mammals includes at least 24 members; in particular, Cldn-7 is highly expressed in the basolateral membrane of the intestinal epithelia.20 Another previous study revealed that Cldn-7 interacts with integrin α2 in the basolateral compartment of the intestinal epithelia.21 Taken together, our findings demonstrated a putative association between Cldn-7 and the integrin/FAK pathway.

Our study demonstrated a gradual decrease in the levels of Cldn-7 from colon adenocarcinoma stages 1–4; however, no significant decrease in the level of CLDN7 mRNA was observed by qRT-PCR. We thoroughly analyzed paired tissues from six patients. Interestingly, the mRNA level of CLDN7 in a 64-year-old female patient with poorly differentiated colon cancer showed a slight increase between the cancerous and para-cancerous normal tissues. However, the levels of CLDN7 mRNA in the samples from the other patients with well differentiated or moderately differentiated colon cancer displayed a significant downregulation (foldchange=0.57, P=0.020) in colon cancer tissues. This result agrees with the finding that the CLDN7 mRNA level is decreased as an early event in colorectal carcinogenesis.15 Our meta-analysis of correlation coefficients showed that CLDN7 was significantly co-expressed with ST14 and KRT8. Interestingly, co-expression of CLDN7 and ST14 (encoding matriptase) has been reported in colorectal carcinogenesis.15 However, there are no reports on the association between CLDN7 and KRT8 (a type of intermediate filament).

On the basis of our microarray data, we selected up- and downregulated genes associated with the integrin/FAK pathway according to the KEGG database. On the basis of qRT-PCR, none of the genes upregulated according to the microarray results exhibited a significant change between colon cancer and para-cancerous normal tissues. Alternatively, the levels of Gelsolin, MAPK1 and MAPK3 mRNA were confirmed to be significantly downregulated in the cancer tissues by qRT-PCR. According to the immunohistochemistry results from our tissue microarray analysis of 180 colon adenocarcinoma samples, we detected a decrease in the Gelsolin protein level beginning from an early colon cancer stage. Gelsolin is a well-known actin-binding protein involved in the interaction between the extracellular matrix and integrins via the regulation of integrin clustering.22 Previous studies have reported that the expression of Gelsolin protein and mRNA is severely downregulated, with data demonstrating that Gelsolin downregulation is an early and nearly universal event in colon carcinogenesis and is associated with the transition from adenoma to carcinoma.23, 24 In line with our observations, the cytoplasmic distribution of Gelsolin gradually decreased from stage 1 to stage 4 in colon adenocarcinoma.

MAPK1 and MAPK2 are also referred to as ERK2 and ERK1, respectively. Interactions between integrin β1 and the collagen matrix regulate cell survival by associating with FAK and initiating MAPK/ERK signaling.19 MAPK signaling pathway members have been determined to be among the gene markers most strongly associated with CRC,25 and seven MAPK genes have been identified as important for CRC.26 In our study, we found that MAPK1 and MAPK3 were downregulated at the mRNA and protein levels in colon cancer in a stage-specific manner. Our meta-analysis of correlation coefficients showed that MAPK3 significantly correlated with RAB25, which belongs to the Rab family of small GTPases, and with KRT8. The functional association with the ERK signaling pathway that mediates the effects of RAB25 on autophagy in ovarian cancer cells has been validated.27 The association between ERK and KRT8 phosphorylation may have a role in the induction of cancer cell apoptosis by an acetone extract of Bupleurum scorzonerifolium.28

In addition, we evaluated FAK (also referred to as PTK2), MYLK and Talin-1 via qRT-PCR and immunohistochemistry. The mRNA and protein levels of FAK showed a tendency toward downregulation, but these results were not statistically significant, and there were no significant differences in MYLK or Talin-1 expression at the mRNA or protein level. However, there is some controversy concerning the reported findings for Talin-1. A previous report showed that the preoperative serum levels of Talin-1 were significantly increased in patients with colon cancer.29 Talin-1 is a cytoskeletal protein, and its head region promotes the binding of integrin to the extracellular matrix. It has been demonstrated that Talin-1 expression increases cell adhesion, migration and invasion in prostate cancer.30 Thus, further studies of Talin-1 are needed to identify the difference in the expression between serum and tissue samples.

We performed a bioinformatic and statistical analysis to evaluate the interaction between Cldn-7 and components of the integrin/FAK pathway. Furthermore, we conducted a meta-analysis as validation. The in silico prediction and experimental analysis indicated that Cldn-7 does not directly regulate any of the examined integrin/FAK pathway genes. Nonetheless, we cannot rule out the possibility that Cldn-7 indirectly associates with the integrin/FAK pathway to co-regulate cancer progression, cell adhesion and neurotrophin signaling pathways (Figure 4). This result was verified using PPI data from the STRING database, Pearson's correlation analysis and WGCNA package analysis of our microarray data. The meta-analysis based on a large sample cohort showed that CLDN7 may be co-expressed with KRT8, which may be co-expressed with MAPK3. These data suggested that CLDN7 may be indirectly associated with MAPK3 via KRT8. The MAPK pathway is the central node of this network, and our previous work demonstrated that Cldn-7 inhibits lung cancer cell migration and invasion via the ERK/MAPK signaling pathway in response to growth factor stimulation.21 Further investigation is needed to elucidate the regulatory relationship between Cldn-7 and ERK/MAPK pathway members and to better improve clinical diagnosis and treatment.