Abstract
Autoantibodies against tumor-associated antigens (TAAs) have been suggested as biomarkers for early detection of gastric cancer. However, studies that systematically assess the diagnostic performance of a large number of autoantibodies are rare. Here, we used bead-based multiplex serology to simultaneously measure autoantibody responses against 64 candidate TAAs in serum samples from 329 gastric cancer patients, 321 healthy controls and 124 participants with other diseases of the upper digestive tract. At 98% specificity, sensitivities for the 64 tested autoantibodies ranged from 0–12% in the training set and a combination of autoantibodies against five TAAs (MAGEA4 + CTAG1 + TP53 + ERBB2_C + SDCCAG8) was able to detect 32% of the gastric cancer patients at a specificity of 87% in the validation set. Sensitivities for early and late stage gastric cancers were similar, while chronic atrophic gastritis, a precursor lesion of gastric cancer, was not detectable. However, the 5-marker combination also detected 26% of the esophageal cancer patients. In conclusion, the tested autoantibodies and combinations alone did not reach sufficient sensitivity for gastric cancer screening. Nevertheless, some autoantibodies, such as anti-MAGEA4, anti-CTAG1 or anti-TP53 and their combinations could possibly contribute to the development of cancer early detection tests (not necessarily restricted to gastric cancer) when being combined with other markers.
Similar content being viewed by others
Introduction
Despite decreasing incidence worldwide, almost one million people were estimated to have been diagnosed with gastric cancer in 2012 and most patients diagnosed with gastric cancer die from this disease1. Eradication of Helicobacter pylori infection, the most important risk factor of non-cardia gastric cancer, has been shown to reduce gastric cancer incidence and mortality in several randomized clinical trials2,3. Another approach to lower mortality would be early cancer detection. Screening by upper endoscopy has been implemented for individuals at high risk of gastric cancer4. A sensitive blood test would be an attractive alternative for both average-risk individuals from high-incidence countries and high risk groups from low-incidence countries5.
Autoantibodies against tumor-associated antigens (TAAs) have been found in serum of patients with various types of cancer and could serve as biomarkers for early detection of gastric cancer as well6,7,8,9,10. In a recent systematic literature review we have evaluated the potential of known autoantibody markers for this purpose11. However, direct comparison of different markers and marker combinations was difficult because of differences in the study populations and autoantibody detection methods. Furthermore, many promising autoantibody markers have not been independently validated so far.
In order to overcome these limitations, we designed a study including autoantibody measurements in over 800 serum samples from healthy controls, gastric cancer patients and persons with other diseases of the upper digestive tract by bead-based multiplex serology. With this method it is possible to measure up to 100 antibodies simultaneously, with performance characteristics comparable or even better than standard serological techniques like ELISA12.
The aim of our study was to identify autoantibody combinations that are able to detect a substantial proportion of gastric cancer patients at a reasonably high specificity.
Methods
Study design and study population
Our study followed a two-step approach with marker selection for gastric cancer detection in a training set (step 1) and subsequent evaluation of diagnostic performance of selected autoantibody markers and marker combinations in independent validation samples (step 2). In addition, we performed analyses with patients with other diseases of the upper digestive tract and participants with a diagnosis of gastric cancer during follow-up to assess disease specificity and the ability of autoantibody markers to detect precursors of clinical gastric cancer. An overview of the study design is shown in Fig.1 and detailed information on the studies from which the study populations were sampled is shown in the supplementary methods and Supplementary Table S1.
Gastric cancer cases included in the training set were recruited in southwestern Germany in the context of the DACHSplus study, a satellite substudy to the case-control study DACHS13,14,15. In addition to colorectal cancer (CRC) patients recruited for the DACHS study, patients with a primary diagnosis of other gastrointestinal cancers were enrolled in DACHSplus. Controls in the training set were selected from the BliTz study, an ongoing prospective CRC study among participants of screening colonoscopy, conducted in cooperation with more than 20 gastroenterology practices in the same geographic region as DACHS16,17,18.
Gastric cancer cases included in the validation set were recruited in the context of the ESTHER II study and the VERDI study19,20. These are two large unselected cancer patient cohort studies conducted in the entire state of Saarland, also located in the southwest of Germany. Controls of the validation set were recruited by their general practitioners in the ESTHER I study, a prospective cohort study among participants of general health check-up21.
For the main study all available gastric cancer patients from DACHSplus, ESTHER II and VERDI (n = 316) were included in the autoantibody measurements. The controls (n = 335) were randomly drawn from BliTz and ESTHER I after exclusion criteria were applied (see Supplementary Table 1 for eligibility criteria). For the additional analyses, we furthermore included esophageal cancer patients from DACHSplus (n = 35), chronic atrophic gastritis patients from ESTHER I (n = 100) as well as ESTHER I participants with a diagnosis of gastric cancer during the follow-up period (2002–2011, n = 29).
Because sex differences between subjects with and without gastric cancer (higher proportion of males in the former) would be expected in any screening population1 and as we intended to derive diagnostic performance estimates which are representative for the screening situation, we did not match cases and controls. However, we carried out a sensitivity analysis in a matched subset of the validation set as described in Supplementary Method 3. This way, the ability of autoantibody marker combinations to discriminate cases and controls independent of differences in the age and sex distributions could be assessed.
All studies were approved by the ethics committees of the University of Heidelberg and of the respective state medical boards. From each participant written informed consent was obtained. Methods were carried out in accordance with approved guidelines (e.g. good epidemiological practice, good laboratory practice).
Sample collection and handling
For BliTz participants, serum samples were taken by gastroenterologists before screening colonoscopy and for ESTHER I participants, serum samples were taken by general practitioners during a general health examination. For 14 participants of ESTHER I who developed gastric cancer during the follow-up period, also 5 year follow-up blood samples were available. Blood samples from DACHSplus participants were taken in hospitals before surgery (but after neoadjuvant therapy for some patients). Blood samples from gastric cancer patients from VERDI and ESTHER II were taken in hospitals or at patients’ homes at various time points (before surgery: 25 patients, 1–14 days after surgery: 66 patients, 15–90 days after surgery: 36 patients, >90 days after surgery: 11 patients, no surgery or time of surgery unknown: 15 patients.) All obtained serum samples were stored at −80 °C until the autoantibody measurements.
Bead-based multiplex serology measurements
We selected 64 candidate TAAs encoded by 59 genes based on previous autoantibody measurements in melanoma, ovarian cancer and pancreatic cancer patients22,23 and two systematic reviews11,24. Among them there were many cancer-germline antigens (e.g. CTAG1, CTAG2, DDX53, MAGEA1, MAGEA3, MAGEA4) and proteins from pathways known to be dysregulated in cancers (e.g. TP53, ERBB2, EGFR). Details on the 64 candidate TAAs are provided in Supplementary Table S2.
Autoantibodies against the selected TAAs were measured by multiplex serology, a fluorescent bead-based glutathione S-transferase (GST) capture immunosorbent assay, as described previously12,23. In short, TAAs were bacterially expressed as GST-X-tag fusion proteins25, loaded and affinity-purified on glutathione-casein-coupled spectrally distinct fluorescence-labeled polystyrene beads (SeroMap, Luminex Corp., Austin, Tx, USA). A mix of differently loaded bead sets provides an antigen suspension array that is presented to sera. A Luminex analyzer (Luminex Corp., Austin, Tx, USA) distinguishes the bead set by its internal bead-color and quantifies the amount of bound serum antibody detected by a secondary goat anti-human IgA, IgM, IgG antibody (Dianova, Hamburg, Germany) and the reporter conjugate streptavidin-R-phycoerythrin. The antibody reactivity is given as median fluorescence intensity (MFI) of at least 100 beads per set measured. Final antigen-specific MFI values were generated by subtraction of GST-tag and individual bead background values. Serum samples with a very high background fluorescence signal (reactivity for GST-tag >300 MFI) were excluded from further analyses.
Measurements of autoantibodies were performed in a 1:1000 dilution in the Division of Molecular Diagnostics of Oncogenic Infections, DKFZ, Heidelberg, Germany. In the first round of the antibody measurements 62 of the 64 autoantibodies were measured simultaneously. Anti-TP53 and anti-CDKN2A were measured together with antibodies against bacterial and viral antigens that are not subject of this analysis in the second round of the antibody measurements. For both measurements, the laboratory staff was blinded for the case-control status.
Antigen-loading of the beads was controlled via detection of the C-terminal tag and identity of the antigen loaded on the beads was verified by identifying the encoding plasmids via PCR and sequencing. Variation between different assay-plates was controlled by three control sera on each plate as replicates. Of these replicates, coefficients of variation (CV, = ratio of the standard deviation to the mean) were calculated for each antigen with a mean reactivity above 30 MFI. The median (range) CVs for all antigens combined in all three control sera were 16% (10–24%), 14% (11–21%) and 18% (11–25%), respectively.
Statistical analysis
For characterization of the study population we used standard descriptive statistics and hypothesis tests (Fisher’s exact test, Wilcoxon rank-sum test). The strategy for selection of autoantibodies and their cutoffs for prediction of presence of gastric neoplasms was tailored to the particular character of single autoantibody markers, which typically have rather low sensitivity at very high specificity11,24.
Individual cutoffs for each autoantibody were calculated based on the MFI values of the controls from the training set: Blood samples with MFI values exceeding the 98th percentile of the BliTz controls were considered as seropositive. To reduce noise from very weak fluorescence signals, cutoffs below 50 MFI were set to 50 MFI. Frequencies of autoantibodies against each antigen with 95% Wilson score confidence intervals (CIs)26 were determined for the different groups of study participants from the training and the validation set. As sensitivity analyses training and test set were swapped and cutoffs, sensitivities and specificities were calculated as aforementioned.
To evaluate the diagnostic performance of autoantibody combinations we generated all possible 2-, 3-, 4- and 5-marker combinations. A multi-marker test was considered as positive if the MFI value for at least one autoantibody of the combination was higher than the autoantibody-specific cutoff. All multi-marker combinations were ranked by Youden’s index (J = sensitivity + specificity −1) for the training set and for the marker combinations with the highest Youden’s indices, sensitivities and specificities with Wilson score intervals were evaluated in the validation set and in the additional samples (CAG, esophageal cancer, participants with diagnosis of gastric cancer during the follow-up period). Subgroup specific analyses were performed for early versus late stage cancers, cardia versus non-cardia gastric adenocarcinomas, men versus women, persons aged under 65 versus persons aged 65 and above and untreated patients versus patients who received radiotherapy, chemotherapy or surgery before blood withdrawal. To test for differences in the ability of the best 5-marker combination to correctly classify cases as cases and controls as controls between different subgroups Fisher’s exact test was used.
All analyses were performed with R (version 3.1.0)27. All statistical tests were two-sided and p-values below 0.05 were considered statistically significant.
Results
Study population characteristics
To study autoantibody responses against 64 TAAs we selected 829 serum samples from 815 individuals (gastric cancer patients, controls, esophageal cancer patients and CAG patients) and performed autoantibody measurements by bead-based multiplex serology. For 785 samples (95%) valid measurement results were obtained, while 44 samples had to be excluded due to insufficient amount of serum (n = 37) or high background fluorescence (n = 7). Participants with and without valid measurement results did not differ in respect of age, sex, stage, case-/control-status or study (each p-value > 0.05).
An overview of the study design and final numbers and characteristics of the different groups of participants are provided in Fig. 1 and Table 1. The mean age at recruitment was similar across all groups of participants and ranged from 61 years (controls ESTHER I study) to 66 years (gastric cancer patients ESTHER I study). With 66–68% and 81% the percentages of male subjects were significantly higher among gastric cancer and esophageal cancer cases than among controls (45–51%, p < 0.0001 and p = 0.0003, respectively). They were also higher than among CAG patients (47%, p = 0.0006 and p = 0.0015, respectively). For 23 gastric cancer patients from the training set, 27 gastric cancer patients from the validation set and all participants from ESTHER I with diagnosis of gastric cancer during the follow-up period, the UICC stage was unknown. About half of the remaining gastric cancer patients of the training set were diagnosed at an early stage (UICC 0-II), while in the validation set there were slightly more late-stage (UICC III-IV) gastric cancer patients. Due to recent changes in the gastric cancer treatment guidelines28, gastric cancer patients of the training set, who were more recently recruited in the context of the DACHSplus study, more often received neoadjuvant therapy than gastric cancer patients of the validation set (ESTHER II and VERDI study).
Diagnostic performance of single autoantibody markers for gastric cancer detection
For each measured autoantibody an individual cutoff for seropositivity was calculated based on the MFI values of the controls from the training set (cutoff = 98th percentile of controls, minimum 50 MFI). The cutoffs ranged from 50 MFI to 3633 MFI (see Supplementary Table S3). The highest cutoffs were observed for SPANXA, HIST1H2B and MPHOSPH6 indicating numerous autoantibody responses in healthy controls against those antigens. At cutoffs yielding at least 98% specificity, sensitivities for gastric cancer detection ranged from 0% to 12% in the training set and autoantibody responses were found in both early and late stage cancers. Antibody frequencies of at least five percent were seen for the antigens MAGEA4, CTAG1, CTAG2, DDX53, TP53, MAGEA3, SDCCAG8, KLK3_iso2, ERBB2_N, ERBB2_C, IGF2BP1, GRINA and UBQLN1 (see Table 2). In the validation set autoantibodies against MAGEA3, MAGEA4, CTAG1 and TP53 performed best with Youden’s indexes of 0.10 (anti-MAGEA3), 0.09 (anti-MAGEA4) and 0.07 (anti-CTAG1, anti-TP53).
For some autoantibodies, antibody reactivities ranged from rather weak to very strong reactivities as measured e.g. for anti-CTAG1 with mean reactivity of 5108 MFI in seropositive gastric cancer cases, which is 28.3× higher than the cutoff (see Fig. 2). Moreover, many cancer patients developed autoantibodies directed against several of the TAAs (Fig. 3). Especially for antigens with high structural similarity (e.g. CTAG1 and CTAG2 or MAGEA3 and MAGEA4), a high correlation in seroreactivity was observed.
After swapping of the training and test set 4 of the top 5 autoantibodies and 9 of the top 13 autoantibodies from the old training set were also among the top 5 and top 13 autoantibodies in the new training set, respectively (see Supplementary Table S4).
Diagnostic performance of marker combinations for gastric cancer detection
To evaluate the diagnostic performance of autoantibody combinations, we generated all possible 2-, 3-, 4- and 5-marker combinations and ranked them by the Youden’s index (J) for the training set. According to this criterion, the best 2-, 3-, 4- and 5- marker combinations were found for autoantibodies against MAGEA4 + CTAG1 (J = 0.15, 19% sensitivity, 96% specificity), MAGEA4 + CTAG1 + TP53 (J = 0.18, 25% sensitivity, 93% specificity), MAGEA4 + CTAG1 + TP53 + ERBB2_C (J = 0.21, 30% sensitivity, 91% specificity) and MAGEA4 + CTAG1 + TP53 + ERBB2_C + SDCCAG8 (J = 0.23, 34% sensitivity, 89% specificity). However, there are other 5-marker combinations that can detect gastric cancer patients with very similar sensitivities and specificities (see Table 3). Because specificity in the screening situation typically should not go below a limit of about 90%, we decided to not further increase the number of autoantibodies in a marker combination and to conduct the subsequent analyses only with the best performing five-marker panel.
All top 11 5-marker panels included the autoantibodies anti-MAGEA4, anti-CTAG1 and anti-TP53. Anti-ERBB2_C was included eight times, anti-SDCCAG8 was included four times and anti-DDX53 was included two times, while the other TAAs were included only once. After swapping of the training and test set new top 5-marker combinations were selected. However, all top 11 combinations also comprised anti-TP53 (see Supplementary Table S5).
Diagnostic performance of a 5-marker panel in subgroups of gastric cancer patients and patients with other diseases of the upper digestive tract
The diagnostic performance of the best performing 5-marker panel was evaluated separately for gastric cancer patients from different studies and cancer stages (see Table 4). Early and late stage cancers were both detected with similar sensitivity (35% and 32% sensitivity at 87% specificity in the validation set) and no significant differences in diagnostic performance were found between gastric cancer patients from ESTHER II and VERDI (p = 0.59). Interestingly, some of the few gastric cancer patients from the cohort study ESTHER I presented autoantibodies against TAAs several years before gastric cancer diagnosis (for details see Supplementary Table S7). The 5-marker panel was also able to detect 26% of the esophageal cancer patients. However, patients with chronic atrophic gastritis, which is a precursor of gastritis cancer, were not detected by the 5-marker panel (see Table 4).
To evaluate the ability of the best performing 5-marker panel to discriminate gastric cancer cases and controls independent of differences in the age and sex distributions we performed a sensitivity analyses in a matched subset of the validation set (120 gastric cancer cases, 97 controls) using adjustment weights. With age- and sex adjusted sensitivities and specificities of 33% and 88% diagnostic performance characteristics after adjustment were very similar to the unadjusted diagnostic performance characteristics. In accordance with this result, no significant differences between men and women or persons under 65 years and persons aged 65 and above were found in the subgroup analyses (see Supplementary Table S6). Likewise, subgroup analyses in untreated patients versus patients that received radiotherapy, chemotherapy or surgery before blood withdrawal, cardia versus non-cardia gastric cancers, persons with different times of blood withdrawal in relation to surgery or different H. pylori infection status did not reveal significant differences between those groups.
Discussion
We measured autoantibodies against 64 candidate TAAs by bead-based multiplex serology in serum samples from 329 gastric cancer patients, 321 healthy controls and 124 participants with other diseases of the upper digestive tract. Sensitivities for gastric cancer detection for single autoantibodies ranged from 0–12% at 98% specificity in the training set and a combination of five autoantibodies was able to detect about a third of the gastric cancer patients at a specificity of 87% in the validation set. Early stage cancers were detected with similar sensitivities as late stage cancers and in some patients autoantibodies were even found several years before gastric cancer diagnosis. Sensitivities for the detection of CAG and esophageal cancer by the 5-marker panel were 12% and 26%, respectively.
We selected many autoantibodies for our measurements based on promising diagnostic performance for early detection of cancer in two previously performed systematic literature reviews11,24. However, in our measurements the sensitivities observed for these autoantibodies were often considerably lower than those reported originally. The most contrary finding to previously published results was observed in case of autoantibodies against MTDH, also known as AEG-1, which have been described to be present in none of the controls but 59% of the gastric cancer cases29, compared to 2–3% in both gastric cancer cases and controls here. Some of the observed differences in diagnostic performance might be attributable to differences in the methods used to quantify autoantibodies, others to shortcomings in the study design and data analyses in former studies.
It is a disappointing but common phenomenon that initially promising candidate cancer biomarkers do not pass validation studies30,31. Possible explanations are that observed differences in serum levels of a certain biomarker in initial studies are not caused by the cancer but by differences in the study populations (e.g. different age and sex distributions in cases and controls) or in the blood sampling and storage conditions (e.g. different blood sample processing time for cases and controls)32. If the diagnostic performance of marker combinations is evaluated on the same data that were used to select the markers, results will be overoptimistic due to overfitting33. A similar effect can occur if specificity is determined on the same controls that were used to calculate the cutoffs for seropositivity34,35.
We tried to avoid overfitting and overoptimistic performance estimates by using gastric cancer cases and controls from independent studies for marker selection and validation of the 5-marker panels. To test the validity of our results, we furthermore performed a sensitivity analysis in which we swapped the training and the test set. For single autoantibodies there was a large overlap between the top markers identified with the original sets and with the swapped sets. These autoantibodies were also frequently selected for the best 5-marker combinations generated from both sets which demonstrates the robustness of our marker selection approach. After swapping training and test sets, Youden’s indexes of the top 5-marker combinations were higher for the training set and mostly lower for the validation set. This is not surprising in consideration of the fact that sample sizes for the swapped training set were smaller than for the original training set and supports our decision to pick the BliTz and DACHSplus participants as training set in the main analyses.
To our knowledge, there are only few other studies that have evaluated autoantibody combinations for the early detection of gastric cancer so far. Recently, two articles from the same university in China have been published that describe studies that tested combinations of autoantibodies against Koc ( = IGF2BP3), p62 ( = IGF2BP2), Imp1 ( = IGF2BP1), Cyclin B1 ( = CCNB1), p16 ( = CDKN2A), Survivin ( = BIRC5), c-myc ( = MYC) and p53 ( = TP53) for the early detection of gastric cardia adenocarcinoma36 or multiple cancer types including gastric cancer9. With a 7-marker combination Zhou et al. reported identifying 64% of all cases with gastric cardia adenocarcinoma (specificity: 86%)36 and the 8-marker combination of Wang et al. was reported to yield 56% sensitivity for gastric cancer detection at 86% specificity9. Seven of these eight autoantibodies were also included in our measurements but only anti-TP53 and anti-IGFBP1 were selected in the top 11 5-marker combinations. However, the apparently better diagnostic performances of shared autoantibodies in the two articles have to be viewed with caution because of small sample sizes and the fact that cutoffs were chosen based on the same controls that subsequently were used to calculate specificity. Furthermore, Wang et al. did not provide study populations characteristics or a detailed description of blood sampling and storage conditions9 which limits judgement of comparability of cases and controls in regard to these factors. In another autoantibody panel for gastric cancer early detection small peptides instead of full length proteins were used as antigens. The reported cross-validated sensitivity for a signature of 45 autoantibodies was 44% at a specificity of 90%37.
In our measurements autoantibodies were found to be not restricted to gastric cancer patients with certain cancer stages or gastric cancer subtypes. Moreover, the autoantibody measurements in esophageal cancer patients showed that autoantibodies in our best 5-marker combination can also be present in another cancer of the gastrointestinal tract. A comparison with the improved version of EarlyCDT®-Lung, a commercial autoantibody test for lung cancer early detection38,39, reveals big overlap between our best performing autoantibodies and the autoantibodies of this test: four of the seven autoantibodies used in the EarlyCDT®-Lung test (anti-MAGEA4, anti-CTAG1 = anti-NY-ESO-1, anti-DDX53 = anti-CAGE and anti-TP53) were among the five autoantibodies with the highest sensitivities for gastric cancer; the other three autoantibodies (anti-SOX2, anti-GBU4-5 and anti-HuD) were not included in our multiplex serology measurements. Also the autoantibody measurements by Wang et al. support the idea that many commonly measured autoantibodies are present in different types of cancer9.
With this knowledge, it seems likely that some of the false positive results from the control group indeed are true positive results representing patients that suffer from an undiagnosed other cancer, for example prostate cancer, which is the most frequently present but often undiagnosed cancer in old men in developed countries1. In accordance with this hypothesis is the observation that observed specificities in old persons (for whom cancer prevalence is likely to be higher) tended to be a bit lower than in young persons.
With our 5-marker panel as well as with the EarlyCDT®-Lung test only the minority of the cancer patients are detected at specificities of around 90%39. So, the question arises why the majority of cancer patients are missed. Is it just an issue of suboptimal autoantibody selection in current panels? Or do not all cancers lead to detectable immune responses? In general, there are two types of tumor antigens: neo-antigens that represent mutated tumor proteins and non-mutated self-antigens that are derived from proteins that are not present or present at lower levels in normal cells, e.g. cancer-germline antigens40. However, as neo-antigens are unique for each tumor, all important antigens we measured autoantibodies against, belonged to the second group. Even if there were mutations present in a reasonable percentage of the gastric cancer patients, it couldn’t be guaranteed that those mutations would lead to the development of autoantibodies, because immune recognition is dependent on the ability of peptides that carry the mutation to be transported and bound to MHC receptors40. The possibilities of combining autoantibody markers are further limited by the fact that many cancer-germline antigens are closely related and patient sera often react with either all or none of these TAAs. For instance autoantibodies against CTAG1 and CTAG2 or against MAGEA3 and MAGEA4 are frequently found together and the combination of such a pair of autoantibodies would not result in a large gain of sensitivity, while for example the addition of anti-TP53 to an autoantibody against a cancer-germline antigen could increase sensitivity substantially.
In addition to searching for new algorithms and autoantibody markers that complement the currently known autoantibody markers, there might be a high potential in combining known autoantibody markers with other candidate biomarkers for gastric cancer. Those could be traditional tumor markers like CEA, CA19-9 and CA72-441, markers related to chronic atrophic gastritis (e.g. H. pylori antibodies and pepsinogens)42, TAAs or other proteins43,44, microRNAs45,46 or glycosylation signatures47.
To our knowledge, this is the first study in which simultaneous measurements by bead-based multiplex-serology were performed to systematically assess the diagnostic value of autoantibodies for the early detection of gastric cancer. There are specific strengths and limitations that have to be considered. Strengths are the simultaneousness of measurements, which allows a direct comparison of autoantibody markers, the large sample size (329 gastric cancer patients and 321 controls with valid measurements in total) and the use of independent training and test sets for marker selection and validation. However, both cases of the training set and the validation set were recruited in clinics after gastric cancer diagnosis (and initial treatment for some patients) and controls of the training set came from a setting optimized for colorectal cancer screening studies rather than for gastric cancer screening studies. Although autoantibodies are known to be stable and enduring48 and we did not observe significant differences in the diagnostic performance of the 5-marker panel between subgroups of gastric cancer patients that did or did not receive neoadjuvant therapy before blood withdrawal, it cannot be ruled out that diagnostic or therapeutic interventions or lifestyle changes in response to the gastric cancer diagnosis have influenced autoantibody levels. Furthermore, the number of prediagnostic gastric cancer cases from the ESTHER I study, in which autoantibodies were detected several years before gastric cancer diagnosis, is too small to decide if these are true results or chance findings. Larger prospective studies would be necessary to answer this question.
Further topics that should be addressed in future studies are the biological role and the dynamics of autoantibodies in cancer. For example, we observed that strength of the autoantibody responses varied largely among the markers with highest sensitivities in the training set. Further studies should explore if strong and weak autoantibody responses have the same diagnostic values for early cancer detection and if strength of an autoantibody response varies in the course of the progression from a premalignant lesion to a symptomatic cancer.
In conclusion, we have conducted large scale autoantibody measurements in serum samples from gastric cancer cases and controls. The tested autoantibodies and combinations alone did not reach sufficient sensitivity for gastric cancer screening. However, with moderate sensitivities and very high specificities, some of the tested autoantibodies, e.g. anti-MAGEA4, anti-CTAG1, or anti-TP53 could be good candidates for combinations with other cancer biomarkers. As autoantibodies seem not to be specific for a certain cancer type, autoantibodies against TAAs might be particularly useful for the development of a potential minimally invasive “universal cancer test” which might serve for preliminary unspecific cancer screening to select people who would most likely benefit from (typically more costly and complex) screening for specific cancers.
Additional Information
How to cite this article: Werner, S. et al. Evaluation of the diagnostic value of 64 simultaneously measured autoantibodies for early detection of gastric cancer. Sci. Rep. 6, 25467; doi: 10.1038/srep25467 (2016).
References
Torre, L. A. et al. Global cancer statistics, 2012. CA Cancer J Clin 65, 87–108 (2015).
Ford, A. C., Forman, D., Hunt, R. H., Yuan, Y. & Moayyedi, P. Helicobacter pylori eradication therapy to prevent gastric cancer in healthy asymptomatic infected individuals: systematic review and meta-analysis of randomised controlled trials. BMJ 348, g3174 (2014).
Herrero, R., Parsonnet, J. & Greenberg, E. R. Prevention of gastric cancer. JAMA 312, 1197–1198 (2014).
Choi, I. J. Endoscopic gastric cancer screening and surveillance in high-risk groups. Clin Endosc 47, 497–503 (2014).
Leja, M., You, W., Camargo, M. C. & Saito, H. Implementation of gastric cancer screening - the global experience. Best Pract Res Clin Gastroenterol 28, 1093–1106 (2014).
Wang, X. et al. Autoantibody signatures in prostate cancer. N Engl J Med 353, 1224–1235 (2005).
Babel, I. et al. Identification of tumor-associated autoantigens for the diagnosis of colorectal cancer in serum using high density protein microarrays. Mol Cell Proteomics 8, 2382–2395 (2009).
Chapman, C. J. et al. Immunobiomarkers in small cell lung cancer: potential early cancer signals. Clin Cancer Res 17, 1474–1480 (2011).
Wang, P. et al. Evaluation of diagnostic value in using a panel of multiple tumor-associated antigens for immunodiagnosis of cancer. J Immunol Res 2014, 512540 (2014).
Anderson, K. S. et al. Protein microarray signature of autoantibody biomarkers for the early detection of breast cancer. J Proteome Res 10, 85–96 (2011).
Werner, S., Chen, H., Tao, S. & Brenner, H. Systematic review: serum autoantibodies in the early detection of gastric cancer. Int J Cancer 136, 2243–2252 (2015).
Waterboer, T. et al. Multiplex human papillomavirus serology based on in situ-purified glutathione s-transferase fusion proteins. Clin Chem 51, 1845–1853 (2005).
Brenner, H., Chang-Claude, J., Seiler, C. M., Rickert, A. & Hoffmeister, M. Protection from colorectal cancer after colonoscopy: a population-based, case-control study. Ann Intern Med 154, 22–30 (2011).
Brenner, H., Chang-Claude, J., Seiler, C. M., Sturmer, T. & Hoffmeister, M. Does a negative screening colonoscopy ever need to be repeated? Gut 55, 1145–1150 (2006).
Brenner, H., Chang-Claude, J., Seiler, C. M., Sturmer, T. & Hoffmeister, M. Case-control study supports extension of surveillance interval after colonoscopic polypectomy to at least 5 yr. Am J Gastroenterol 102, 1739–1744 (2007).
Hundt, S., Haug, U. & Brenner, H. Comparative evaluation of immunochemical fecal occult blood tests for colorectal adenoma detection. Ann Intern Med 150, 162–169 (2009).
Haug, U., Hundt, S. & Brenner, H. Quantitative immunochemical fecal occult blood testing for colorectal adenoma detection: evaluation in the target population of screening and comparison with qualitative tests. Am J Gastroenterol 105, 682–690 (2010).
Brenner, H., Tao, S. & Haug, U. Low-dose aspirin use and performance of immunochemical fecal occult blood tests. JAMA 304, 2513–2520 (2010).
Gao, L. et al. Helicobacter pylori infection and gastric cancer risk: evaluation of 15H. pylori proteins determined by novel multiplex serology. Cancer Res 69, 6164–6170 (2009).
Brenner, H., Arndt, V., Stegmaier, C., Ziegler, H. & Rothenbacher, D. Is Helicobacter pylori infection a necessary condition for noncardia gastric cancer? Am J Epidemiol 159, 252–258 (2004).
Brenner, H., Rothenbacher, D. & Weck, M. N. Epidemiologic findings on serologically defined chronic atrophic gastritis strongly depend on the choice of the cutoff-value. Int J Cancer 121, 2782–2786 (2007).
Gnjatic, S. et al. Seromic profiling of ovarian and pancreatic cancer. Proc Natl Acad Sci. USA 107, 5088–5093 (2010).
Zornig, I. et al. Prognostic significance of spontaneous antibody responses against tumor-associated antigens in malignant melanoma patients. Int J Cancer 136, 138–151 (2015).
Chen, H., Werner, S., Tao, S., Zornig, I. & Brenner, H. Blood autoantibodies against tumor-associated antigens as biomarkers in early detection of colorectal cancer. Cancer Lett 346, 178–187 (2014).
Sehr, P., Zumbach, K. & Pawlita, M. A generic capture ELISA for recombinant proteins fused to glutathione S-transferase: validation for HPV serology. J Immunol Methods 253, 153–162 (2001).
Brown, L. D., Cai, T. T. & DasGupta, A. Interval estimation for a binomial proportion. Stat Sci 16, 101–117 (2001).
R Core Team R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria URL http://www.R-project.org/ (2014).
Waddell, T. et al. Gastric cancer: ESMO-ESSO-ESTRO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann Oncol 24 Suppl 6, vi57–63 (2013).
Chen, X. et al. Serum anti-AEG-1 auto-antibody is a potential novel biomarker for malignant tumors. Oncol Lett 4, 319–323 (2012).
Diamandis, E. P. Cancer biomarkers: can we turn recent failures into success? J Natl Cancer Inst 102, 1462–1467 (2010).
Buchen, L. Cancer: Missing the mark. Nature 471, 428–432 (2011).
Wagner, P. D. & Srivastava, S. New paradigms in translational science research in cancer biomarkers. Transl Res 159, 343–353 (2012).
Moons, K. G. et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Ann Intern Med 162, W1–W73 (2015).
Rutjes, A. W. et al. Evidence of bias and variation in diagnostic accuracy studies. CMAJ 174, 469–476 (2006).
Ziegler, A., Konig, I. R. & Schulz-Knappe, P. Challenges in planning and conducting diagnostic studies with molecular biomarkers. Dtsch Med Wochenschr 138, e2–24 (2013).
Zhou, S. L. et al. Detection of autoantibodies to a panel of tumor-associated antigens for the diagnosis values of gastric cardia adenocarcinoma. Dis Esophagus 28, 371–379 (2015).
Zayakin, P. et al. Tumor-associated autoantibody signature for the early detection of gastric cancer. Int J Cancer 132, 137–147 (2013).
Chapman, C. J. et al. EarlyCDT(R)-Lung test: improved clinical utility through additional autoantibody assays. Tumour Biol 33, 1319–1326 (2012).
Jett, J. R. et al. Audit of the autoantibody test, EarlyCDT(R)-lung, in 1600 patients: an evaluation of its performance in routine clinical practice. Lung Cancer 83, 51–55 (2014).
Heemskerk, B., Kvistborg, P. & Schumacher, T. N. The cancer antigenome. EMBO J 32, 194–203 (2013).
Shimada, H., Noie, T., Ohashi, M., Oba, K. & Takahashi, Y. Clinical significance of serum tumor markers for gastric cancer: a systematic review of literature by the Task Force of the Japanese Gastric Cancer Association. Gastric Cancer 17, 26–33 (2014).
Mizuno, S. et al. Prescreening of a high-risk group for gastric cancer by serologically determined Helicobacter pylori infection and atrophic gastritis. Dig Dis Sci 55, 3132–3137 (2010).
Subbannayya, Y. et al. Identification of differentially expressed serum proteins in gastric adenocarcinoma. J Proteomics 127, 80–88 (2015).
Zeng, Z. et al. The diagnostic value of monoclonal gastric cancer 7 antigen: a systematic review with meta-analysis. Clin Exp Med 14, 337–343 (2014).
Liu, R. et al. A five-microRNA signature identified from genome-wide serum microRNA expression profiling serves as a fingerprint for gastric cancer diagnosis. Eur J Cancer 47, 784–791 (2011).
Zhu, X., Lv, M., Wang, H. & Guan, W. Identification of circulating microRNAs as novel potential biomarkers for gastric cancer detection: a systematic review and meta-analysis. Dig Dis Sci 59, 911–919 (2014).
Ruhaak, L. R. et al. The Serum Immunoglobulin G Glycosylation Signature of Gastric Cancer. EuPA Open Proteom 6, 1–9 (2015).
Heo, C. K., Bahk, Y. Y. & Cho, E. W. Tumor-associated autoantibodies as diagnostic and prognostic biomarkers. BMB Rep 45, 677–685 (2012).
Acknowledgements
We gratefully acknowledge the excellent cooperation of clinics and practices in patient recruitment and of Labor Limbach in sample collection. We thank Dr. Katja Butterbach and Ulrike Schlesselmann for their excellent work in laboratory preparation of blood samples and Ute Koch, Monika Oppenländer and Iris Kaiser for excellent technical assistance with the autoantibody measurements. We also gratefully acknowledge Isabel Lerch, Susanne Köhler, Utz Benscheid, Jason Hochhaus and Maria Kuschel for their contribution in data collection, monitoring and documentation. This work was supported by iMed - the Helmholtz Initiative on Personalized Medicine. The BliTz study was partly funded by grants from the German Research Council (DFG, Grant No. BR1704/16-1). The ESTHER I and II studies were partly funded by Baden Württemberg State Ministry of Science, Research and Arts and by the German Federal Ministry of Education and Research. The VERDI study was partly funded by a grant from the German Cancer Aid (Deutsche Krebshilfe). The work of H.C. was partly supported by China Scholarship Council (CSC) and D.J. and I.Z. received funding from the Wilhelm Sander Foundation (Grant No. 2008.091.1).
Author information
Authors and Affiliations
Contributions
H.B. designed and supervised the study. All authors contributed to data collection. H.B., H.C. and S.W. developed the analysis strategy. S.W. conducted the statistical analyses and drafted the manuscript. All authors contributed to interpretation of data, critically reviewed the article and approved the final version submitted.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Electronic supplementary material
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Werner, S., Chen, H., Butt, J. et al. Evaluation of the diagnostic value of 64 simultaneously measured autoantibodies for early detection of gastric cancer. Sci Rep 6, 25467 (2016). https://doi.org/10.1038/srep25467
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep25467
This article is cited by
-
Identifying Novel Biomarkers Ready for Evaluation in Low-Prevalence Populations for the Early Detection of Upper Gastrointestinal Cancers: A Systematic Review
Advances in Therapy (2021)
-
Serum p53 antibody: useful for detecting gastric cancer but not for predicting prognosis after surgery
Surgery Today (2020)
-
Autoantibody against 14-3-3 zeta: a serological marker in detection of gastric cancer
Journal of Cancer Research and Clinical Oncology (2019)
-
Serum microRNA signatures and metabolomics have high diagnostic value in gastric cancer
BMC Cancer (2018)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.