Reporting of prognostic markers: current problems and development of guidelines for evidence-based practice in the future

Riley, R D; Abrams, K R; Sutton, A J; Lambert, P C; Jones, D R; Heney, D; Burchill, S A

doi:10.1038/sj.bjc.6600886

Download PDF

Clinical
Open access
Published: 15 April 2003

Reporting of prognostic markers: current problems and development of guidelines for evidence-based practice in the future

R D Riley¹,
K R Abrams¹,
A J Sutton¹,
P C Lambert¹,
D R Jones¹,
D Heney² &
…
S A Burchill³

British Journal of Cancer volume 88, pages 1191–1198 (2003)Cite this article

1538 Accesses
123 Citations
12 Altmetric
Metrics details

This article has been updated

Abstract

Prognostic markers help to stratify patients for treatment by identifying patients with different risks of outcome (e.g. recurrence of disease), and are important tools in the management of cancer and many other diseases. Systematic review and meta-analytical approaches to identifying the most valuable prognostic markers are needed because (sometimes conflicting) evidence relating to markers is often published across a number of studies. To investigate the practicality of this approach, an empirical investigation of a systematic review of tumour markers for neuroblastoma was performed; 260 studies of prognostic markers were identified, which considered 130 different markers.

The reporting of these studies was often inadequate, in terms of both statistical analysis and presentation, and there was considerable heterogeneity for many important clinical/statistical factors. These problems restricted both the extraction of data and the meta-analysis of results from the primary studies, limiting feasibility of the evidence-based approach.

Guidelines for reporting the results of primary prognostic marker studies in cancer, and other diseases, are given in order to facilitate both the interpretation of individual studies and the undertaking of systematic reviews, meta-analysis and, ultimately, evidence-based practice. General availability of full individual patient data is a necessary step forward and would overcome the majority of problems encountered, including poorly reported summary statistics and variability in cutoff level, outcome assessed and adjustment factors used. It would also limit the problem of reporting bias, although publication bias will remain a concern until studies are prospectively registered. Such changes in practice would help important evidence-based reviews to be conducted in order to establish the most appropriate prognostic markers for clinical use, which should ultimately improve patient care.

Main

Prognostic markers (also called prognostic variables or factors) are relevant tools in the management of patients with cancer, and also many other medical conditions, because they help to stratify patients for treatment by identifying different risk groups in order to reduce morbidity and mortality. They include biological, clinical, genetic, histological and pathological features. For example, carcinoembryonic antigen (CEA) is a prognostic marker in colorectal cancer (Eche et al, 2001).

An evidence-based approach to identifying the most valuable prognostic markers for a given disease is clearly important because it is common for evidence relating to markers to be published across a number of studies, often with conflicting results (Altman, 2001, pp 228–247; Riley et al, 2003a). Furthermore, a frequent difficulty in assessing the clinical value of prognostic markers is the relatively small number of patients in primary research studies, sometimes a consequence of disease rarity and limited resources, such that each primary study has low statistical power for detecting any benefits of prognostic staging. The use of systematic review, and in particular meta-analysis, methodology may therefore be important and allow a useful assessment of the prognostic power of markers (Altman, 2001, pp 228–247). A systematic review is the preferred means of identifying and combining existing evidence (Egger et al, 2001). Meta-analysis is the statistical analysis of the review, which seeks to combine all the relevant results found from the literature identified in a quantitative way to produce results more precise than is possible from the individual studies (Sutton et al, 2000).

In this paper, we use a recently performed systematic review of prognostic tumour markers studied in neuroblastoma to demonstrate the problems encountered when using this approach, and highlight how they limit evidence-based practice. We then generalise the problems to other areas of oncology, and indeed other disease settings, and ultimately provide specific guidelines for reporting primary prognostic marker studies.

Methods

Neuroblastoma is a neuroblastic tumour of the primordial neural crest and is the most common extracranial solid tumour of childhood. The study of prognostic markers for this disease forms an active research area within which a large body of evidence exists. This makes it an appropriate area for an empirical investigation, and as such the problems identified in this study are highly likely to generalise to other disease settings. A brief description of the systematic review strategy adopted is now given.

Search strategy

The three on-line bibliographic databases Medline, Embase and Cancerlit were chosen as a basis for identifying the relevant literature from 1966 to February 2000. Papers written in a non-English language were excluded. A full description of the search strategy and inclusion/exclusion criteria is provided in Riley et al (2003b). One investigator performed the assessment of the papers, with second and third investigators independently checking a sample of them. To be included in the review, a paper had to provide a quantitative result or give tabulated individual patient data evaluating the use of a tumour marker in neuroblastoma from a primary research study of humans. To be classified as relevant to prognosis, a paper had to present data, in the form of summary statistics or individual patient data, relating tumour marker levels at a measured point in time to the outcome of patients at the end of a specific follow-up period. Owing to the large number of potential markers, we focused on genetic/biological markers rather than histological markers.

Data extraction for meta-analysis

From each of the papers included, information was extracted on the tumour markers studied. Meta-analysis of those markers on which eight or more papers provided data was considered. The log_e(hazard ratio) and its variance were the essential information required from each study, as they provide an important comparative estimate of the risk of death/disease recurrence between two groups of patients. Furthermore, there are several indirect estimation methods available when these statistics are not directly reported (Parmar et al, 1998), and the log_e(hazard ratio) has an approximate normal distribution for large samples, making it particularly amenable to meta-analysis techniques. We make the assumption of proportional hazards throughout this paper.

It was common for a paper to report more than one prognostic result by relating one or more markers to overall survival and/or disease-free survival, and also by providing unadjusted and/or adjusted results (e.g. adjusted for age, stage of disease). Estimates of the log_e(hazard ratio) and its variance comparing two groups defined by a single marker level were sought from all the overall survival and disease-free survival reports. An unadjusted estimate was preferred for each as prior knowledge indicated that adjusted results were likely to be highly inconsistent in the factors for which adjustment was made (Altman, 2001, pp 228–247). An adjusted estimate was sought in the absence of an unadjusted result. Although some markers take only binary values (e.g. chromosome 1p – deletion or no deletion), it was also usual for primary studies to dichotomise continuous variables using a cutoff level in order to categorise patients into high- and low-risk groups.

A five-step sequential process (Figure 1) using 10 different direct and indirect methods (Table 1), based on the approach of Parmar et al (1998), was used in an attempt to obtain the log_e(hazard ratio) and its variance. Studies with samples smaller than 25 were not included in Steps 2–5 because they were not considered large enough to justify estimation methods. A more detailed version of Figure 1 and a more in-depth description of the extraction procedure are provided in Riley et al (2003b).

Table 1 Description of the methods used to obtain estimates of the log_e(hazard ratio) and its variance

Full size table

Results

Literature search results

A total of 3415 papers were identified from the literature search. After assessment, 260 papers were classified as ‘relevant’ to prognosis, and these studied a total of 130 different tumour markers for risk stratification of patients (for references see Riley et al, 2003b).

Data extraction of prognostic marker results

The 13 most commonly studied prognostic markers were each selected for an in-depth study to establish their individual value as a prognostic tool (Table 2). Expression of CD44 gene was studied in eight papers and the other 12 markers were studied in 10 or more prognosis papers (Table 2). This involved 211 (81.2%) of all the prognosis papers. Within these there were 575 reports of prognostic power assessment where levels of any of these 13 tumour markers were related to overall survival or disease-free survival by summary statistics or IPD.

Table 2 Names of the 13 markers grouped by tumour marker class, with the number of prognosis papers identified for each, the number of reports when it was related to either overall or disease-free survival by summary statistics or individual patient data, and the number of successful estimates made of the log_e(hazard ratio) and variance; evidence of heterogeneity is shown for outcome, cutoff levels, age, stage and adjustment factors

Full size table

Only 204 (35.5%) estimates of both the log_e(hazard ratio) and its variance could be calculated from Steps 1–5 using Methods 1–10 (Figure 1, Table 1). In particular, the log_e(hazard ratio) and its variance were both directly provided on only three occasions in the 575 reports (0.005%) (Table 1, Method 1), and all were from a single paper (Berthold et al, 1997). Fortunately, individual patient data were frequently presented within this literature, and from this a further 41 direct estimates were made (Step 2, Method 2). The remaining 160 successful estimates were obtained using Methods 3–10 (Table 1), the most frequently required of which used a P-value/χ²-statistic in combination with group numbers and total number of events, that is, deaths/recurrences of disease (102 times) (Methods 8 and 9).

Problems limiting meta-analysis

Poor reporting of primary studies

Primary studies of prognostic tumour markers are clearly essential and we observed many important results across the literature that have implications for clinical practice. However, the general standard of reporting primary studies was inadequate, and it was disappointing that we only managed to obtain 35.5% of the estimates required despite the intensive, time-consuming extraction procedure (Figure 1). This hindered the use and interpretation of meta-analysis because we could not incorporate the majority of results reported in the literature and consequently introduced a strong potential for bias. Among the 371 reports that did not enable estimates to be made, there were five common reporting problems, most of which can be simply addressed (Figure 2. Encouragingly, there was some evidence that the reporting of prognostic markers has improved over the last 10 years because all the papers that did provide a hazard ratio or log_e(hazard ratio) were published after 1990. However, these papers still only represented approximately 17% of the total literature identified over this period (i.e. only 26 out of 157 papers published after 1990 reported a hazard ratio).

Heterogeneity of clinical and statistical factors

The synthesis of our estimates was also restricted by the large variability in both clinical and statistical factors. For each estimate of the log_e(hazard ratio) and its variance obtained, the cutoff level used to dichotomise the continuous markers, stage of disease, age of patients and outcome (overall or disease-free survival) were recorded, and also whether the estimate was unadjusted or adjusted and, if so, what adjustment factors were used. There was great diversity in these features (Table 2). For example, for the marker MYC-N there were 94 estimates of the log_e(hazard ratio) and variance obtained but these involved nine different cutoff points, nine different stage groups, four different age groups, 77 unadjusted/17 adjusted estimates and two different outcomes (Table 3). Furthermore, of the 17 estimates that were adjusted for other prognostic markers or clinical features (using a Cox regression model) only two were adjusted for exactly the same set of factors, and these were from the same article (Maris et al, 2000).

Table 3 Heterogeneity in the 94 estimates of the log_e(hazard ratio) and its variance obtained for marker MYC-N

Full size table

This inconsistent and variable reporting was reflected equally in the estimates obtained for the other 12 markers (Table 2). The type of treatment of patients and the method of measuring the markers were not recorded, but both would have added further heterogeneity to that observed.

Publication bias and reporting bias

The common problem of publication bias, and other reporting biases, may still affect our data extraction; some results that do not generate formal statistically significant or clinically valuable findings may not have been published, because of a reluctance of journals to report or of researchers to present negative findings. Such problems severely limit the conclusions that can be drawn from meta-analyses because not all the available evidence can be included, and therefore the pooled results are likely to be biased. We investigated the estimates obtained for MYC-N and indeed there did appear to be evidence of publication bias, with a number of studies with smaller hazard ratios considered to be missing (Riley et al, 2003). This problem is likely to be closely related to the problem of small sample sizes in some primary studies (Figure 2, key problem 5).

Should we proceed with meta-analysis?

The poor reporting, potential for publication bias and, in particular, the large heterogeneity across studies meant it was practically impossible to perform reliable meta-analyses that would determine the clinical importance of each marker studied. Even the analysis of subgroups of estimates was not considered realistic because it was virtually impossible to obtain subgroups that reflected patients with similar features. For example, for marker MYC-N there were 48 overall survival estimates obtained, of which 41 were unadjusted, and 30 related to ‘all’ stages and ‘all’ ages. Furthermore, only eight of these 30 estimates related to the most commonly used cutoff level of ‘1 copy number’, and there is then the additional problem of heterogeneity for treatment used and method of measuring the marker, not to mention the potential impact of publication/reporting bias. The subgroup numbers were even smaller for the other, less-studied markers; for example, lactate dehydrogenase had only two unadjusted overall survival estimates relating to the most common cutoff level (1500 U l⁻¹) and patients of ‘all’ ages and ‘all’ stages.

The only possible benefit of meta-analysis using the estimates that we extracted is to highlight the results of previous studies and help prioritise which markers should be studied in the future. We take such an approach elsewhere (Riley et al, 2003b), but for the purposes of this feasibility study it is clear that no firm clinical policy decisions can be made from our evidence-based review.

Discussion

Appraisal of the systematic review and data extraction

During the systematic review, we evaluated 3415 papers overall and identified 260 with results from studies assessing the prognostic power of tumour markers. This will have identified the majority of the English-language literature, but inevitably some papers will have been excluded unintentionally. However, it seems plausible that the reporting in such papers, and equally non-English papers, would be equally poor and heterogeneous.

We used the indirect methods suggested by Parmar et al (1998) to increase the number of occasions an estimate of the log_e(hazard ratio) and its variance could be obtained. However, the estimates they provide are only approximate and simply make the best possible use of the results presented. Questions still exist about how best to combine indirect estimates with direct estimates. For this reason, we did not use other indirect methods. For example, given further assumptions, we could have used estimates of the proportion surviving to 2, 3, 5 or 10 years to obtain estimates of the log_e(hazard ratio) and its variance (Vale et al, 2002). However, the papers were equally inadequate at presenting these survival statistics. For example, in the 26 prognosis papers for the serum marker lactate dehydrogenase, only 12 gave actuarial estimates of the proportion surviving, and only six of these also gave a confidence interval or standard error. They were also heterogeneous–five estimates were for overall survival, six were for disease-free survival and one was unspecified; estimates were made at 2, 3, 4 or 5 years. Further, very few reported numbers at risk explicitly, as required for reliable estimation.

Generalisations to other prognostic markers

Although these reporting problems were observed for tumour markers within the neuroblastoma literature, they have also limited reviews in other paediatric cancers (Riley et al, 2003a), and it seems plausible that the reporting will be equally poor for prognostic markers in other areas of oncology, and indeed other disease areas. Altman (2001, pp 228–247) discusses the potential problems involved in systematic reviews of prognostic markers, in particular that of poor and heterogeneous reporting of primary studies. Cutoff points are frequently used to dichotomise continuous markers and define groups, while different outcomes, adjustment factors and groups of patients are common features across prognostic studies. Inadequate reporting and presentation of survival data has been shown to be a concern in the cancer literature (Altman et al, 1995).

Reliable and clinically useful meta-analyses of observational and nonrandomised studies, such as the majority of prognostic marker studies, are generally difficult to perform (Fleiss and Gross, 1991). Other recent systematic reviews of prognostic markers have encountered similar problems to the ones we identified. Parker et al (2001) performed a systematic review in prostate cancer to establish whether age is a prognostic marker, but the incomplete and heterogeneous nature of the reports prohibited any quantitative overview. Similarly, a systematic review of prognostic laboratory variables in patients with unresected colorectal liver metastases was limited by the heterogeneity and poor quality of individual studies (Friedburg et al, 2001). Zandbergen et al (2001) performed a systematic review of biochemical markers of brain damage for identifying poor outcome in anoxic-ischaemic coma, but conclusions were limited by small sample sizes and different cutoffs and/or laboratory techniques.

Meta-analyses of prognostic markers have been facilitated when individual patient data were available (Look et al, 2002), in particular to determine a consistent cutoff level (Sakamoto et al, 1996). For those investigators currently interested in performing a quantitative review of prognostic markers, we recommend that they consider asking authors for individual patient data and/or the extra information they require, such as the log_e(hazard ratio) and its variance, as this approach is likely to be the most productive.

Towards guidelines for improved reporting of prognostic markers

It is clearly important that the quality of primary studies, and the reporting of their results improve if clear conclusions and policy recommendations are to be formed about prognostic markers. Altman and Lyman (1998) have proposed important guidelines for both conducting and evaluating prognostic marker studies, including the need for prospective registration of studies. Alongside these, we have developed simple guidelines on how to report results to facilitate both interpretation of individual studies and the undertaking of systematic reviews, meta-analysis and, ultimately, evidence-based practice (Figure 3. Collaboration of research groups is required to promote such practice and achieve both the consistency and standards required. Ideally, both summary data and individual patient data should be reported according to our guidelines. It is important that time to event is incorporated within prognostic marker analyses, and thus the hazard ratio is preferred to other measures of relative risk such as the odds ratio, which relates to a fixed time-point and ignores censoring. However, in addition authors may wish to present the more familiar actuarial % survival at n years preferably with a confidence interval and the number of patients at risk at that time in each group.

Benefits of individual patient data

Although improved reporting of summary statistics is very important, the availability of individual patient data is the most viable way forward in order to produce valid and clinically useful evidence-based reviews of prognostic markers. Subject to any restrictions imposed by data protection laws and guidelines, presentation or availability of full individual patient data using our guidelines would overcome variability in cutoff level, type of estimate (unadjusted or adjusted), outcome assessed (overall or disease-free survival) and adjustment factors; the study of markers in subgroups of patients (e.g. different ages, treatments) would also be easier. It would also eliminate the problem of extracting estimates when inexact P-values are presented, and would remove the need for arbitrary extraction decisions when an individual study presents a marker's results for a range of cutoff values. Furthermore, if levels of all the prognostic markers measured (even those producing nonsignificant results) are provided, then the problem of reporting bias would be reduced. However, publication bias might still be a concern if some studies are not published and do not make IPD available; prospective registration of studies is therefore also important to counteract this.

Individual patient data would also enable direct estimates of the hazard ratio, and other statistics of interest, when data were available but not used, analysed or presented properly in the primary study. A total of 41 (20%) of the 204 estimates that we obtained in the neuroblastoma review were direct estimates calculated from individual patient data that would not have otherwise been possible. It is clearly important to include predominately direct estimates in any quantitative synthesis. In fact, the potential for substantial differences in meta-analysis of survival data when using results provided within the literature instead of individual patient data has recently been shown in the head and neck cancer literature (Duchateau et al, 2001). Individual patient data would also allow model assumptions, for example proportional hazards, to be checked as necessary, and enable the baseline survival function to be estimated.

Presentation or availability of individual patient data would permit more appropriate meta-analyses (Stewart and Parmar, 1993), and would further facilitate the identification of different publications whose results relate to the same or overlapping set of patients. It would also allow an evaluation of combinations of markers, which may produce more specific and accurate prognostic assessments. If it is not appropriate or feasible to provide individual patient data within a paper itself, then there is the opportunity to publish on the Internet (Hutchon, 2001). Of course, even making individual patient data available on the web is not without its problems, with the nonpermanency of individual web-pages, and so perhaps a central repository to collate and manage individual patient data is needed within each disease area. The United Kingdom Children's Cancer Study Group have already initiated this type of approach within paediatric oncology (Mott et al, 1997). Authors may also wish to state in their paper that the IPD is available upon request (with contact details indicated) for those requiring it for evidence-based reviews.

We acknowledge that there are additional issues that arise when conducting individual patient data reviews (Stewart and Clarke, 1995), especially cost and time, but these have to be weighed against the substantial problems we encountered. Of course, even when prioritising the IPD approach, the meta-analyst will in practice end up with a mixture of estimates obtained from IPD and estimates obtained from summary statistics; hence, meta-analysis methods that take these different sources into account are needed.

Cutoff levels

The use of different cutoffs makes synthesis of results particularly difficult. Of added concern is the possibility that the choice of cutoff level in a report may be specifically chosen to optimise the difference between the groups and produce a result with the maximum statistical or clinical significance possible (Altman et al, 1994; Altman and Lyman, 1998). If there is good clinical reason to use a cutoff level, then it should be specified at the start of a study and clearly reported within the results (Figure 3). However, Altman (2001) suggests that continuous markers should not be dichotomised because, among other reasons, this approach discards potentially important quantitative information and considerably reduces the power to detect a real association between the marker and outcome. Hence, we encourage researchers to analyse and report results (e.g. hazard ratio) of continuous markers on their original continuous scale. Importantly, availability of individual patient data including exact marker levels would allow data to be reanalysed where cutoff levels were not consistent, and also where continuous marker results were desired but results using a cutoff level were given (or vice versa) (Figure 3). Indeed, the most appropriate analysis of continuous prognostic markers may require nonlinear modelling techniques, as highlighted by Sauerbrei et al (1999); consultancy with statisticians or others experienced with such techniques is recommended in this situation.

Adjustment factors

It is clear that once important prognostic markers have been identified, they need to be evaluated against, and also used in combination with, other known clinically useful prognostic factors, such as clinical characteristics (e.g. age, stage of disease) or indeed other marker levels. Prognostic marker results that are adjusted for other known prognostic factors will have the greatest implications for clinical practice, and subsequently meta-analyses of adjusted results are the necessity. However, if authors are inconsistent in the sets of adjustment factors they use, it becomes very difficult and impractical to pool results across studies and make a proper evaluation of markers over and above other factors. For the 17 adjusted MYC-N estimates, there were 16 different sets of adjustment factors, each containing one or more of age, stage of disease, Shimada index, lactate dehydrogenase and eight other prognostic markers. Individual study estimates of risk (e.g. hazard ratio) can be influenced by which adjustment factors are used (Rushton and Jones, 1992), and so there may be an additional reporting bias concern if researchers specifically only report those adjusted estimates with the most statistically significant result.

We recommend that research groups collaborate and identify the most commonly used prognostic tools in current practice, so that adjusted results of new prognostic marker studies can use consistent sets of adjustment factors. These identified prognostic tools should also be presented within the available individual patient data alongside the new markers being studied (Figure 2). This would allow adjusted results to be calculated independently across studies, using consistent sets of adjustment factors. Prognostic indexes could also be calculated across studies and evaluated in a meta-analysis if desired. Lambert et al (2002) have shown that individual patient data are generally required when investigating patient characteristics as effect modifiers in a meta-analysis, and, for prognostic markers, our study shows that the most valid and clinically useful meta-analysis results will only be obtained from an individual patient data approach.

Conclusion

Prognostic markers are important tools in the management of patients with cancer and many other diseases, and as such primary studies of prognostic markers are essential. However, the design and evaluation of such studies can be greatly improved (Altman and Lyman, 1998). Furthermore, we have shown that a change in how prognostic marker studies are reported is needed to provide more effective and meaningful results, and also allow important evidence-based reviews to be conducted. To facilitate such improved reporting, we have attempted to compile guidelines regarding how summary statistics and individual patient data should be presented. In particular, the availability of full individual patient data, including all markers considered, is the most viable way forward to produce valid and clinically useful evidence-based reviews and meta-analyses. Individual patient data would limit the large problems of poor and heterogeneous reporting that we observed, and also reduce the potential impact of reporting bias. Prospective registration of studies alongside the availability of IPD would also help restrict the potential for publication bias. These guidelines all point to researchers working together towards planned pooled analyses, currently a particularly important concept for epidemiological research (Blettner et al, 1999).

Research groups within each disease area should be encouraged to collaborate and facilitate these changes in practice; for example, by defining a clear set of important adjustment factors and by initiating central repositories to collate and manage individual patient data. This move towards a more evidence-based approach to the study and reporting of prognostic markers will help properly establish the most appropriate individual, and potential combinations of markers to be used in clinical practice, and should thereby improve patient care.

Change history

16 November 2011
This paper was modified 12 months after initial publication to switch to Creative Commons licence terms, as noted at publication

References

Altman DG, Lausen B, Sauerbrei W, Schumacher M (1994) Dangers of using ‘optimal’ cutpoints in the evaluation of prognostic factors. J Natl Cancer Inst 86: 829–835
Article CAS Google Scholar
Altman DG, Lyman GH (1998) Methodological challenges in the evaluation of prognostic factors in breast cancer. Breast Cancer Res Treat 52: 289–303
Article CAS Google Scholar
Altman DG (2001) Systematic reviews of studies of prognostic variables. In Systematic Reviews in Health Care: Meta-analysis in Context, Egger M, Davey Smith G, Altman DG (eds) pp 228–247. London: BMJ Publishing Group
Altman DG, De Stavola BL, Love SB, Stepniewska KA (1995) Review of survival analyses published in cancer journals. Br J Cancer 72: 511–518
Article CAS Google Scholar
Berthold F, Sahin K, Hero B, Christiansen H, Gehring M, Harms D, Horz S, Lampert F, Schwab M, Terpe J (1997) The current contribution of molecular factors to risk estimation in neuroblastoma patients. Eur J Cancer 33(12): 2092–2097
Article CAS Google Scholar
Blettner M, Sauerbrei W, Schlehofer B, Scheuchenpflug T, Friedenreich C (1999) Traditional reviews, meta-analyses and pooled analyses in epidemiology. Int J Epidemiol 28(1): 1–9
Article CAS Google Scholar
Duchateau L, Pignon JP, Bijnens L, Bertin S, Bourhis J, Sylvester R (2001) Individual patient-versus literature-based meta-analysis of survival data: time to event and event rate at a particular time can make a difference, an example based on head and neck cancer. Controlled Clin Trials 22(5): 538–547
Article CAS Google Scholar
Eche N, Pichon MF, Quillien V, Gory-Delabaere G, Riedinger JM, Basuyau JP, Daver A, Buecher B, Conroy T, Dieu L, Bidart JM, Deneux L (2001) Standards, options and recommendations for tumor markers in colorectal cancer. Bull Cancer 88(12): 1177–1206 [French]
CAS PubMed Google Scholar
Egger M, Davey Smith G, Altman DG (eds) (2001) Systematic Reviews in Health Care: Meta-analysis in Context. London: BMJ Publishing Group
Book Google Scholar
Fleiss JL, Gross AJ (1991) Meta-analysis in epidemiology, with special reference to studies of the association between exposure to environmental tobacco smoke and lung cancer: a critique. J Clin Epidemiol 44: 127–139
Article CAS Google Scholar
Friedberg B, Watine J, Miedouge M (2001) Unresected colorectal liver metastases: prognostic value of laboratory variables. Gastroenterol Clin Biol 25(11): 962–966
CAS PubMed Google Scholar
Hutchon DJR (2001) Publishing raw data and real time statistical analysis on e-journals. Br Med J 322(7285): 530
Article CAS Google Scholar
Lambert PC, Sutton AJ, Abrams KR, Jones DR (2002) A comparison of summary patient-level covariates in meta-regression with individual patient data meta-analysis. J Clin Epidemiol 55(1): 86–94
Article CAS Google Scholar
Look MP, van Putten WL, Duffy MJ, Harbeck N, Christensen IJ, Thomssen C, Kates R, Spyratos F, Ferno M, Eppenberger-Castori S, Sweep CG, Ulm K, Peyrat JP, Martin PM, Magdelenat H, Brunner N, Duggan C, Lisboa BW, Bendahl PO, Quillien V, Daver A, Ricolleau G, Meijer-van Gelder ME, Manders P, Fiets WE, Blankenstein MA, Broet P, Romain S, Daxenbichler G, Windbichler G, Cufer T, Borstnar S, Kueng W, Beex LV, Klijn JG, O'Higgins N, Eppenberger U, Janicke F, Schmitt M, Foekens JA (2002) Pooled analysis of prognostic impact of urokinase-type plasminogen activator and its inhibitor PAI-1 in 8377 breast cancer patients. J Natl Cancer Inst 94(2): 116–128
Article CAS Google Scholar
Maris JM, Weiss MJ, Guo C, Gerbing RB, Stram DO, White PS, Hogarty MD, Sulman EP, Thompson PM, Lukens JN, Matthay KK, Seeger RC, Brodeur GM (2000) Loss of heterozygosity at 1p36 independently predicts for disease progression but not decreased overall survival probability in neuroblastoma patients: a Children's Cancer Group study. J Clin Oncol 18(9): 1888–1899
Article CAS Google Scholar
Mott MG, Mann JR, Stiller CA (1997) The United Kingdom Children's Cancer Study Group – the first 20 years of growth and development. Eur J Cancer 33(9): 1448–1452
Article CAS Google Scholar
Parker CC, Gospodarowicz M, Warde P (2001) Does age influence the behaviour of localized prostate cancer? BJU Int 87: 629–637
Article CAS Google Scholar
Parmar MKB, Torri V, Stewart L (1998) Extracting summary statistics to perform meta-analyses of the published literature for survival endpoints. Stat Med 17: 2815–2834
Article CAS Google Scholar
Riley RD, Burchill SA, Abrams KR, Heney D, Sutton AJ, Jones DR, Lambert PC, Young B, Wailoo AJ, Lewis IJ (2003a) A systematic review of molecular and biological markers in tumours of the Ewing's sarcoma family. Eur J Cancer 39: 19–30
Article CAS Google Scholar
Riley RD, Burchill SA, Abrams KR, Heney D, Lambert PC, Jones DR, Sutton AJ, Young B, Wailoo AJ, Lewis IJ (2003b) Systematic review Q2and evaluation of the use of tumour markers in paediatric oncology: Ewing's sarcoma and neuroblastoma. NHS Health Technology Assessment (No. 97/15/03) 2003. Vol. 7, No. 5
Rushton L, Jones DR (1992) Oral contraceptive use and breast cancer risk: a meta-analysis of variations with age at diagnosis, parity and total duration of oral contraceptive use. Br J Obstet Gynaecol 99: 239–246
Article CAS Google Scholar
Sakamoto J, Teramukai S, Koike A, Saji S, Ohashi Y, Nakazato H (1996) Prognostic value of preoperative immunosuppressive acidic protein in patients with gastric carcinoma. Findings from three independent clinical trials. Tumor Marker Committee for the Study Group of Immunochemotherapy with PSK for Gastric Cancer. Cancer 77(11): 2206–2212.
Article CAS Google Scholar
Sauerbrei W, Royston P, Bojar H, Schmoor C, Schumacher M (1999) Modelling the effects of standard prognostic factors in node-positive breast cancer. German Breast Cancer Study Group (GBSG). Br J Cancer 79(11–12): 1752–1760
Article CAS Google Scholar
Stewart LA, Clarke MJ (1995) Practical methodology of meta-analyses (overviews) using individual patient data. Cochrane Working Group. Stat Med 14: 2057–2079
Article CAS Google Scholar
Stewart LA, Parmar MKB (1993) Meta-analysis of the literature or of individual patient data: is there a difference? The Lancet 341: 418–422
Article CAS Google Scholar
Sutton AJ, Abrams KR, Jones DR, Sheldon TA, Song F (2000) Methods for Meta-analysis in Medical Research. London: John Wiley
Google Scholar
Vale CL, Tierney JF, Stewart LA (2002) Effects of adjusting for censoring on meta-analyses of time-to-event outcomes. Int J Epidemiol 31(1): 107–111
Article Google Scholar
Zandbergen EGJ, de Haan RJ, Hijdra A (2001) Systematic review of prediction of poor outcome in anoxic-ischaemic coma with biochemical markers of brain damage. Intens Care Med 27: 1661–1667
Article CAS Google Scholar

Download references

Acknowledgements

We wish to thank Doug Altman for his helpful comments and advice during the drafting of this paper. This work was funded by the NHS Health Technology Assessment (HTA) programme [Project no. 97/15/03].

Author information

Authors and Affiliations

Department of Epidemiology and Public Health, University of Leicester, 22-28 Princess Road West, Leicester, LE1 6TP, UK
R D Riley, K R Abrams, A J Sutton, P C Lambert & D R Jones
Department of Medical Education, University of Leicester, University Road, Leicester, UK
D Heney
Cancer Research UK Clinical Centre, St James's University Hospital, Beckett Street, Leeds, UK
S A Burchill

Authors

R D Riley
View author publications
You can also search for this author in PubMed Google Scholar
K R Abrams
View author publications
You can also search for this author in PubMed Google Scholar
A J Sutton
View author publications
You can also search for this author in PubMed Google Scholar
P C Lambert
View author publications
You can also search for this author in PubMed Google Scholar
D R Jones
View author publications
You can also search for this author in PubMed Google Scholar
D Heney
View author publications
You can also search for this author in PubMed Google Scholar
S A Burchill
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R D Riley.

Rights and permissions

From twelve months after its original publication, this work is licensed under the Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Riley, R., Abrams, K., Sutton, A. et al. Reporting of prognostic markers: current problems and development of guidelines for evidence-based practice in the future. Br J Cancer 88, 1191–1198 (2003). https://doi.org/10.1038/sj.bjc.6600886

Download citation

Received: 29 April 2002
Revised: 27 January 2003
Accepted: 29 January 2003
Published: 15 April 2003
Issue Date: 22 April 2003
DOI: https://doi.org/10.1038/sj.bjc.6600886

Keywords

This article is cited by

Structured reporting to improve transparency of analyses in prognostic marker studies
- Willi Sauerbrei
- Tim Haeussler
- Marianne Huebner
BMC Medicine (2022)
Preoperative prognostic factors associated with postoperative delirium in older people undergoing surgery: protocol for a systematic review and individual patient data meta-analysis
- Tayler A. Buchan
- Behnam Sadeghirad
- Benjamin T. Dodsworth
Systematic Reviews (2020)
Outcomes and recurrence patterns according to breast cancer subtypes in Korean women
- Yoontaek Lee
- Eunyoung Kang
- Sung-Won Kim
Breast Cancer Research and Treatment (2015)
Protocol for a systematic review of the diagnostic and prognostic utility of tests currently available for the detection of aspirin resistance in patients with established cardiovascular or cerebrovascular disease
- Smriti Raichand
- David Moore
- David A Fitzmaurice
Systematic Reviews (2013)
Overview of data-synthesis in systematic reviews of studies on outcome prediction models
- Tobias van den Berg
- Martijn W Heymans
- Henrica CW de Vet
BMC Medical Research Methodology (2013)

Abstract

Main

Methods

Search strategy

Data extraction for meta-analysis

Results

Literature search results

Data extraction of prognostic marker results

Problems limiting meta-analysis

Poor reporting of primary studies

Heterogeneity of clinical and statistical factors

Publication bias and reporting bias

Should we proceed with meta-analysis?

Discussion

Appraisal of the systematic review and data extraction

Generalisations to other prognostic markers

Towards guidelines for improved reporting of prognostic markers

Benefits of individual patient data

Cutoff levels

Adjustment factors

Conclusion

Change history

16 November 2011

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Structured reporting to improve transparency of analyses in prognostic marker studies

Preoperative prognostic factors associated with postoperative delirium in older people undergoing surgery: protocol for a systematic review and individual patient data meta-analysis

Outcomes and recurrence patterns according to breast cancer subtypes in Korean women

Protocol for a systematic review of the diagnostic and prognostic utility of tests currently available for the detection of aspirin resistance in patients with established cardiovascular or cerebrovascular disease

Overview of data-synthesis in systematic reviews of studies on outcome prediction models

Search

Quick links