Introduction

Authorship in scientific publications documents research output and is conducive to one’s career1,2,3. Authorship was labeled as one of the major currencies in science4. Honorary authorship (HA) refers to authorship assigned to individuals that should not have been included as authors of a publication, because they made no or insufficient contributions to qualify as authors5. HA can occur when there is little external control, when major power imbalances between researchers exist, but also absent of those imbalances. To limit the inappropriate assignment of authorship and to reduce authorship disputes, various organizations have developed authorship guidelines6,7,8. Such guidelines are essential, but without their adoption by the pertinent journals, the practice is likely to continue9. The criteria for authorship developed by the International Committee of Medical Journal Editors (ICMJE)7 are the most well-known guidelines in the health sciences and are often used to assess if an honorary author was listed in a particular publication10,11,12. The ICMJE criteria have been modified over the years13, still some consider them too restrictive8 and their wording such as, ‘substantial’, ‘critically’, ‘important’, and ‘appropriately’ are prone to subjective interpretation.

Surveys may shed light on HA prevalence and its potential drivers and when repeated, they may be used to monitor the effects, if any, of initiatives aimed to reduce HA. We assessed HA prevalence in health sciences using five prespecified HA operationalizations. These five operationalizations all pertain to single target publications on which surveyees were questioned (Review items 1–5):

  1. 1.

    Researchers perceiving other co-author(s) as honorary author(s) on a publication.

  2. 2.

    Researchers having been approached by others to include honorary author(s) on a publication.

  3. 3.

    Researchers admitting being an honorary author on a publication.

  4. 4.

    Researchers admitting adding honorary author(s) on a publication.

  5. 5.

    Researchers admitting having approached others to include honorary author(s) on a publication.

Per our protocol, items review 1 and 2 were our primary objective, and 3–5 secondary. In addition, we planned: (a) to look at all five objectives separately when no authorship criteria were disclosed to survey respondents, e.g., based on the personal perception/opinion of the surveyee and when ICMJE-criteria were disclosed; and (b) to check for HA prevalence changes over time5. All the objectives were planned to be assessed only in surveys that asked for HA in one specific publication (e.g., researchers last publication), and not in surveys asking if researchers ever in their lifetime (all scholarly corpus) had at least one HA.

Methods

We reported the study according to the Preferred Reporting Items for Systematic review and Meta-Analysis (PRISMA)14,15. Table 1 lists key terminology5,7,16,17. The protocol for this systematic review was registered in the Open Science Framework (OSF) (https://osf.io/5nvar/) peer reviewed and published5. In this section we briefly report on the review methods. Details can be found in the protocol5 and Appendix. Differences between the protocol and the completed systematic review are explained in the (Appendix. Additional item A, pages 2 and 3). The main difference was omitting double arcsine transformation prior to statistical pooling, because recent work has shown it to be invalid in meta-analysis of proportions18,19.

Table 1 Glossary of terms.

Eligibility criteria

We included publications in health sciences which reported on results of surveys in any language, and in any setting and at any time point on a series of pre-defined authorship issues in one particular publication selected by the survey authors, i.e., the publication at issue20,21 (Appendix. Additional item B, page 4).

Information sources and search strategy

PubMed, Lens.org, and Dimensions.ai were searched for eligible surveys from inception until January 5 2023. No language or date filters were applied, except health sciences filters for the full search strategies for Lens.org and Dimensions.ai. References of the included studies were also searched for additional eligible surveys. Search strategies were piloted and can be found in the protocol5 (Appendix. Additional item C, page 5).

Survey selection process and data collection

The publication screening and data collection procedures were conducted by two authors (RMR and DC) independently. Disagreements were resolved through discussions, by contacting authors to obtain additional information, and through arbitration by a third author (GTR). Rayyan22 was used for title and abstract screening. Full texts of potentially eligible surveys were retrieved and assessed for eligibility. References of eligible studies were screened for additional eligible surveys. We report a list of all manuscripts that were initially selected for full text screening, but were excluded with rationale for doing so. All data to be extracted from the eligible surveys were collected in pilot-tested data forms (Appendix. Additional item D, pages 6–10).

Study risk of bias assessment

We developed a critical appraisal tool with a 14 items checklist to assess the methodological quality of each eligible result of each included survey5 (Appendix. Additional item E, pages 11 and 12). We also assessed how the non-implementation of these quality safeguards could have affected the results of the survey. In line with the AMSTAR-2 tool23 we labeled 7 of the 14 items as ‘critical’, due to their large influence on the validity of results. We adopted the 4 ratings of the AMSTAR-2 tool, i.e., ‘high’, ‘moderate’, ‘low’, and ‘critically low’ to rate the overall confidence for each assessed result. The Appendix (Additional item E, page 14) has details of ratings. All assessments and ratings were done by 2 reviewers (RMR and DC) independently and disagreements were resolved as stated above. Additional information on development and use of the critical appraisal tool is in the protocol5 and in the Appendix (Additional item E, pages 11–14).

Occurrence measures and synthesis methods

Prevalence was the occurrence measure used in the quantitative syntheses and in the presentation of single outcomes. Prevalences were given with their exact (Wilson) 95% confidence intervals. For each outcome we measured the prevalence as defined in our objectives. We also reported the response rates in each eligible survey. The definitions of all outcomes and the respective numerators and denominators followed our protocol (Appendix Table A15, page 15). The (target) publication on which the surveyee was surveyed was the unit of analysis for the 5 review items, i.e., only occurrences of HA issues in the specific publication singled out by the surveyors were assessed, and not for example occurrences of HA issues in all manuscripts a surveyee had ever published.

We first conducted a systematic narrative synthesis for all outcomes. Our protocol and the Appendix Additional item F, page 16) list the criteria for precluding meta-analyses. We pooled prevalences and displayed them in forest plots with their 95% confidence intervals using random effects models. The metaprop command in Stata 18 was used to perform the statistics24. We checked whether respondents participated in one survey more than once, i.e., whether surveyees had published multiple articles in the eligible time span and were asked to submit a questionnaire for each published article (Appendix Table A6, page 7). To deal with missing data we contacted the corresponding author or a co-author if the corresponding author did not reply within 2 weeks. Authors were contacted by email and 2 reminders were sent, one and two weeks after the initial email, respectively. When no response was received the data were coded as missing.

Investigation of heterogeneity and sensitivity analyses

We assessed the presence and extent of heterogeneity by visually inspecting the overlap of the confidence intervals in the forest plots, conducting the test of homogeneity (Chi2), and calculating the estimate of between study variance (tau2) and I2 to measure the inconsistency in the results25. We also sought explanations for diversity through meta-regression and subgroup analyses of both survey-and methodology-related explanatory variables5 (Appendix, Additional item F, page 16). We also considered specific issues to explore in sensitivity analyses, e.g., the impact of the quality or the characteristics of the survey design of certain reviews on the results of this systematic review5.

(Non) reporting bias assessment

We adopted the term non-reporting bias over reporting bias as suggested by Cochrane26. We used various strategies to address non-reporting biases as reported in our protocol and Appendix (Additional item G, page 17). We did not conduct tests for funnel plot asymmetry, because there is no evidence that proportional data adequately adjust for these graphical tests27.

Certainty assessment

We used the GRADE approach to assess the overall certainty of the body of evidence28. According to GRADE we assigned four levels of certainty: ‘high’, ‘moderate’, ‘low’, and ‘very low certainty’28. We presented the GRADE ratings for each outcome together with our rationales in a summary of findings table. Guidance for grading the certainty of evidence for a review item is in the Appendix (Additional item H, pages 18 and 19).

Results

Study selection and study characteristics

Figure 1 shows the study flow14,15. Our searches identified 1952 records. After deduplication, 1584 records remained for screening and 16 were eligible. The references of these 16 surveys provided another 3 eligible articles making 19 eligible surveys in total, which used 51 eligible questions to assess the prevalence of HA issues. The Appendix (Additional item I, page 20)shows all included surveys and full text reports that were assessed for eligibility and that were subsequently excluded with the rationales for exclusion (Appendix. Additional item J, pages 25–28). We contacted the authors of 15 of the 19 eligible surveys either to verify eligibility or to obtain missing information. The questionnaires for all 19 included surveys were reported in the pertinent manuscripts or obtained via the authors. Complete gender/sex breakdowns for all considered categories were not implemented in any of the included surveys and only the terms ‘males’ and ‘females’ were used29. Males were the predominant respondents (> 51%) in the 13 of 14 surveys that reported the sex distribution among respondents. The percentage of surveyees that were an associate professor or higher was at least 30% in 10 of 11 surveys reporting this proportion. The prevalences of the countries or continents of origin of surveyees could not be reliably extracted, because of imprecise, partial or non-reporting of this information. The Appendix (Additional item I, pages 21–24) shows additional characteristics of the 19 surveys.

Figure 1
figure 1

PRISMA flow diagram.

Assessment of methodological quality

The overall confidence in 24 results was rated as ‘critically low’ and ‘low’ for 27 results, i.e., based on the 14-item quality checklist5. Table 2 shows the 7 items of this checklist that we considered ‘critical’ alongside the frequency of their ratings30. The characteristics of responding surveyees and the review items were defined for all 51 questions (items 6 and 8). However, it was unclear whether the respondents were representative of the target population due to poor reporting on socio-demographics, response rates, and the intended target population. Further, various quality safeguards were not implemented and could have affected the results, i.e., limitations in the survey methods for 37% (19/51) of results, low response rates or weighting issues in most results (86% (44/51), and inadequate sample sizes for 35% (18/51) of results. The ratings for each of the 7 critical items of the 51 results are given in the Appendix (Additional item K, pages 29–32).

Table 2 Prevalence of answers to the seven critical items of the quality checklist.

Response rates

All response rates except 231,32, of the 19 eligible surveys were below 50% (Tables 3 and 4) (Appendix, Additional item L, pages 33–36). Three types of denominators were used to calculate these rates: N1: Number of emails with questionnaires sent, N2: Number of emails with questionnaires not bounced, N3: Number of emails with questionnaires for which the surveyee was available, which implies for example ‘not on vacation’, ‘not on strike’, ‘not on maternity leave’ etc. We prefer N2 or N3 over N1, but often only N1 was given. In Tables 3 and 4 we report which denominator was used to calculate response rates. When the same denominators were used, we synthesized these rates quantitatively using the random effects model (Tables 3 and 4) (Appendix, Additional item L, pages 33–36).

Table 3 Survey response rates and results. Part 1.
Table 4 Survey response rates and results. Part 2.

Results of individual surveys and of syntheses

Surveys used different questions to assess HA. In line with our protocol, we assigned these into the main HA categories in our review. A total of 51 questions provided 51 HA prevalences. Five identical questions (questions 1a, 1b, 1c, 2, and 4a listed in Tables 3 and 4) were used in multiple surveys, which permitted synthesizing of 45 HA prevalences in 5 meta-analyses (using the random effects model). The remaining 6 HA prevalences were obtained from six questions used only once (question 1d, 3a-c, and 4b). Tables 3 and 4 show all questions and results and Figs. 2, 3, 4, 5 and 6 the pertinent forest plots. Additional information on the results is in the Appendix (Additional item L, pages 33–36).

Figure 2
figure 2

Forest plot for Review item 1a. Perceived honorary authorship of a co-author (without researchers being referred to any specific criteria for authorship).*Effect Size (pooled percentage) 26% [95% CI 21–31]. Heterogeneity χ2 = 42.25 [df = 5]; P < 0.001. Variation in pooled percentage attributable to heterogeneity: I2 = 88.16%. Between-study variance estimate τ2 = 0.00. Test of pooled percentage = 0: z = 10.13; P < 0.001. *The results of each individual survey were based on the answers to the same question regarding a specific publication by the surveyee.

Co-author(s) as honorary author(s) on a publication (Review item 1)

Perceived HA of a co-author (without researchers being referred to any specific criteria for authorship). (Review item 1a)

A pooled average of 26% [95% CI 21–31] of researchers (based on data from 6 surveys, and a total of 2,758 respondents) perceived that at least one of their co-author(s) on the publication at issue had not contributed sufficiently to deserve authorship (Table 3) (Fig. 2). These judgments were self-perceived and not based on pre-defined criteria for authorship. Three of the 6 surveys were conducted on publications in the field of radiology. In five surveys, first authors were the target population (Appendix Table A17, page 22). The majority of the respondents in these surveys were male (> 51% in the 4 surveys that reported this prevalence) and the percentage of respondents being at least the rank of associate professor or having higher was 32% or more in 4 of the 5 surveys that reported this percentage (Appendix Table A18, page 23).

Perceived HA based on ICMJE criteria (Review item 1b)

A pooled average of 18% [95% CI 15–21] of researchers (based on data from 11 surveys, and a total of 4,272 respondents) perceived that at least one (of their) co-author(s) had not contributed sufficiently to deserve authorship on the publication at issue based on the ICMJE criteria for authorship, they were asked to consider (Table 3) (Fig. 3). The 11 surveys on this HA issue were conducted in a broad spectrum of biomedical fields (Appendix Table A17, page 22). Nine of these surveys were conducted on corresponding authors and the majority of respondents were male (> 61% in 8 of the 9 surveys that reported this percentage). The percentage of respondents at least of the rank of associate professors was higher than 32% in the 6 surveys that reported this proportion (Appendix Table A18, page 23).

Figure 3
figure 3

Forest plot for Review item 1b. Perceived honorary authorship based on ICMJE criteria*. Effect Size (pooled percentage) 18.0% [95% CI 15–21]. Heterogeneity χ2 = 85.56 [df = 10]; P < 0.001. Variation in pooled percentage attributable to heterogeneity: I2 = 88.71%. Between-study variance estimate τ2 = 0.00. Test of pooled percentage = 0: z = 10.37; P < 0.001. *The results of each individual survey were based on the answers to the same question regarding a specific publication by the surveyee.

HA based on researchers’ reported contributions of co-authors compared to the ICMJE criteria. (Review item 1c)

A pooled average of 51% (95% CI 47–56, based on 15 surveys and 5111 respondents) of HA was found when researchers were asked to declare their co-author(s’) contributions on the publication at issue (and those were then compared to ICMJE criteria by the researchers conducting the survey) (Table 3) (Fig. 4). These 15 surveys were conducted on researchers from various biomedical disciplines. Nine of these surveys were conducted on corresponding authors, 5 on first authors, and 1 on a mix of corresponding, first, and senior (last) authors. The majority of respondents were male (> 51% in the 11 of 15 surveys that reported this prevalence) and the percentage of respondents ranked at least associate professors was at least higher than 32% in most surveys. Table 3 and the Appendix (Appendix Tables A17 and A18, pages 22 and 23) show additional information.

Figure 4
figure 4

Forest plot for Review item 1c. Honorary authorship based on researchers’ reported contributions of co-authors compared to the ICMJE criteria*. Effect Size (pooled percentage) 51% [95% CI 47–56]. Heterogeneity χ2 = 147.74 [df = 14]; P < 0.001. Variation in pooled percentage attributable to heterogeneity: I2 = 90.52%. Between-study variance estimate τ2 = 0.01. Test of pooled percentage = 0: z = 22.75; P < 0.001. *The results of each individual survey were based on the answers to the same question regarding a specific publication by the surveyee.

Researchers having been approached by others to include honorary author(s) on a publication (Review item 2)

A pooled average of 10% [95% CI 9–12] of researchers (based on 11 surveys and a total of 3,663 respondents) had been approached by others to include honorary author(s) on the publication at issue (Table 3) (Fig. 5). Surveys asking this question did not report whether they provided researchers with the ICMJE criteria for authorship. The 11 surveys were conducted on a wide spectrum of biomedical publications. Seven of the surveys were conducted on corresponding authors, three on first authors, and 1 on a mix of corresponding, first, and senior (last) authors. The majority of respondents were male and the percentage of respondents that were at least associate professors at least 32% (Table 3, Appendix Table A18, page 23).

Figure 5
figure 5

Forest plot for Review item 2. Researchers having been approached by others to include honorary author(s) on a publication*. Effect Size (pooled percentage) 10% [95% CI 9–12]. Heterogeneity χ2 = 21.81 [df = 10]; P = 0.02. Variation in pooled percentage attributable to heterogeneity: I2 = 54.16%. Between-study variance estimate τ2 = 0.00. Test of pooled percentage = 0: z = 13.70; P < 0.001. *The results of each individual survey were based on the answers to the same question regarding a specific publication by the surveyee.

Researchers admitting being an honorary author(s) on a publication (Review item 3)

We found ranges from 0.15% (1/666) [95% CI 0.0038–0.83]31 to 33.3% (67/201) [95% CI 27–40]32 of researchers being classified as honorary author(s) based on the type of contribution(s) they reported on the publication at issue (Table 4). These self-declared contributions were made when researchers were told the ICMJE criteria, however, we were unable to meta-analyze these data, because of differently phrased questions, different lists of possible contributions, and even possible overlap of results (Appendix Tables A28–A31, pages 35 and 36).

Researchers admitting adding an honorary author(s) on a publication (review item 4)

A pooled average of 16% [95% CI 13–18] of first authors (first authors, based on 2 surveys31,33 and a total of 823 respondents) admitted having added an honorary author(s) on the publication at issue (Question 4a) (Table 4) (Fig. 6). In both surveys, first authors declared this when being told the ICMJE criteria for authorship. Additionally, one of these surveys33 asked the same question to the last authors on the same manuscript at issue and found that 11.5% (10/87) [95% CI 6–20] of last authors admitted having added an honorary author(s) (Question 4b) (Table 4). We did not pool this result, because the publication at issue was the unit of analysis and we therefore could not use the results for both first authors and last authors in the same meta-analysis (Appendix Tables A26 and A32, pages 34 and 36).

Figure 6
figure 6

Forest plot for Review item 4a. Researchers (First authors) admitting adding an honorary author(s) on a publication*. Effect Size (pooled percentage) 16% [95% CI 13–18]. Test of pooled percentage = 0: z = 12.53; P < 0.001. *The results of each individual survey were based on the answers to the same question regarding a specific publication by the surveyee.

Researchers admitting having approached others to include honorary author(s) on a publication (review item 5)

No surveys assessed this outcome.

Investigation of heterogeneity and sensitivity analyses

Visual inspection of the forest plots, the tests of homogeneity (Chi-squared), and the estimate of between study variance (tau2) of the 5 quantitatively synthesized results all demonstrated evidence of between-study heterogeneity (Figs.  2, 3, 4, 5 and 6). The variation in the prevalence proportions attributable to heterogeneity (I2) was larger than 88% for 3 out of 5 main outcomes (Figs.  2, 3, 4, 5 and 6). Meta-regression showed that the prevalence of HA based on researchers’ reported contributions of co-authors compared to the ICMJE criteria decreased by 1.3% per year between 2011 and 2021 [95% CI − 2.5 to − 0.07], p = 0.04) (Question 1c). The percentage of respondents being at least associate professors had no association with the prevalence of researchers that perceived other co-author(s) as honorary author(s) on a publication based on a list of co-author’s contributions (review question 1c) (Appendix Table A34, page 40).

Three of 11 subgroup analyses found a significant (p < 0.05) association between an explanatory variable and the quantitatively synthesized results for a specific survey question. When the survey was conducted by a research group whose first author was affiliated with a research institute in the Netherlands it was less likely (Q (1) = 5.60, p = 0.018) that researchers perceived other co-author(s) as honorary author(s) on a publication based on a list of co-author’s contributions (review question 1c) than when these surveys were conducted in countries other than the Netherlands (Table Question 1c). When the survey was conducted within 1 year after the publication of the manuscript, perceived honorary authorship based on ICMJE criteria (Question 1b) and honorary authorship based on researchers’ reported contributions of co-authors compared to the ICMJE criteria (Question 1c) were less likely (respectively Q (1) = 6.62, p = 0.01 for question 1b and (Q (1) = 5.79, p = 0.016 for question 1c) than when these surveys were conducted after more than 1 year (Table Question 1b and 1c). The Appendix (Additional item M, pages 37–43) reports all pre-specified explanatory variables that were assessed, why some of the predictors were excluded and has details on all investigations of heterogeneity. Sensitivity analyses were not conducted, because the sources in which eligible surveys were identified and the survey design and quality of the included surveys were similar.

Non-reporting biases in syntheses

An overall judgement about risk of bias due to missing results in a synthesis (non-reporting biases)26 was ‘moderate’ probability of risk of bias for 9 review items and ‘high’ probability of risk of bias for 2 review items. The Appendix (Additional item N, pages 44–47) reports the methods and results and explanations for the overall judgment on non-reporting biases.

Certainty of evidence

The certainty of evidence (GRADE)28 were either low or very low for all outcomes of this systematic review (Table 5). The rationales for assigning certainty grades are in the Appendix (Additional item O, page 49).

Table 5 Summary of findings.

Discussion

We identified 19 surveys assessing honorary authorship in health sciences using 51 different operationalizations and statistically pooled 45 of those into 5 review outcomes. Results indicate that HA prevalence was 26% when respondents were asked if there are honorary authors on their publication at issue, and not explicitly informing authors about criteria for authorship. The pooled prevalence was 18% when they were asked the same question but ICMJE criteria were disclosed to them, and 51% when respondents were asked to declare their co-author(s) contributions and these contributions were then compared to the ICMJE criteria independently. This indicates that how questions are asked may affect HA estimates, but also that what researchers perceive as HA and may differ from how authorship is defined using ICMJE criteria. Previous qualitative research also revealed these effects34.

Nevertheless, the apparent high prevalences of HA (regardless of how questions are phrased and definitions used), confirm previous findings that authorship issues are among the most prevalent Questionable Research Practices (QRPs) in science, affecting both young and old researchers35,36. A slight ray of hope is that we also found an indication that the prevalence of HA when respondents are asked to declare co-author(s) contributions and these are compared to the ICMJE criteria has been decreasing over time. While our study was not designed to detect reasons for this phenomenon, we hope it is due to the increase in raising awareness of QRPs, and greater emphasis on research integrity integration, as well as promotion of authorship templates at the start of a research project37. We also found that 10% of researchers stated they had been approached by others to include honorary author(s) on the paper at issue, and 16% of researchers who admitted having added an honorary author to that paper. These pooled prevalences are lower than those mentioned above, which is not surprising, as meta-analyses of falsified, fabricated or plagiarized data have also shown that researchers perceive others to be more fallible to committing the same fallacies than they themselves38,39.

Previous reviews on HA were narrative1,40,41, integrative42 or also systematic36, but reviewed predominantly other issues than those assessed in our systematic review. During our study selection, multiple surveys were excluded, because they had different objectives such as estimating the occurrence of ‘HA issues in past publications’43,44,45,46,47. Compared to that work, obviously, our focus on ‘the prevalence of HA issues in the publication on which researchers were surveyed’, yielded lower estimates.

The limitations of our study can be looked at on 2 levels, one for the review itself, and the second regarding the surveys we pooled. Regarding the review itself, we have assessed results of surveys only in the health sciences, and further research should be conducted on other disciplines. In addition, as our search strategy included full words such as authorship as did Marušić’ systematic review36, we may have missed studies that did not include those words in their abstract or title. As for the surveys, we undertook pooling of results even though our findings indicated the low methodological quality of surveys. We did so intentionally, as surveys remain the main mechanisms for estimating the prevalence’s of misconduct and QRP, and the low response rates and inadequate reporting has been found for surveys addressing many topics, which also led to creation of several survey reporting guidelines. When using the pooled estimates, we therefore recommend always keeping in mind that they originate from survey research and its inherent deficiencies48.

HA inflates the publication output by honorary authors, which could benefit their career and even income. HA is not a victimless crime, because it dilutes the output of those authors that did the work. It also harms the trust between researchers, creating stressful relationships and poor collaboration, which could even slow down the research progress. It is very likely that it also affects the research culture, which a recent a recent survey has found to be lacking49. Measuring the prevalence of HA issues is important, because it demonstrates the need to implement strategies to address these problems and shows whether these strategies are effective. The high prevalence of various honorary authorship issues identified in this systematic review of survey research showed the need tackle these problems. Multiple strategies to reduce HA issues such as courses in research integrity, publication ethics, authorship responsibilities of the pertinent stakeholders1,6, authorship/contribution discussions during the various research phases7, adopting and fine-tuning of authorship contributor criteria7,50, sanctions for honorary authorship, and considering and implementing new principles for assessing the quality of scientists51 have been proposed, but call for testing. This systematic review also demonstrated the need to improve the quality of survey research on HA issues.

Conclusions

The pooled estimates in this systematic review appear to confirm the idea that honorary authorship is a highly prevalent questionable research practice in the health sciences. However, due the poor quality of the surveys yielding the data, this conclusion cannot be made with certainty. Future research should be aimed at testing and implementing methods aimed at reducing this practice, and cultivating a culture that values quality over the quantity of publications.