Introduction

Dyspepsia is a complex symptom of gastroduodenal lesions represented by epigastric pain or burning, postprandial fullness, or early satiety. Although many cases are accompanied by an organic pathology that can explain the symptoms, over three-quarters of cases have no organic pathologies1,2. Such cases are called functional dyspepsia (FD) and have been defined by the Rome criteria for standardization. Although there are subtle differences depending on the criteria, it is commonly defined as a case where the above symptoms that cannot be explained by investigation including esophagogastroduodenoscopy (EGD) continue for more than 6 months3. The prevalence of FD varies depending on the literature and the way it is defined, but is known to be approximately 10–30%1. FD is the most common functional gastrointestinal disorder (FGIDs) and is estimated to have the highest healthcare burden4. FD can cause social and economic losses by increasing medical expenses and negatively impacting individual mental health5,6.

For these reasons, it is necessary to understand the global prevalence of FD, its trend over time, and changes due to the Rome criteria revision to better inform medical policies and determine directions for future research. There have been systematic reviews and meta-analyses on the global prevalence of uninvestigated dyspepsia (UD), indicating patients with dyspepsia who were not received routine investigation and multinational prevalence research conducted by the Rome Foundation based on the Rome IV criteria2,7,8,9. However, no study has comprehensively researched the global prevalence of FD alone based on the time periods of the Rome I to IV (1990 to 2022) criteria. In this study, we estimated the global prevalence of FD diagnosed by the Rome criteria, and also conducted subgroup analysis according to country, Rome criteria, survey method, study year, country level, geographical region, and sex.

Methods

We performed a systematic review and meta-analysis of relevant studies to investigate global and national trends in the prevalence of FD using the Rome criteria in different countries. Subgroup analysis was done by sex, continent, country level, Rome criteria used for diagnosis, and study year. This systematic review adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines10 and the protocol was registered with PROSPERO (registration no. CRD42022381641).

Search strategy and study selection

We searched the PubMed/MEDLINE, Embase, and Google Scholar databases from January 1, 1990, to September 22, 2022, to find studies that reported the incidence or prevalence of FD in adults according to the Rome Foundation diagnostic criteria. We limited the search to 1990 because the Rome criteria were first described in 1991. After excluding pediatric studies and those that did not use the Rome criteria, studies were required to have recruited participants from the general population or have used datasets that were not skewed so could somewhat represent the general population. Those who reported the prevalence of functional dyspepsia in skewed samples, such as university students, certain professions such as military personnel, or a specific sex or age, were excluded. Studies with the same target population or with fewer than 50 participants were also excluded.

The databases were searched using the terms “functional dyspepsia” or “chronic dyspepsia”, which were combined using the set operator “AND” with studies identified using the terms “incidenceorprevalence” as free text terms. A total of 2,917 studies were initially selected, and studies that did not meet the selection criteria by title and abstract were excluded. There were no language restrictions, and, if necessary, the contents were verified by translation. A recursive search was performed using the bibliographies of all eligible papers. Two investigators (KL and WC) independently assessed the eligibility of the study and disagreements were resolved through discussion with a third investigator (DKY).

Data analysis and quality assessment

Data were independently extracted by two investigators (KL and WC) using Microsoft Excel (version 2013; Microsoft, WA, US), with any discrepancies resolved by the opinion of a third investigator (DKY). The following data were collected for each study: study design, country, method of data collection (face-to-face interview, telephone interview, interviewer-administered questionnaire, self-completed postal questionnaire, self-completed questionnaire given to the participant at an appointment, or self-completed Internet-based questionnaire), Rome criteria used to define FD, total number of participants providing complete data, mean age of participants, number of participants with FD, number of male and female participants, and number of male and female participants with FD. When the prevalence of FD was reported according to more than one set of criteria within an individual study, we extracted the participants using the Rome definition. The adapted Newcastle–Ottawa Quality Assessment Scale (NOS)11 was used to evaluate the data quality for cross-sectional studies (Tables S1 and S2). This scale has a total score of nine points; high-quality studies have seven or more points, medium-quality ones have four to six points, and low-quality studies have three points or less.

A random effects model was used to calculate the pooled prevalence of FD. We performed extensive subgroup analyses stratified by country, sex, study year (1990–2002, 2003–2012, and 2013–2020), country income level (developed and developing countries), geographical region by continent (Africa, Asia, Australia, Europe, North America, and South America), Rome criteria used to define FD, and source of participant data (interview- and non-interview-based data). Countries were divided into developed and developing countries based on the UN Human Development Index (cut-off point 0.8). Based on the results of the subgroup analysis by study year. The global prevalence of FD was compared using DerSimonian and Laird (DL) pooled estimates12 with 95% confidence intervals (CIs). To reduce the error rate, the Hartung-Knapp-Sidik-Jonkman (HKSJ) method was also used in the analysis13. Heterogeneity was assessed using the I2 statistic14 and Egger’s test was performed for publication bias. The prediction interval (95% PI) was used to make our main results for the summary of estimates more robust in order to assess the uncertainty of our findings using Bayesian statistics15. Microsoft Excel (version 2013; Microsoft, WA, US) and R software (version 4.1.2; R Foundation, Vienna, Austria) were used to calculate the main results and generate all tables. A two-sided p value of < 0.05, was considered significant16,17,18,19,20,21,22,23,24,25.

Patient and public involvement

No patients were involved in setting the research question or outcome measures, nor were they involved in developing plans for the design or implementation of the study. None of the patients were asked to advise on the interpretation or writing of the results. However, we plan to disseminate the results of this study to all study participants or wider relevant communities on request.

Results

The search strategy identified 6,063 studies. Of these, 44 studies met the eligibility criteria8,9,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67, representing 80 separate adult study populations, comprising 256,915 participants from 40 countries worldwide (Figs. 1, 2 and Table 1). Most studies were conducted in a single country, with the exception of a three-nation study conducted in Canada, the UK, and the USA8 and a multinational survey conducted in 33 different countries9. FD data of patients from 1993 to 2020 were included; most were cross-sectional studies, but several population-based cohort studies, case–control studies, and randomized control studies that could calculate the prevalence were also included. Most of the study population was a randomly sampled general population; however, samples from health check-up visitors, general hospital visitors (not patients), and large-scale hospital staff were also included. Face-to-face interviews and internet-based self-completed questionnaires accounted for the majority of the survey methods, but there were also some methods using self-completed postal questionnaires, telephone interviews, or self-completed questionnaires at appointments. The detailed characteristics of all included studies are shown in Table S1.

Figure 1
figure 1

Literature search and study selection.

Figure 2
figure 2

A graphical abstract of the prevalence of functional dyspepsia by continent.

Table 1 Prevalence of functional dyspepsia.

The global pooled prevalence using the DL method of FD in the 80 included adult population was 8.4% (95% confidence interval [CI] 7.4–9.5; I2 = 98.8%), and the global prevalence using the HKSJ method was 9.9% (95% CI 8.5–11.3), with a 95% PI of 2.6–23.8 (Table 1). The lowest prevalence of FD reported was 0.7% (95% CI 0.5–1.0) in an Indian population cross-sectional study that administered the Rome IV questionnaire during face-to-face household interviews. The highest prevalence was 38.3% (95% CI 35.1–41.6), which was reported in a Pakistani population cross-sectional study that used the Rome III criteria in face-to-face interviews (Table 2).

Table 2 Prevalence of functional dyspepsia by country.

Prevalence of FD according to Rome criteria and survey year

In total, three studies stated that they used the Rome I criteria in three separate countries, five used the Rome II criteria in five separate countries, 27 used the Rome III criteria in 14 separate countries, and 45 used the Rome IV criteria in 38 separate countries. The pooled prevalence of FD was highest according to Rome I criteria and lowest according to Rome IV criteria (11.9% [95% CI 5.1–25.4] vs. 6.8% [95% CI 5.8–7.9], respectively). The prevalence using the HKSJ method also had the highest frequency when using Rome I criteria and the lowest frequency when Rome IV criteria was used (13.6% [95% CI 0.0–37.8] vs. 7.9% [95% CI 6.6–9.1], respectively). Table 2 summarizes the pooled data for the prevalence of FD worldwide according to country, using all iterations of the Rome criteria.

There was a total of 41 studies whose survey year was specified in the research methods: eight between 1990 and 2002, 15 between 2003 and 2012, and 53 between 2013 and 2020. The pooled prevalence was the highest between 1990 and 2002 and the lowest between 2013 and 2020 (12.4% [95% CI 8.2–18.3] vs. 7.3% [95% CI 6.1–8.7], respectively), and the prevalence using the HKSJ method showed similar results (Table 1). The prevalence of FD tended to decrease over time (Fig. 3).

Figure 3
figure 3

Global prevalence of functional dyspepsia from 1990 to 2020.

Prevalence of FD according to country income level and continent

53 out of the 80 surveys were conducted in developed countries, and the remaining 37 were conducted in developing countries. The prevalence in developed countries was 8.0% (95% CI 6.8–9.3) using the DL method and 10.0% (95% CI 8.3–11.7) using the HKSJ method. In developing countries, it was 9.1% (95% CI 7.4–11.2) and 12.1% (95% CI 9.5%–14.8%), respectively, showing a higher trend in developing countries (Table 1).

When the included study population was classified by continent, Asia had the largest number of studies (36 studies), followed by Africa and South America (4 studies). In both DL and HKSJ methods, the prevalence was highest in South America at 10.7% (95% CI 7.8–14.6) and 11.3% (95% CI 3.9–18.6), respectively, but the continent with the lowest prevalence was Asia with DL method (7.8%, 95% CI 7.8–14.6) and with the HKSJ method, it was North America (8.7%, 95% CI 7.3–10.5; Table 1 and Fig. 2).

Prevalence of FD according to survey method

We analyzed several survey methods by classifying them into two categories. These were the interview-type in which a trained interviewer participates in the survey in any way, and a non-interview type in which a questionnaire is prepared and submitted by the participant. For each Rome criterion used in the definition, the prevalence of the two types of survey methods was compared, but in the case of Rome I, there was no interview-type study, so a comparison could not be made. In Rome II, the interview-type showed a higher pooled prevalence (12.1% [95% CI 5.3–25.3] vs. 8.6% [95% CI 2.2–27.8], respectively) than the non-interview type, and showed a similar trend in the HKSJ method. However, in Rome III and Rome IV, which accounted for the majority of included studies, the interview-type tended to be significantly lower than the non-interview type (10.1% [95% CI 7.7–13.2] vs. 12.0% [95% CI 8.1–17.3], and 4.4% [95% CI 2.5–7.7] vs. 7.8% [95% CI 6.7–8.9]).

Prevalence of FD according to sex

The prevalence of FD was divided into male and female categories and analyzed by dividing the Rome criteria into Rome overall, Rome I, II, III, and IV (Table 3). The prevalence of women in all categories was significantly higher, and the largest difference was observed in Rome IV with an odds ratio for female to male of 2.68 (95% CI 2.31–3.12).

Table 3 Prevalence of functional dyspepsia for each of the Rome criteria by sex.

Study quality assessment

A total of 44 studies included in the analysis were evaluated for quality using the NOS scale. Except for one study that showed poor selection quality, all other studies showed good or fair selection and outcomes quality. In the comparability section, 81.8% (36/44) of the studies were of good or fair quality (Table S2). Twenty-six studies were evaluated as high-quality, and the remaining 18 studies were evaluated as medium-quality. None of the included studies was of low quality.

Discussion

Key findings

This systematic review and meta-analysis integrated data from 44 original articles and over 256,915 participants from 40 countries across six continents (Africa, Asia, Australia, Europe, North America, and South America), which reported the prevalence of FD in the community using all iterations of the Rome criteria from 1990 to 2020. We investigated whether the prevalence of FD could vary according to country, continent, and sex, even if the same definition was applied. The estimated global prevalence of FD was 8.4% (95% CI 7.4–9.5). This study also found differences in prevalence between developed and developing countries, with a higher prevalence in developing countries. The prevalence according to the Rome criteria definitions showed a change of less than 1.5% between Rome I, II, and III but showed a sharp decrease in the prevalence with the Rome IV criteria. Regardless of the definition used, we found that the prevalence of FD was significantly higher in women than that in men. Finally, according to study year, we found that the worldwide prevalence of FD showed gradual decreased.

Plausible mechanism

The prevalence of FD showed a gradual decrease by the study year, which can be attributed to the following reasons. First, there is a decrease in the prevalence of Helicobacter pylori (H. pylori) infection, which is assumed to be a cause of FD68,69,70. For various reasons, such as the prevention of gastric cancer, H. pylori eradication is widely practiced, and as a result, the prevalence of H. pylori infection worldwide is decreasing and this result was seen in a systematic review of global prevalence; hence, the prevalence of FD is also estimated to have decreased69,70,71. This claim is supported by the fact that H. pylori prevalence is high in developing countries and low in developed countries68. H. pylori infection without peptic ulcers or gastric cancer may be associated with FD, and there is substantial evidence that eradication improves symptoms in FD patients with H. pylori infection70.

Second, it could be because the diagnostic accuracy for structural disease has increased with the development of various technologies and the evaluation techniques for gastroesophageal non-structural organic causes, such as motility disorder72,73,74. The third explanation is a reduction in infectious gastroenteritis (IGE). It is a well-known fact that the incidence of FD increases after suffering from IGE75,76. It can be assumed that the decrease in the incidence of IGE along with the improvement of global sanitation contributed to the decrease in the prevalence of FD, and the difference in prevalence between developed and developing countries in our study supports this rationale77. Lastly, the Rome criteria for FD diagnosis has become increasingly stringent as classifications have become more diverse78. Especially, in Rome III, postprandial distress syndrome, one of the subtypes of FD, could be identified even if symptoms occurred just once a week, but in Rome IV criteria, the requirements were made more stringent to three times a week79. This is supported by the reduced prevalence using the Rome IV criteria compared to Rome I, II, and III.

When the odds ratio for prevalence by sex was analyzed, women consistently showed higher odds for every Rome criteria as well as overall prevalence, compared with men. This trend agrees with the other global prevalence studies of functional constipation (FC), irritable bowel syndrome (IBS), and other FGIDs using the Rome criteria80,81. This consistent trend indicates that differences between males and females due to various causes affect the disease prevalence. Visceral hypersensitivity is known to have a significant effect on sex differences in animal models; however, it has not been clearly identified in humans76. Other possible causes include gastric emptying and colon transit time, the degree of sensation for visceral pain due to structural differences in hormones or the brain, brain response to stress, ghrelin, heredity, low-grade inflammation of the duodenum, and food; however, a clear cause has not yet been identified76,82.

Strengths and limitations

This study has several limitations. First, few studies have been conducted in Africa or South America, and more studies have been conducted in developed countries than in developing countries. Second, there are more studies using Romes III and IV criteria than those using Romes I and II. Third, there was variability in the data collection methods, such as face-to-face or telephone interviews versus self-completed internet-based or paper questionnaires, which may have caused differences in ascertaining the prevalence of FD. In fact, when subgroup analysis was performed by dividing the survey types in this study, a difference in prevalence was observed despite the Rome criteria used for the definition being the same. Fourth, although an atypical population was excluded, some study populations were not pure general population. However, these studies had sufficient sample sizes and included patients of different ages. Fifth, because the subtype definition of FD was different in Rome I, II, and Rome III and IV, the prevalence was not investigated separately for each subtype. Sixth, demographic data such as sex, age, or socioeconomic status could not be obtained from all studies, and only sex was available for subgroup analysis83,84 Seventh, all the studies included in the analysis used a means to exclude organic causes and defined the participants as having FD accordingly, but few studies performed routine EGD, and most studies were health check-up population studies. In some studies, ultrasound or CT of the abdomen was performed in addition to EGD. Therefore, the prevalence of true FD in the study populations may be questionable85. However, it is practically impossible to accurately distinguish FD from organic dyspepsia by any process, and each country has a different approach to the investigation of dyspepsia1,86,87. Even if investigations, including EGD, were not conducted in the study, it cannot be said that participants were not received investigation. In addition, it is difficult to conduct research that routinely performs invasive tests such as EGD on the general population88. Therefore, in many studies, even if it was not a routine investigation, after using a method to exclude organic dyspepsia, FD is defined by the Rome criteria and the prevalence is investigated. This is also the case in the large-scale multinational study conducted by the Rome Foundation8,9. Lastly, there was considerable heterogeneity. Given that our analyses involved heterogeneity, it's important to note that some results may be subject to inaccuracy. Therefore, interpretation should be approached with caution, considering these aspects. To the best of our knowledge, our study is the first to investigate the global prevalence of FD, including the prevalence based on Rome I to IV criteria. And we performed an extensive literature search and sophisticated statistics, including the DL method, HKSJ method, and 95% PI, to reduce the error rate and obtain more accurate results.

Further considerations

As mentioned above, the decrease in the prevalence of FD can be assumed to be due to the development of various medical examination techniques and a decrease in the morbidity of H. pylori and IGE. However, the cause of FD remains unclear, as mentioned in the discussion on sex differences. Therefore, efforts to find a clear cause for FD are still needed. Reason analysis and institutional efforts are needed to reduce the frequency of FD in developing countries as much as in developed countries. In addition, despite global cross-sectional studies, prevalence studies in Africa and South America are still insufficient9. For an accurate comparison, additional epidemiological studies are needed in these countries in the future.

Although the same criteria were used in our study, there was a slight difference in prevalence according to the survey method, which is similar to studies on other FGIDs using the Rome criteria80,81. This fact suggests that although the meaning of the Rome criteria is sufficient as a global standard, the rate at which patients report symptoms may vary depending on the survey method; therefore, standardization of the survey method or a process for correcting the prevalence according to the survey method is needed. These aspects should be considered when revising the Rome criteria in the future.

Conclusions

This systematic review and meta-analysis determined the global prevalence of FD to range from 11.9%, as per the Rome I criteria, to 6.8% based on the Rome IV criteria spanning the years 1990 to 2020. Despite heterogeneity across studies, our analysis unveiled noteworthy insights into the global prevalence of FD. Notably, there were variations between developed and developing countries, with a higher prevalence observed in developing nations. Additionally, our findings consistently demonstrated a significantly higher prevalence of FD in women compared to men, irrespective of the diagnostic criteria employed. This study provides crucial estimates of the current global burden of FD, offering valuable insights for future healthcare planning and research priorities.