A Dementia mortality rates dataset in Italy (2012–2019)

Fania, Alessandro; Monaco, Alfonso; Amoroso, Nicola; Bellantuono, Loredana; Cazzolla Gatti, Roberto; Firza, Najada; Lacalamita, Antonio; Pantaleo, Ester; Tangaro, Sabina; Velichevskaya, Alena; Bellotti, Roberto

doi:10.1038/s41597-023-02461-z

Download PDF

Data Descriptor
Open access
Published: 25 August 2023

A Dementia mortality rates dataset in Italy (2012–2019)

Alessandro Fania^1,2^na1,
Alfonso Monaco ORCID: orcid.org/0000-0002-5968-8642^1,2^na1,
Nicola Amoroso^2,3,
Loredana Bellantuono^2,4,
Roberto Cazzolla Gatti⁵,
Najada Firza^6,7,
Antonio Lacalamita^1,2,
Ester Pantaleo^1,2,
Sabina Tangaro ORCID: orcid.org/0000-0002-1372-3916^2,8,
Alena Velichevskaya⁹ &
…
Roberto Bellotti^1,2

Scientific Data volume 10, Article number: 564 (2023) Cite this article

1580 Accesses
1 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Dementia is on the rise in the world population and has been defined by the World Health Organization as a global public health priority. In Italy, according to demographic projections, in 2051 there will be 280 elderly people for every 100 young people, with an increase in all age-related chronic diseases, including dementia. Currently the total number of patients with dementia is estimated to be over 1 million (mainly with Alzheimer’s disease (AD) and Parkinson’s disease (PD)). In-depth studies of the etiology and physiology of dementia are complicated due to the complexity of these diseases and their long duration. In this work we present a dataset on mortality rates (in the form of Standardized Mortality Ratios, SMR) for AD e PD in Italy at provincial level over a period of 8 years (2012–2019). Access to long-term, spatially detailed and ready-to-use data could favor both health monitoring and the research of new treatments and new drugs as well as innovative methodologies for early diagnosis of dementia.

Place of death trends among patients with dementia in Japan: a population-based observational study

Article Open access 27 December 2019

A nationwide trend analysis in the incidence and mortality of Creutzfeldt–Jakob disease in Japan between 2005 and 2014

Article Open access 23 September 2020

Insights from Turkey's big data: unraveling the preventability, pathogenesis, and risk management of Alzheimer's disease (AD)

Article Open access 12 March 2024

Background & Summary

The term Dementia defines a chronic, progressive syndrome affecting cognitive and functional abilities and memory¹. In the last years, the World Health Organization (WHO) classified dementia as a public health worldwide priority with about 55 million people affected by dementia and a cumulative incidence of about 1 new case diagnosed every 3 seconds^2,3.

WHO estimates that by 2050 the patients will exceed 139 millions. Alzheimer disease (AD) and Parkinson disease (PD) are the most common forms of dementia. Currently, there is no cure for these diseases but treatments therapeutic to relieve symptoms. Both AD and PD are fueled by aging populations and their incidences increase with age. Despite age being the strongest risk factor, most dementias, such as AD and PD, are currently idiopathic with no cause identified with certainty. This complex and multifactorial etiology of dementia leaded to examined several factors that could influence the evolution of the disease such as genetic and environmental causes (pollution, life-style, etc...).

Italy has one of the oldest populations in the World with an estimation of more than one million dementia cases⁴. According to the WHO data report published in 2020, dementia deaths in Italy represented the fifth causes (7.64% of total deaths) with AD and PD together that appeared the first cause of death among neurodegenerative diseases⁵. In recent years, the incidences of AD and PD have shown a constantly increasing average trend in Italy which can be interpreted by aging of the population, with the consequent increase in dementia. Currently in Italy there are about 600,000 AD patients and 300,000 PD patients⁶.

Modern healthcare research on multifactorial diseases such as PD and AD and dementia in general can benefit from today’s vast availability of data. Databases that include regularly collected population-based outcomes and exposure data and clinical and biological information on individuals can aid research also through the use of modern data analysis techniques such as machine and deep learning.

In this work we present an eight-year (2012–2019) dataset on AD and PD mortality rates in the form of Standardized Mortality Ratios (SMR)⁷ for AD and PD in Italy at provincial scale. SMR is an useful index to compare mortality rates of different populations compared to a standard. Figure 1 shows the distribution of SMR averaged over the considered time window (Panel a for AD, and panel b for PD).

The goal of this dataset is to provide an inclusive, readily accessible, and comprehensive source of information about the current situation of dementia in Italy to local and national stakeholders, policymakers, and researchers. Additionally, it aims to offer researchers with readily available data to conduct specific research. The SMR dataset can be found on the Dryad public data repository and all the source data⁸, additional information, and R codes necessary to build the dataset are available on Zenodo⁹.

Methods

Data source

Raw data used for the computation of SMR were collected on the Italian National Institute of Statistic website (ISTAT, http://www.istat.it/en/, last access: 26/01/2023) for a time window of 8 years (2012–2019). ISTAT is a public organization which provides statistical information about Italian territory and population. The list of variables used for computing the SMR dataset is presented in Table 1. In the following sections we detailed the single variables used for the SMR calculation.

Table 1 List of variables, with respective definitions and data sources, used to compute Standardized Mortality Ratio.

Full size table

Age-specific number of deaths by cause of reference population (Mi)

ISTAT provides the number of death due to a specific cause, grouped by age (M_i). Every year, ISTAT aggregates the information coming from the registry offices, through surveys, compiled by the Civil State and physicians or necroscopes. Data are then provided on the ISTAT data warehouse, where is publicly downloadable. Specifically, on the ISTAT website, following the path Health statistics, Causes of death, Mortality by territory of residence, Cause and age, it is possible to create a data table with the required information, such as age, territory gender, causes of death, and year. In particular we considered the total number of deaths at national level by causes divided in the following 20 age groups: 0–4, 5–9, 10–14, 15–19, 20–24, 25–29, 30–34, 35–39, 40–44, 45–49, 50–54, 55–59, 60–64, 65–69, 70–74, 75–79, 80–84, 85–89, 90–94, over 95 years.

Observed number of deaths (Om)

Mortality data are found on the ISTAT website at the path Health statistics, Causes of death, Mortality by territory of residence, Cause-prov. We selected number of deaths for Alzheimer and Parkinson diseases at provincial level (O_m). The medical information contained in individual death certificates is encoded according to the World Health Organization’s (WHO) ICD-10 (International Statistical Classification of Diseases and Related Health Problems).

Resident population at provincial (ni) and national levels (Ni)

The total number of resident population at provincial level (n_i) was collected on the ISTAT data warehouse from the following path: Population and households, Intercensuary population, Reconstructed resident population -Years 2001–2019, Italy-regions-provinces. The Italian population data are estimated by ISTAT through population censuses of 2001, 2011 and 2018. The reconstruction process includes also demographic flows (births, deaths, migrations, acquisitions of citizenship). In addition to being grouped by province, the data is grouped by age, with a step of 1 year, from 0 to over 100 years. Through an aggregation process, we computed directly the national values (N_i).

Computation of standardized mortality ratios

In this work we computed the Standardized Mortality Ratios (SMR) for AD and PD in Italy at provincial scale. The Standardized Mortality Ratio represents the ratio between the number of deaths actually observed and the number of deaths expected, i.e. the cases that would have occurred if the study population had the same mortality as the reference population considering the different distribution by age. In other words, the SMR shows the surplus or shortage of deaths among various populations after accounting for the impacts caused by differences in age composition. The crude death rate reflects the actual situation of a total population, without considering age differences within the population. As a result, populations with a higher proportion of elderly individuals tend to exhibit higher death rates. Consequently, when analyzing trends in death rates over time, accounting for the population’s age distribution reduces the age bias. In fact, age plays a significant role in most risk factors, particularly mortality due to neurodegenerative diseases. Differences in the population structure of the geographical units of study can lead to misinterpretation of increased risk as a result of comparing unstandardized rates, especially when dealing with older populations on average. To address this, it can be used an age standardized ratio. Standardization aims to eliminate the impact of varying age distributions in populations under comparison.

The ISTAT provides the Standardized Mortality Rates only at the regional level (i.e., only for the 21 Italian regions and not for the 107 provinces). This index is quite similar to the SMR, although it differs in that it shows the standardized rate, which is the number of deaths per 10,000 inhabitants in relation to a specific reference population, rather than a ratio as in the SMR.

The expression of SMR is as follows⁷:

$$SMR=\frac{{O}_{m}}{{E}_{m}}$$

(1)

where O_m and E_m are the observed and the expected number of deaths by cause respectively. E_m has the following definition:

$${E}_{m}=\mathop{\sum }\limits_{i=1}^{I}{R}_{i}^{M}\ast {n}_{i}$$

(2)

in which ${R}_{i}^{M}$ is the age-specific death rates of the reference population and n_i is the age-specific population size for the given locality. In particular ${R}_{i}^{M}$ is defined:

$${R}_{i}^{M}=\frac{{M}_{i}}{{N}_{i}}$$

(3)

${R}_{i}^{M}$ is obtained dividing the number of deaths by age and cause of the reference population M_i with the age-specific reference population size N_i. SMR values greater or less than 1 indicate a risk, respectively, higher or lower than that observed in the reference population. In other words, SMR values greater than 1 show a higher mortality than the Italian national one; lower values indicate a lower mortality level than the Italian national mortality. As a result SMR is an useful index to compare mortality rates of different populations compared to a standard.

Data Record

Our dataset of Italian AD and PD mortality ratios is available for download on Dryad⁸. Specifically, the dataset includes the SMR data for AD and PD for the time window 2012–2019 at provincial level. The root folder in Dryad (“DATA”) contains 2 folders (named “AD” and “PD”) with 4 sub-folders inside each: “SMR”, “Mi”, “Ni”, “Om”. The first sub-folder includes data on SMR, while the second, the third and the fourth ones includes data on the number of death by cause, the observed number of deaths and the number of resident population at provincial level respectively. Data are provided for single province in Comma Separated Values (CSV) files. The format of CSV file for the SMR computation is the following: the first column reports the statistical code of each province; the second column contains the name of each province; the columns between the 3th and the 11th hold the value of SMR for the years 2012–2019; the columns among the 12th and 15th report four statistical calculation of SMR (mean, standard deviation, max and min values respectively). A “readme.txt” file is present in each SMR sub-folders with an explanation of the data structure.

On Zenodo⁹ is available the code in R language to reproduce the SMR computation starting from the raw data. In the main folder “Code” there are two sub-folders named “AD” and “PD” containing the scripts used for the SMR computation.

Technical Validation

In order to validate our computation of SMR, we compared it with the Standardized Mortality Rates at the regional level provided by ISTAT (SMR_ISTAT). To do this check we aggregated our provincial values of SMR to the upper level. We expected a good correlation between SMR_ISTAT and SMR values at regional level due to the common information they contain. Results of this analysis, performed each year, are showed in Figs. 2, 3 for AD and PD respectively, and reported in Table 2 in terms of Pearson correlation¹⁰ and the coefficient of determination (R²). The correlation between the two indices is quite strong for AD with an average value (over the considered time window) of Pearson correlation of 0.822. We found lower correlation values for PD with a mean Pearson coefficient of 0.743. In particular, the years 2015 and 2018 show the worst agreement between SMR_ISTAT and SMR.

Table 2 Table of performance metrics computed for each considered year.

Full size table

Data shuffling at regional level

To verify the robustness of the results reported in the previous section we applied a data shuffling procedure both on our SMR values and SMR_ISTAT at regional level. In this procedure the SMR data sets (our computation and ISTAT) for AD and PD are randomly re-sampled 100 times without replacement for each year. Then the two statistical indicators (Pearson correlation and R²) are evaluated considering these new samples. In this way we compared the agreement between SMR and SMR_ISTAT reported in the previous section and the worst case performance. In other words, we assessed how far our results are statistically from the random distribution. Our findings, reported in Table 2 present differences statistically significant with the worst case results (Table 3).

Table 3 Table of performance metrics after the shuffling procedure on the SMR and SMR_ISTAT values for each considered year.

Full size table

Slope analysis

After the computation of SMR, we investigated the trends of these values over the time period considered for each Italian province. In fact, using the eight years available, we carried out a linear regression from which we considered the gradient of the line. The results obtained, respectively for Alzheimer’s and Parkinson’s are shown in the panel a and b of Fig. 4. Computing the average value of the slopes over all provinces, we found slightly positve trends for AD (0.003) and PD (0.002). However, these values are very variable, with standard deviations of 0.035 and 0.030 for AD and PD respectively, highlighting a strong dependence on the SMR value of the single province. There are 56 provinces that present a positive trend for AD and 57 for PD. Eight values are too few to obtain a very robust estimation of the slope, but a future addition of data will provide a more precise assessment of the trend of AD and PD mortality over the time for the Italian territory.

Usage Notes

This paper presents a dataset on mortality rates (in the form of Standardized Mortality Ratios, SMR) for AD e PD in Italy at provincial level between 2012 and 2019. The dataset is open to public use without limitation. The permanent storage is at⁸

Code availability

Data used for the computation of SMR for AD and PD at Italian provincial level are available from ISTAT (see the paragraph Data Source). We implemented the procedure described in the Methods section. Data processing was performed in R 4.2.2¹¹ and the used algorithms is available on Zenodo⁹.

References

WHO. Mental health of older adult. figshare https://www.who.int/en/news-room/fact-sheets/detail/mental-health-of-older-adults (2017).
WHO. Dementia. https://www.who.int/news-room/fact-sheets/detail/dementia (2022).
Gervasi, G. et al. Integrated care pathways on dementia in italy: a survey testing the compliance with a national guidance. Neurol Sci. 41, 917–924, https://doi.org/10.1007/s10072-019-04184-9 (2020).
Article PubMed Google Scholar
Canevelli, M. et al. A national survey of centers for cognitive disorders and dementias in italy. J Alzheimers Dis 84, 1849–1857, https://doi.org/10.3233/JAD-210634 (2021).
Article Google Scholar
WHO. World health statistics 2020: monitoring health for the sdgs, sustainable development goals. https://www.who.int/publications/i/item/9789240005105 (2020).
MASAL. Dati epidemiologici. https://www.salute.gov.it/portale/demenze/dettaglioContenutiDemenze.jsp?lingua=italiano&id=2402&area=demenze menu=vuoto#:~:text=Attualmente%20il%20numero%20totale%20dei,sul%20piano%20economico%20e%20organizzativo (2022).
Cazzolla Gatti, R. et al. A ten-year (2009–2018) database of cancer mortality rates in italy. Sci Data 9 (2022).
Fania, A. et al. A Dementia mortality rates dataset in Italy (2012–2019). Dryad, Dataset. https://doi.org/10.5061/dryad.18931zd2m (2023).
Fania, A. et al. A Dementia mortality rates dataset in Italy (2012–2019). Zenodo. https://doi.org/10.5281/zenodo.7802438 (2023).
Freedman, D., Pisani, R. & Purves, R. Statistics (international student edition) (WW Norton Company, New York, 2007).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.r-project.org/ (2018).
ISTAT data warehouse. http://dati.istat.it/Index.aspx?lang=en&SubSessionId=b0d7addd-af05-470b-9983-ef407a697769.

Download references

Acknowledgements

We are thankful to the Italian Statistical Institute (ISTAT) for providing the raw data on mortality and demography.

Author information

These authors contributed equally: Alessandro Fania, Alfonso Monaco.

Authors and Affiliations

Dipartimento Interateneo di Fisica M. Merlin, Universitá degli Studi di Bari Aldo Moro, Via G. Amendola 173, Bari, 70125, Italy
Alessandro Fania, Alfonso Monaco, Antonio Lacalamita, Ester Pantaleo & Roberto Bellotti
Sezione di Bari, Istituto Nazionale di Fisica Nucleare (INFN), Via A. Orabona 4, Bari, 70125, Italy
Alessandro Fania, Alfonso Monaco, Nicola Amoroso, Loredana Bellantuono, Antonio Lacalamita, Ester Pantaleo, Sabina Tangaro & Roberto Bellotti
Dipartimento di Farmacia - Scienze del Farmaco, Universitá degli Studi di Bari Aldo Moro, Via A. Orabona 4, Bari, 70125, Italy
Nicola Amoroso
Dipartimento di Biomedicina Traslazionale e Neuroscienze (DiBraiN), Universitá degli Studi di Bari Aldo Moro, Piazza G. Cesare 11, Bari, 70124, Italy
Loredana Bellantuono
Department of Biological Sciences, Geological and Environmental (BiGeA), Alma Mater Studiorum – University of Bologna, Piazza Porta S. Donato 1, Bologna, 40126, Italy
Roberto Cazzolla Gatti
Dipartimento di Economia e Finanza, Universitá degli Studi di Bari Aldo Moro, Largo Abbazia S. Scolastica, Bari, 70124, Italy
Najada Firza
Catholic University Our Lady of Good Counsel, Rr. Dritan Hoxha 123, Laprake, Tirana, 1031, Albania
Najada Firza
Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Universitá degli Studi di Bari Aldo Moro, Via G. Amendola 165/a, Bari, 70126, Italy
Sabina Tangaro
Biological Institute, Tomsk State University, Lenin Ave., 36, Tomsk, 634050, Russia
Alena Velichevskaya

Authors

Alessandro Fania
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Monaco
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Amoroso
View author publications
You can also search for this author in PubMed Google Scholar
Loredana Bellantuono
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Cazzolla Gatti
View author publications
You can also search for this author in PubMed Google Scholar
Najada Firza
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Lacalamita
View author publications
You can also search for this author in PubMed Google Scholar
Ester Pantaleo
View author publications
You can also search for this author in PubMed Google Scholar
Sabina Tangaro
View author publications
You can also search for this author in PubMed Google Scholar
Alena Velichevskaya
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Bellotti
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization A.F., A.M.; Methodology A.F., A.M., N.A., N.F.; Formal analysis A.F., A.M.; Writing (Original Draft) A.F., A.M.; Writing (Review Editing) A.F., A.M., N.A., L.B., R.C.G., N.F., A.L., E.P., S.T., A.V., R.B.; Data Curation A.F.; Software A.F.; Visualization A.F., A.M.; Validation A.M., N.A., R.C.G.

Corresponding author

Correspondence to Alfonso Monaco.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fania, A., Monaco, A., Amoroso, N. et al. A Dementia mortality rates dataset in Italy (2012–2019). Sci Data 10, 564 (2023). https://doi.org/10.1038/s41597-023-02461-z

Download citation

Received: 21 April 2023
Accepted: 09 August 2023
Published: 25 August 2023
DOI: https://doi.org/10.1038/s41597-023-02461-z

This article is cited by

Machine learning and XAI approaches highlight the strong connection between $O_3$ and $NO_2$ pollutants and Alzheimer’s disease
- Alessandro Fania
- Alfonso Monaco
- Roberto Bellotti
Scientific Reports (2024)