A ten-year (2009–2018) database of cancer mortality rates in Italy

Cazzolla Gatti, Roberto; Di Paola, Arianna; Monaco, Alfonso; Velichevskaya, Alena; Amoroso, Nicola; Bellotti, Roberto

doi:10.1038/s41597-022-01729-0

Download PDF

Data Descriptor
Open access
Published: 21 October 2022

A ten-year (2009–2018) database of cancer mortality rates in Italy

Roberto Cazzolla Gatti¹^na1,
Arianna Di Paola²^na1,
Alfonso Monaco ORCID: orcid.org/0000-0002-5968-8642^3,4,
Alena Velichevskaya⁵,
Nicola Amoroso^4,6^na2 &
…
Roberto Bellotti^3,4^na2

Scientific Data volume 9, Article number: 638 (2022) Cite this article

2972 Accesses
4 Citations
14 Altmetric
Metrics details

Subjects

Cancer

Abstract

In Italy, approximately 400.000 new cases of malignant tumors are recorded every year. The average of annual deaths caused by tumors, according to the Italian Cancer Registers, is about 3.5 deaths and about 2.5 per 1,000 men and women respectively, for a total of about 3 deaths every 1,000 people. Long-term (at least a decade) and spatially detailed data (up to the municipality scale) are neither easily accessible nor fully available for public consultation by the citizens, scientists, research groups, and associations. Therefore, here we present a ten-year (2009–2018) database on cancer mortality rates (in the form of Standardized Mortality Ratios, SMR) for 23 cancer macro-types in Italy on municipal, provincial, and regional scales. We aim to make easily accessible a comprehensive, ready-to-use, and openly accessible source of data on the most updated status of cancer mortality in Italy for local and national stakeholders, researchers, and policymakers and to provide researchers with ready-to-use data to perform specific studies.

Measurement(s)	Standardized Mortality Ratio
Technology Type(s)	Public database

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

Austin D. Reed, Sara Pensa, … Walid T. Khaled

Best practices for single-cell analysis across modalities

Article 31 March 2023

Lukas Heumos, Anna C. Schaar, … Fabian J. Theis

An atlas of epithelial cell states and plasticity in lung adenocarcinoma

Article Open access 28 February 2024

Guangchun Han, Ansam Sinjab, … Humam Kadara

Background & Summary

The prevailing theory, formulated around the 1950s, considers cancer as a set of about 200 diseases characterized by abnormal cell growth, escaping the normal control mechanisms of the organism¹. The process of transformation of a normal cell into a neoplastic cell occurs through various stages with the accumulation of genetic, functional, and morphological anomalies².

The most known causes of DNA alterations in the genesis of cancer include environmental pollution, genetic alteration, infections, and unhealthy lifestyles such as tobacco and alcohol over-consumption^3,4. However, in some cases, no specific causes can be still attributed to neoplastic cell formations, In Italy, approximately 400,000 new cases of malignant tumors are recorded every year, of which 200,000 in men and 180,000 in women. Overall, every day, about 1,000 Italian citizens receive a new malignant cancer diagnosis⁵.

Excluding skin cancers (non-melanoma), prostate cancer prevails in men which accounts for ~20% of all diagnosed cancers; follow by the tumor of the lung (15%), colorectal (14%), bladder (12%), and stomach cancer (4%). Breast cancer accounts for ~30% of women’s cancers, followed by colorectal (12%), lung (12%), thyroid (5%), and uterus (5%)⁶.

As in many other industrialized areas, in Italy cancers are the second cause of death (~30% of all deaths), after cardiovascular diseases (37%). In men, cancers and cardio-circulatory diseases cause approximately the same number of deaths (~35%) while in women the cardio-circulatory diseases are more relevant than tumors (40% vs 25%)⁵. Therefore, the probability of dying from cancer in Italy is approximately 1 out of 3 for men and 1 out of 4 for women⁵.

The frequency of deaths caused by tumors in the Italian areas covered by the Cancer Registers is, on an annual average, about 3.5 deaths per 1,000 men and about 2.5 per 1,000 women, for a total of about 3 deaths every 1,000 people⁷. These data, if scaled on a daily average, suggest that every day about 500 people die in Italy because of a tumor. Nevertheless, during the last 40 years, Italians’ life expectancy increased by about 10 years in both sexes⁸. Moreover, if in the 1950s the Italian population was mainly made up of children and very few elderly people, in 2050 the forecast is a population consisting largely of elderly people and few children. This entails an increased risk of developing cancer in a population older and affected by comorbidities⁸.

Furthermore, there exists a heterogeneous distribution in Italy for the main epidemiological indicators of cancer (incidence, survival, mortality, and prevalence), with a North-South gradient for most tumor types⁷. In fact, the standardized incidence rate (in the European population) for all cancers among men is ~5% lower in the Center and ~15% in the South and Islands compared to North and for women by ~5% and ~17%, respectively. Underlying these differences may be protective factors (different lifestyles, food habits, reproductive factors) that persist in the regions of Central and South/Islands, but mainly a lower exposure to carcinogenic factors (in particular, environmental pollution; see for a recent report⁹).

Overall, survival has recently increased by 54% in 2005–2009 against 51% in 2000–2004, 46% in 1995–1999, and 39% in 1990–1994 for men and by 63% against 60%, 58%, and 55% in the corresponding periods for women¹⁰. Specifically, there is an improvement in survival for some of the most frequent tumor sites: colorectal (currently 65% for both sexes), female breast (87%), and prostate (91%). However, for some poor prognosis cancers, survival improvements have been limited in recent years, as in the case of lung cancer, pancreas, and gallbladder⁷.

Nonetheless, it is worth noting that, the global cancer mortality rate has not always and not everywhere significantly decreased¹¹, despite almost a century of advanced research to find cures for tumors and improve survival.

For instance, during the last years, in Italy, mortality has decreased significantly in the whole country except in the South and Islands, where the rates are substantially unchanged⁷.

Although several indicators and reports are developed every year to monitor the cancer situation in Italy, more recent, long-term (at least a decade) and spatially detailed data (up to the municipality scale) are neither easily accessible nor available for public consultation by the citizens, scientists, research groups and associations. Here we present a ten-year (2009–2018) database on cancer mortality rates (in the form of Standardized Mortality Ratios; hereafter SMR) for all macro-types of cancers in Italy at municipal, provincial, and regional scales (Fig. 1). This dataset aims to make available a comprehensive, ready-to-use, and openly accessible source of data on the most updated status of cancer mortality in Italy for local and national stakeholders, researchers, and policymakers and to provide researchers with ready-to-use data to perform specific studies.

The SMR dataset¹² is available on the Dryad public data repository for open access. Source data, supplementary information, and Python codes to build the dataset are available on Zenodo^13,14.

Methods

Data source

The list of variables required for building the SMR database is reported in Table 1. The study period encompasses ten years: 2009–2018 and 23 macro-categories of cancer types listed in Table 2. Raw data, except for the observed number of deaths by cause at the municipal level, were retrieved from the public data warehouse of the Italian National Institute of Statistics (ISTAT, http://www.istat.it/en/, last access: 26/01/2022), a public organization producing the official statistics in Italy. The observed number of deaths by cause at the municipal level was shared by the ISTAT upon request.

Table 1 List of symbols, definitions, and data used to compute the SMR for different cancer types and territorial levels of aggregation.

Full size table

Table 2 IDC-10 denominations and codes.

Full size table

Details on single variables are provided in the following subparagraphs. Source data supporting the computation of the SMR for different cancer types and levels of aggregation were uploaded to the SMR Database.

Age-specific number of deaths by cause of reference population (M _i)

ISTAT provides the deaths by age and causes occurring in Italy by aggregating information from the attending physician, registrars, and necropsies, with annual updates. M_i is available on the I.Stat data warehouse (http://dati.istat.it/?lang=en, last access 22/11/2021) following the path: Health statistics, Causes of death, Cause and age. From an interactive window on the I.Stat portal it is possible to customize the request of data, selecting the territory (from provinces to national level), the age or age-group (arranged by 5 years), gender, causes of death, and year. To our purposes, we selected the total number of deaths at national level by causes listed in Table 2 grouped into 20 age-groups of 5 years each whose intervals are: 0–4, 5–9, 10–14, 15–19, 20–24, 25–29, 30–34, 35–39, 40–44, 45–49, 50–54, 55–59, 60–64, 65–69, 70–74, 75–79, 80–84, 85–89, 90–94, over 95 years.

The age-specific census data on population at municipal (n _i) and national level (N _i)

ISTAT estimates the resident population by age based on population censuses that occurred in 2018, 2011, and 2001. The last update of the census data was released in March 2021. The total number of resident population at the municipal level was retrieved on the I.Stat data warehouse following the path: Population and Households, Inter Censuses Population, Estimated Resident Population for years 2002–2019. The population size at the upper levels (i.e., provincial, regional, and national) was determined by aggregation.

At the time this document was written, Italy counted 7093 municipalities distributed over 110 provinces. However, since the number of municipalities and provinces had changed frequently in the last decades, the number of existing municipalities may change over the years. Indeed, some municipalities no longer exist (since absorbed by other municipalities), while others changed denominations. Moreover, few municipalities have missing data. The estimates of annual SMR considered all these variations (see “Data Processing”).

Deaths on the resident population by cause (°m)

Data on mortality at the municipal level are available only upon request to the ISTAT cont@ct center (https://contact.istat.it/index.php?Lingua=Inglese, last access:22/11/2021). For privacy reasons, data on mortality at the municipal level omit the age distributions (from personal communication). Moreover, for the same reason, if the frequency of deaths by municipality and sex is less than 3, ISTAT obscures the cause of death; Furthermore, in a few cases, the municipality might not be indicated due to errors in the death form compiled by doctors or for a lack of understanding.

Data on mortality provided by ISTAT are encoded according to the International Classification of Diseases and Related Health Problems (ICD-10), an international disease classification system defined by the World Health Organization (WHO)¹⁵ and includes all the causes of death as reported in¹⁶. Table 2 provides the IDC-10 definitions and codes for cancer’s causes of death used for the computation of the Italian SMR Database.

Ancillary datasets

Several ancillary datasets were also included in the analysis: i) the list of statistical codes and denominations of administrative units (i.e. municipalities, provinces, and regions) adopted by the ISTAT (available at https://www.istat.it/it/archivio/6789, last access: 20/10/2021, last release 2019); ii) the list of statistical codes and denominations of municipalities that have been abolished since 2009. Such a list provides both the old statistical codes and denominations of the abolished municipalities and the current ones; iii) The shapefile of administrative units available at https://www.istat.it/it/archivio/222527, last access 22/11/2021) to map the results at municipal- and regional-level scale.

Data Processing

Computation of SMR

As most causes of death vary significantly with people’s age and sex, data on mortality are commonly analyzed through a standardized index to improve comparability over time and between areas.

A versatile index for neutralizing the effects of age structure is the Standardized Mortality Ratio (SMR)^17,18. The SMR expresses the real differences in disease frequency of a study cohort compared to the general population (i.e. Reference Population).

Henceforth, the whole Italian population would be used as the reference population and capital letters will be used for all the reference variables, namely the reference population size and mortality, whereas lowercase letters will refer to variables at the local scale (i.e. municipal, provincial or regional scale). For example, the variables expressing the number of deaths for the reference and municipal population are “M” and “m”, respectively. Moreover, since the SMR is a weighted average of the age-specific mortality rates (see below), where each weight accounts for the ratio of people within an age group compared to the reference population, a subscript “i” is used to indicate a given age-group, where i = 1,2, …, I with I = 20 being the number of the age-groups of 5 years each.

The procedure presented here to estimate the SMR can be used for either municipal, provincial or regional levels. The estimation of Italian SMR at municipal and provincial levels represents a novel ready-to-use database, while that for the regional level is used for the technical validation.

For a given locality, year, and cause of death, the SMR is the ratio between the observed number of deaths (°m) and the number of expected deaths (^Em):

$$SMR=\frac{{}^{O}m}{{}^{E}m}$$

(1)

Where °m should be an available observational data and ^Em is estimated as the weighted sum of age-specific population size for the given locality (n_i) per age-specific death rates of the reference population (^MR_i):

$${}^{E}m={\sum }_{i=1}^{I}{}^{M}{R}_{i}\times {n}_{i}$$

(2)

^MR_i could be provided by a public health organization or be estimated as the ratio between the age-specific number of deaths of the reference population (M_i) to the age-specific reference population size (N_i):

$${}^{M}{R}_{i}={M}_{i}\times {N}_{i}$$

(3)

Thus, the value of ^Em is weighted by the age distribution of deaths and population size.

SMR assumes value 1 when the number of observed and expected deaths are equal. Hence, if the incidence of a given cause of death was equally distributed over the entire reference population, the score of SMR for a given locality and year would approximate 1.

In real life, the SMR is commonly different from 1 since the incidence of a given cause of death could be strongly affected by some local environmental and/or socio-economic factor. For a given locality and/or year, the more the value of SMR is greater than 1, the more the mortality incidence compared to the expected one (i.e. excess of deaths), while the more the value of SMR is lower than 1, the lower is the mortality incidence (i.e. defect of deaths). Therefore, for any given locality showing an excess of deaths beyond those expected (i.e., SMR >1), there must be another one with a defect of expected deaths (i.e., SMR <1). Overall, the distribution of SMR across the whole reference population is centered around 1.

Following Eqs. (1–3), the SMR was computed for each year of the period 2009–2018 and for a single cause of death listed in Table 2 by using the data listed in Table 1 at three different levels of aggregation: municipal, provincial (equivalent to the European classification NUTS 3) and regional (i.e., NUTS2). The SMR was also computed for the broad category of malignant tumors (i.e. C00-C979, hereinafter cancer macro-type C), and the broad category of malignant tumors plus non-malignant ones (i.e. C00-C979 plus D0-D489, hereinafter cancer macro-type CD).

At the time of writing this paper, Italy counts 7093 municipalities. Hence, to aid data comparison both on a spatial and temporal scale, the SMR of single years refers to the currently existing 7093 municipalities whose list of denominations and codes (from ISTAT) are also included in the SMR Database. Data on both mortality and population size from no longer existing municipalities were aggregated (summed) into the municipality to which they currently belong.

The same reasoning applied to the provinces and regions: the SMR at the upper levels of aggregations was estimated for the current provinces (107 units) and regions (20 units) by aggregating municipal data on mortality (°m) and population size by age (n_i).

Along with the SMR values for single years, we added the time-series average and related 90% and 95% lower confidence levels (when at least three years of real value exist) as additional ready-to-be-used data. Indeed, many epidemiologic studies suggest adopting cautionary lower confidence levels for statistical elaboration and/or descriptive statistics of SMR^19,20,21.

A lower 90% and 95% confidence interval were computed according to the Byar method¹⁷. However, since the SMR Database provides the SMR values for single years, a user can decide to calculate the confidence limits differently or with different confidence thresholds.

According to the Byar method, the approximate lower (α_low) and upper (α_up) limits for a specified confidence level (α), are:

$${\alpha }_{low}{=}^{O}m{\left(1-\frac{1}{{9}^{O}m}-\frac{{z}_{\alpha /2}}{{3}^{O}{m}^{1/2}}\right)}^{3}$$

(4)

and

$${\alpha }_{up}=({}^{O}m+1){\left(1-\frac{1}{9({}^{O}m+1<?RemoveMO1?>)}+\frac{{Z}_{\alpha /2}}{3{({}^{O}m+1)}^{1/2}}\right)}^{3}$$

(5)

with z_α/2 is equivalent to the 100(1-α) percentile of the standard normal distribution N(0,1), with (1-α) equal to the desired confidence level. Accordingly, Z = 1.64 and 1.96 for 90% and 95% confidence level, respectively.

Knowing the lower limit for a 100(1-α)% confidence level, the lower confidence level of SMR is given by

$$SM{R}_{low}=\frac{{\alpha }_{low}}{{}^{E}m}$$

(6)

The resulting SMRs are presented on the outline of a relational database where the municipal statistical code works as a key feature. Data are presented as a Comma Separated Value file (CSV) of 7904 rows (i.e. 7093 municipalities) per 14 columns (i.e., administrative statistical code, SMR for single years embracing the period 2009–2018, mean, and lower 90% and 95% confidence levels).

Missing data and exceptions

Before computing the SMR, two major issues were addressed: missing data and available data from abolished municipalities. These issues were both present in mortality data at the municipal level. We assumed that a municipality should record at least one death per year for whatever cause among those included in the data source. If such a record exists, even for causes other than cancer, then the data on causes listed in Table 2 is considered present yet equal to zero (i.e., zero cancer deaths). Conversely, if a municipality does not have any mortality records for a given year, then data for such a municipality and year is considered missing.

Data on deaths from abolished municipalities, when available, had been used for the computation of the yearly SMR. To this end, data on deaths from abolished municipalities were aggregated to the current belonging municipality through a sum. Overall, municipalities with missing data on deaths range between 4.4% (in 2009) and 5.5% (in 2014) compared to the total of currently existing 7903 municipalities. The number of abolished municipalities has progressively decreased over the study period from 313 in 2009 to 79 in 2018.

The age-specific census data on population had only 6 municipalities with missing data and the list of municipalities embraces only those currently present. This means that the aggregation of census data from abolished municipalities has already been done by the ISTAT.

Overall, when for a given year a municipality has missing data on mortality or resident population, the SMR value is not calculated and marked as “nan” (i.e., not-a-number).

Census and deaths data on reference populations had no missing value. Missing data for a given municipality and year return a missing value on the SMR for that municipality, year, and all the cancer types.

Data Record

The ten-year (2009–2018) database of Italian cancer mortality rates is available for download on Dryad¹². Specifically, the database contains the SMR data for the period 2009–2018 by cause listed in Table 2 at three levels of aggregation: municipal, provincial and regional; On Zenodo^13,14 are also available ii) the figure maps of average SMR for single cancer types and levels of aggregation; iii) the scripts in Python language to reproduce the elaboration along with the raw source data.

Within the root folder in Dryad (“DATA”) there are three main sub-folders: “SMR”, “Observed mortality”, and “Expected mortality”. The first one holds data on SMR, while the second and third ones hold data on the observed and expected number of deaths (i.e., ^Om and ^Em), respectively.

Data are provided for each level of aggregation (i.e., municipal, provincial, and regional) in specific sub-folders, hence in each of the three main folders, there are other three sub-folder. The sub-folders in “SMR”, in turn, contain the computation of ten-year SMR by cause as listed in Table 2 plus that for CD for single administrative units in Comma Separated Values (CSV) files. The format of CSV files is always the same across the levels of aggregation: in the rows are the single administrative units (i.e. municipalities or provinces or regions); the columns report the statistical code of the administrative units (first column), the value of SMR for the years 2009–2018 (2th-11th columns), the ten-years average (12th column), the 90% and 95% lower confidence levels (13-14th column) estimated according to the Byars method. A “readme.txt” file is present in each SMR sub-folders to easily access and understand the data. An example of the SMR records in a CSV file is given in Table 3.

Table 3 Example of SMR data presented in a CSV file (cause C, municipal level, available in the dataset under the path “SMR\SMR_municipal_level\SMRc.txt”.

Full size table

Similarly, within the main folder of ref. N2 (“Figure maps”, provided as additional information) on Zenodo there are three sub-folders, one for each level of aggregation, holding the maps of average SMR for single cancer type as listed in Table 2. Lastly, in the main folder of ref. N3 (“scripts”, provided as additional information) there are the source data used for the computation of SMR, the scripts used to produce all the elaborations, the shapefiles of administrative boundaries, and some intermediate outputs saved as pickle file, namely a homologous Python module extent (“pkl”) that can be reloaded to produce figures and further analysis in the python language by end-users. The Python script files are denoted through leading ordinal numbers that reflect the order of execution to reproduce the results.

Technical Validation

To ascertain that the estimation of SMR was as accurate as possible, two checks were made. First, since the SMR refers to a reference population, data on mortality at the municipal and national levels must be consistent. This means that the sum of deaths by cause for a given year among municipalities should be equal to the national number of deaths for the same cause and year. However, due to a few cases where data are omitted or not available, the total deaths by cause and year at the municipal level can be fewer than those at the national one. The higher the difference, the more the resulting SMR is biased. Moreover, such a proof of consistency allows us to also confirm that the procedure and the encoding for the selection of specific IDC-10 causes were properly implemented.

Table 4 shows the differences between deaths provided at the national level and those obtained from the sum of deaths by year and cause at the municipal level. Values in Table 4 are expressed as percentages in relation to the national score. In most cases, the differences are less than 0.5% with the only exception for death causes C16.1-C17 (i.e., malignant stomach tumors) where the differences range between 1–2%. Such low percentages of discrepancies between the national and municipal data are completely consistent with what is reported by ISTAT (i.e. few cases where municipalities are omitted or cause of deaths blacked out) and widely acceptable in terms of SMR precision which would suffer a small underestimation (0–2%).

Table 4 Difference between deaths at the national level and those obtained from the sum of municipalities by year and cause of death.

Full size table

The second check consisted of the comparison of the SMR with the Standardized Mortality Rates provided by the ISTAT (SMR_istat) at the regional level. Such an index contains information very similar to the SMR here described, albeit different in absolute terms since it expresses the standardized rate, namely the number of deaths per 100,000 inhabitants compared to a reference population, rather than a ratio as in our case (Eq. 1).

Hence, the values of SMR at the regional level must be strongly correlated to the SMR_ISTAT albeit much different in absolute values.

Figure 2 shows the scatter plots between SMR and SRM_ISTAT and related coefficients of Pearson correlation by causes listed in Table 2. The correlation between the two indices is very strong in all the cancer types (R² > 0.91). It is worth noting that from Fig. 2 some discrepancies between years in the SMR_istat emerged, for instance, in the cancer type “of which other malignant tumors “ after 2016. Specifically, the SMR_ISTAT data in Fig. 2 seems divided into two broad clusters, one embracing the years 2016–2018, and one embracing 2009–2015. Indeed, starting from the reference year 2016, ISTAT has adopted the edition of the ICD-10 updated to 2016 for the codification of the causes of death which presents numerous changes in the guidelines for the selection of the initial cause, as well as some modifications in the classification of some pathologies compared to the previous one used by ISTAT up to 2015 (i.e. ICD-10 2009 version, further information available at https://www.istat.it/it/archivio/6708, last access 22/11/2021). The tumor type “of which other malignant tumors”, embraces all the tumor types other than those specified in Table 2 and it could be more sensitive to the variations of the IDC-10 coding system.

Usage Notes

The interannual variability of SMR for a given administrative unit might be large under small populations. Indeed, being the SMR a rate standardized over the population size, the expected mortality (i.e., ^Em) in small populations might result low (e.g. 10⁻²) and in turn, according to Eq. 1, even a few deaths (say 1 or 2) in a year could yield a relatively high SMR as shown in Fig. 3. For this reason, we recommend avoiding using single-year estimates and using the SMR at lower 90% or 95% confidence intervals averaged over 3–5 years, at least.

Code availability

Data used for the production of the Italian SMR database are available from ISTAT (see the paragraph Data Source). The elaborations have followed the procedure described in the Methods section and can be implemented in whatever numerical computing environment (e.g., R, Matlab, Python). In our case, the algorithms used were created in Python 3.7 and are available on Zenodo¹⁴.

References

Tsai, H. C. & Baylin, S. B. Cancer epigenetics: linking basic biology to clinical medicine. Cell research 21(3), 502–517 (2011).
Article CAS Google Scholar
Pihan, G. A., Wallace, J., Zhou, Y. & Doxsey, S. J. Centrosome abnormalities and chromosome instability occur together in pre-invasive carcinomas. Cancer research 63(6), 1398–1404 (2003).
CAS PubMed Google Scholar
Danaei, G., Vander Hoorn, S., Lopez, A. D., Murray, C. J. & Ezzati, M., Comparative Risk Assessment collaborating group (Cancers. (2005). Causes of cancer in the world: comparative risk assessment of nine behavioural and environmental risk factors. The Lancet 366(9499), 1784–1793 (2005).
Article Google Scholar
Cazzolla Gatti, R. Why We Will Continue to Lose Our Battle with Cancers If We Do Not Stop Their Triggers from Environmental Pollution. International Journal of Environmental Research and Public Health 18(11), 6107 (2021).
Article Google Scholar
AIRC. https://www.airc.it/cancro/informazioni-tumori/cose-il-cancro/numeri-del-cancro#:~:text=L’incidenza%20dei%20tumori%20in,3%20si%20ammaler%C3%A0%20di%20tumore. accessed on 10/01/2022 (2022).
Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: a cancer journal for clinicians 68(6), 394–424 (2018).
Google Scholar
AIRTUM. https://www.registri-tumori.it/cms/ accessed on 10/01/2022 (2022).
Lallo, C. & Raitano, M. Life expectancy inequalities in the elderly by socioeconomic status: evidence from Italy. Population health metrics 16(1), 1–21 (2018).
Article Google Scholar
Cazzolla Gatti, R. et al. The spatial association between environmental pollution and long-term cancer mortality in Italy. Science of the Total Environment 853, 1–12 (2022).
Google Scholar
La Vecchia, C., Rota, M., Malvezzi, M. & Negri, E. Potential for improvement in cancer management: reducing mortality in the European Union. The oncologist 20(5), 495–498 (2015).
Article Google Scholar
Torre, L. A., Siegel, R. L., Ward, E. M. & Jemal, A. Global cancer incidence and mortality rates and trends—an update. Cancer Epidemiol. Prev. Biomark 25, 16–27 (2016).
Article Google Scholar
Di Paola, A. et al. A ten-year (2009–2018) database of cancer mortality rates in Italy, Dryad, Dataset, https://doi.org/10.5061/dryad.ns1rn8pvg (2022).
Di Paola, A. et al. A ten-year (2009–2018) database of cancer mortality rates in Italy. Zenodo. https://doi.org/10.5281/zenodo.6571949 (2022).
Di Paola, A. et al. A ten-year (2009-2018) database of cancer mortality rates in Italy. Zenodo. https://doi.org/10.5281/zenodo.6571947 (2022).
WHO, World Health Organization—WHO. International Statistical Classification of Diseases and Related Health Problems, Tenth Revision. World Health Organization: Geneva, Switzerland I–III (1992).
WHO https://icd.who.int/ accessed on 10/01/2022 (2019).
Breslow, N. E. & Day, N. E. The design and analysis of cohort studies. International Agency for Research on Cancer. Statistical methods in cancer research 82, 1–406 (1987).
Google Scholar
Ulm, K. Simple method to calculate the confidence interval of a standardized mortality ratio (SMR). American journal of epidemiology 131(2), 373–375 (1990).
Article CAS Google Scholar
Dahiru, T. P-value, a true test of statistical significance? A cautionary note. Annals of Ibadan postgraduate medicine 6(1), 21–26 (2008).
PubMed PubMed Central Google Scholar
Sterne, J. A. & Smith, G. D. Sifting the evidence—what’s wrong with significance tests? Physical therapy 81(8), 1464–1469 (2001).
Article Google Scholar
Thomas, B. What future quantitative social science research could look like: confidence interval for effect sizes. Educ Research 31, 25–32 (2002).
Article Google Scholar

Download references

Acknowledgements

We are thankful to the Italian Statistical Institute (ISTAT) for providing the raw data on mortality and demography.

Author information

These authors contributed equally: Roberto Cazzolla Gatti, Arianna Di Paola.
These authors jointly supervised this work: Nicola Amoroso, Roberto Bellotti.

Authors and Affiliations

Department of Biological, Geological and Environmental Sciences (BiGeA), Alma Mater Studiorum – University of Bologna, Bologna, Italy
Roberto Cazzolla Gatti
Institute for BioEconomy (IBE), National Research Council of Italy (CNR), Rome, Italy
Arianna Di Paola
Dipartimento Interateneo di Fisica, Università degli Studi di Bari Aldo Moro, Bari, Italy
Alfonso Monaco & Roberto Bellotti
Istituto Nazionale di Fisica Nucleare, Sez. di Bari, Italy
Alfonso Monaco, Nicola Amoroso & Roberto Bellotti
Biological Institute, Tomsk State University, Tomsk, Russia
Alena Velichevskaya
Dipartimento di Farmacia – Scienze del Farmaco, Università degli Studi di Bari Aldo Moro, Bari, Italy
Nicola Amoroso

Authors

Roberto Cazzolla Gatti
View author publications
You can also search for this author in PubMed Google Scholar
Arianna Di Paola
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Monaco
View author publications
You can also search for this author in PubMed Google Scholar
Alena Velichevskaya
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Amoroso
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Bellotti
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.C.G. Conceptualization, Methodology, Formal analysis, Writing (Original Draft), Writing (Review & Editing), Visualization, Supervision, Validation. A.D.P. Investigation, Methodology, Formal analysis, Writing (Original Draft), Writing (Review & Editing), Visualization, Software, Validation, Data Curation, Project administration. A.M. and N.A. Investigation, Methodology, Formal analysis, Writing (Original Draft), Writing (Review & Editing), Validation. A.V. Preliminary analysis, Writing (Review & Editing). R.B. Writing (Review & Editing), Validation.

Corresponding author

Correspondence to Alfonso Monaco.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cazzolla Gatti, R., Di Paola, A., Monaco, A. et al. A ten-year (2009–2018) database of cancer mortality rates in Italy. Sci Data 9, 638 (2022). https://doi.org/10.1038/s41597-022-01729-0

Download citation

Received: 13 June 2022
Accepted: 12 September 2022
Published: 21 October 2022
DOI: https://doi.org/10.1038/s41597-022-01729-0

This article is cited by

Machine learning and XAI approaches highlight the strong connection between $O_3$ and $NO_2$ pollutants and Alzheimer’s disease
- Alessandro Fania
- Alfonso Monaco
- Roberto Bellotti
Scientific Reports (2024)
A Dementia mortality rates dataset in Italy (2012–2019)
- Alessandro Fania
- Alfonso Monaco
- Roberto Bellotti
Scientific Data (2023)