Background & Summary

Volcanic rocks are a record of magmatic processes, from the source origin of the magma, its residency in the crustal plumbing system, and its eventual eruption at the earth’s surface1,2. These processes are reflected in the petrological and geochemical composition of the rocks, which reveal changes in magma storage and transport3,4 and are a key factor for understanding eruptive styles5,6. Understanding these processes is thus important for volcanic hazard forecasting and mitigation. Moreover, the global composition of volcanic rocks also give clues to the formation and evolution of the continental crust7. Information regarding volcanoes, their eruptive histories, and their rock compositions can be found in various global databases, including the following two:

  • The Geochemistry of Rocks of the Oceans and Continents (GEOROC8,9,10,11,12,13,14,15,16,17,18, https://georoc.eu) database, from the Digital Geochemistry Infrastructure (DIGIS), contains numerous volcanic rock samples, together with their location, material, and chemical rock composition. It also contains rock names as provided by the authors of the respective publications. The GEOROC database contains information about the geochemistry of volcanic rocks, namely the quantitative chemical composition (e.g., the concentration of oxides, elements, and some isotopes) together with various types of material that were analysed (rock, glass, mineral) of each sample.

  • The Volcanoes of the World (VOTW) database of the Smithsonian’s Global Volcanism Program (GVP19,20, https://volcano.si.edu/) provides information pertaining to eruptions from Holocene volcanoes. A list of up to 5 major and minor rock names are provided for some volcanoes, although without the chemical composition of the rocks.

The most abundant element in volcanic rocks is silica (expressed as SiO2) and together with sodium (Na2O) and potassium (K2O) are used to name volcanic rocks, glass, and melt inclusions in 15 fields according to Le Bas et al.21,22 and Le Maitre et al.23 in a Total-Alkali Silica (TAS) diagram. This classification was originally made using geochemical data of rock samples compiled from the published literature archived in Computer Library of Analysed Igneous Rocks (CLAIR) database24.In this data descriptor, we assign rock types (major and minor) of the global Holocene volcanoes, based on the quantified mass (wt%) of the total alkali and silica oxide of the volcanic rock samples from the GEOROC database, without using pre-defined rock names (as assigned by the authors of the respective publication). We use the TAS of Le Bas21,22 categorization to assign a rock name to a volcanic GEOROC sample. This requires agglomerating GEOROC sample datasets belonging to the corresponding GVP volcano25, which is achieved by matching the names and locations from both databases. This results in a new dataset, which lists volcanoes, five major and minor rock types, with their percentages, following the TAS nomenclature21,22,23 and then translated into the GVP classification19,20 (see Table 1). The major and minor rock types are given by material type (whole rock, glass, and melt inclusion), and for all materials combined. Case examples are provided to validate the data. This dataset has an important reuse value, since it provides data at a compact, yet finer granularity compared to what is currently available in the GVP database. We however acknowledge that it remains a temporal snapshot of GVP and GEOROC which are evolving databases. The rock names for major rocks are expected to remain mostly stable over time, whereas minor rock names, which by definition rely on a small abundance, are susceptible to small variations.

Table 1 Rock names correspondence.

Methods

The GEOROC database

We downloaded in June 2021 the GEOROC precompiled files from the Geochemistry of Rocks of the Oceans and Continents (GEOROC8,9,10,11,12,13,14,15,16,17,18, https://georoc.eu of the Digital Geochemistry Infrastructure (DIGIS)). The files used are available in Dataverse “GEOROC data from 2021”26. The GEOROC database has changed since then, and is currently delivering updates every three months which are associated with a DOI. The precompiled files are grouped in 11 folders, mainly according to their tectonic settings: Archean cratons, complex volcanic settings, continental flood basalts, convergent margins, intraplate volcanics, ocean basin flood basalts, ocean island groups, oceanic plateaus, rift volcanics, seamounts and submarine ridges. Melt inclusions are stored in a separate folder. In this paper we only use volcanic rock samples: there are ~380,000 in total, out of which more than 138,000 are relevant for Holocene volcanoes. In Dataverse “GEOROC data from 2021”26, we also added one folder for complementing the data for three volcanoes, Fuji, Merapi and Rinjani, which we compiled and is not part of the GEOROC dataset. We note that on top of being a temporal snapshot, GEOROC data may not necessarily be representative of the actual rock compositions of the volcanoes, as samples may have been collected because they were out of the ordinary, which may lead to a sampling bias. One suggestion to mitigate these biases is to consider linking the samples with mapped geological units, and thus estimate the abundances of different rock types more accurately. However, implementing such an approach would be a substantial undertaking. It would require extensive fieldwork, detailed geological mapping, and careful sample collection and analysis. This complex and time-consuming project may not be feasible for all volcanic areas or studies, considering the resources and efforts involved.

Matching sample locations with volcano locations

The GVP lists volcanoes with their name and geographical location (latitude, longitude, country, region, and subregion), but in the GEOROC database, a sample has a LOCATION field, a LOCATION COMMENT field, a range of latitudes (LATITUDE MIN, LATITUDE MAX) and a range of longitudes (LONGITUDE MIN, LONGITUDE MAX). The LOCATION field is of the form REGION/SUBREGION/SUBSUBREGION. To match GVP volcanoes and GEOROC samples, we first match their latitudes/longitudes, and then refine by names. Volcano names in the GEOROC database are typically found inside the LOCATION field, but sometimes inside the LOCATION comment. Based on the 2021 data, 1416 Holocene volcanoes listed by GVP had major rock data, which, after matching with GEOROC data, yield 1044 volcanoes with major rock data as per GEOROC25. Note that the original GVP data we used in the paper corresponds to the 2021 release.

Computing rock names from chemical compositions

The rock classification diagram following the TAS scheme21,22,23, was used to identify the rock names, which rely on three main oxide values in volcanic rocks: silica (SiO2) together with sodium (Na2O) and potassium (K2O). The 15 rock names following the TAS fields (Fig. 1a) are the following: 1) FOIDITE, 2) PICROBASALT, 3) BASALT, 4) TEPHRITE/BASANITE 5) TRACHYBASALT, 6) BASALTIC ANDESITE, 7) ANDESITE, 8) BASALTIC TRACHYANDESITE, 9) TRACHYANDESITE, 10) DACITE, 11) RHYOLITE, 12) TRACHYTE/TRACHYDACITE, 13) PHONO-TEPHRITE, 14) TEPHRI-PHONOLITE, 15) PHONOLITE. Using the fields SIO2(WT%), NA2O(WT%) and K2O(WT%) from GEOROC, we computed for each sample a point on the TAS diagram, corresponding to a rock name, in the above list.

Fig. 1
figure 1

(a) The Total Alkali Silica (TAS) diagram described in Le Bas et al.21,22 and Le Maitre et al.23 with the 15 field names used to classify the GEOROC volcanic rocks in this paper, which are the following: (1) FOIDITE, (2) PICROBASALT, (3) BASALT, (4) TEPHRITE/BASANITE (5) TRACHYBASALT, (6) BASALTIC ANDESITE, (7) ANDESITE, (8) BASALTIC TRACHYANDESITE, (9) TRACHYANDESITE, (10) DACITE, (11) RHYOLITE, (12) TRACHYTE/TRACHYDACITE, (13) PHONO-TEPHRITE, (14) TEPHRI-PHONOLITE, (15) PHONOLITE. (b) The same TAS diagram with with the corresponding rock types based on GVP categorisation.

To ensure the rock name computation is accurate, one needs to account for the iron concentration, reported as FeO and Fe2O3. GEOROC contains data fields for FE2O3(WT%), FEO(WT%), and FEOT(WT%). For samples where only FEOT(WT%) is reported we used this value. For samples where FE2O3(WT%) and FEO(WT%) are reported, we recalculated Fe2O3 into FeO and added it to the reported FEO(WT%), so that for all samples we report FEOT(wt%) as: \(\frac{{\rm{FE}}2{\rm{O}}3\left({\rm{WT \% }}\right)}{1.111}+{\rm{FEO}}\left({\rm{WT \% }}\right)\). Moreover, we normalized SIO2(WT%), NA2O(WT%), K2O(WT%) and all other oxides to 100(wt%). We only compute a rock name for samples such that 35 < SIO2(WT%) < 80, FEOT(WT%) > 0, NA2O(WT%) > 0 and K2O(WT%) > 0, which is done after normalization.

We do not exclude any sample compositions if their totals are too high/low. We do not filter samples that are altered either. We note that >95% of the volatiles are water and most of them (including water) are lost to the atmosphere during eruption. Thus, the concentration of remaining volatiles in the rock is <1% and does not have an influence on the normalization. We would also like to add that it is a standard procedure of the community to report normalized oxide compositions to 100% on an anhydrous basis to be able to compare between samples.

Materials

Each GEOROC volcanic rock sample can represent one out of four types of material analysed: whole rock (WR), volcanic glass (GL), melt inclusion (INC) and mineral/ component including groundmass (MIN) according to the field MATERIAL. Noting that GVP describe their major rocks based on whole rock analysis19,20, we thus compute GEOROC major and minor rocks for each material separately, namely for whole rock, volcanic glass and melt inclusions. Using only whole rock (WR) provides a comparison with GVP records. The other records complement the understanding of volcanic rocks and are useful to compare between different types of material analyzed, e.g., melt inclusions compared to whole rock samples.

Major and minor rocks

For each GVP Holocene volcano, we matched the corresponding GEOROC samples. Each match is a volcanic rock sample of a specific material and is assigned with a major/major rock name, following the described procedure above. The assignment of major rock type is done by number of occurrences in each TAS field (Fig. 1a, Table 1). The most common rock type among all samples from a given volcano becomes its major rock 1, then followed sequentially by major rock 2 to 5 based on the order of abundance. Following GVP rock-type assignment19,20, major rock types are those that consist of more than 10% of the total population, when known and quantified. Those that are less then 10% are labelled as minor rock types. We note that when two rock names appear the same number of times, one of the two is chosen arbitrarily over the other, but the user can see the percentages and evaluate.

Rock names

The dominant rock names for Holocene volcanoes provided by GVP includes the five most common ones, listed in order of decreasing abundance (Fig. 1b, Table 1). The GVP classification is different to the general TAS scheme (Fig. 1a, Table 1) described by Le Bas et al.21,22 and Le Maitre et al.23. To be able to compare the major rock names obtained from the GEOROC database to the GVP major rock names, we thus translated the TAS nomenclature to the GVP classification. A correspondence between the names is given in Table 1.

Data Records

We provide two data files, available in Dataverse “Major rocks from the GEOROC database”27. The first row of each file contains the headers, which are the same for both files, and are: Volcano Number, Volcano Name, material, major rock 1, % of major rock 1,major rock 2, % of major rock 2,major rock 3, % of major rock 3, major rock 4, % of major rock 4, major rock 5, % of major rock 5, minor rock 1, % of minor rock 1, minor rock 2, % of minor rock 2, minor rock 3, % of minor rock 3, minor rock 4, % of minor rock 4, minor rock 5, % of minor rock 5.

The rock names in the file “completeGEOROCmajorminorrocks2021.csv”were computed from the GEOROC data based on TAS nomenclature21,22,23. In the file “completeGEOROCmajorminorrocks2021_GVPnames.csv”, the rocks are named according to GVP rock name convention19,20. For both data files, the first (empty) field before the first comma stores the row number (which starts from 0 and increases, before ordering the data by volcano number.). A description of the content of each field is provided in Table 2. When no major or minor rock is available, ‘No Data’ is indicated.

Table 2 Dataset fields and content.

The datasets described in this paper are available in open access repositories:

Technical Validation

To validate the data, we first visually compared the TAS diagram obtained from GEOROC data samples with the five major rock types that were computed and stored in the new dataset. In Fig. 2, four TAS diagrams are shown for Kilauea volcano: one for whole rock samples, one for volcanic glass samples, one for melt inclusion samples, and one for all samples combined. We can compare their relative abundances on the TAS diagrams, and their major rock names. A similar visual comparison is provided for the Taranaki volcano (New Zealand) on Fig. 3. This volcano has fewer samples than Kilauea. Detail explanation on the assignments of major and minor rock types with case example of the Taranaki volcano can be seen in the appendix (supplementary information).

Fig. 2
figure 2

TAS diagram for Kilauea volcano, Hawaii. There are 4722 GEOROC volcanic rock samples, categorised into 3 different types of material analysed (2334 whole rocks, 1238 volcanic glasses and 1150 melt inclusions).TAS plot for different materials and their respective ordering of their 5 major rock types are presented: (a) All available type of materials, (b) only from whole rock analyses [Basalt (83.7%), Picrobasalt (12.0%), No Data, No Data, No Data], (c) only for volcanic glasses [Basalt (74.1%), Basaltic Andesite (17.0%), No Data, No Data, No Data], and (d) only for melt inclusions [Basalt (75.7%), Basaltic Andesite (21.2%), No Data, No Data, No Data].

Fig. 3
figure 3

TAS diagram for Taranaki volcano, New Zealand. There are 145 GEROC volcanic rock samples, categorised into 3 different types of material analysed (137 whole rocks, 6 volcanic glasses and 2 melt inclusions). TAS plot for different materials and their respective ordering of their 5 major rock types are presented: (a) All available type of materials, (b) only from whole rock analyses [Basaltic Trachyandesite (38.7%), Basalt (17.5%), Trachybasalt (12.4%), Basaltic Andesite (11.7%), Trachyandesite (11.7%)], (c) only for volcanic glasses [Rhyolite (83.3%), Trachyte/Trachydacite (16.7%), No Data, No Data, No Data], and (d) only for melt inclusions [Trachyte/Trachydacite (100.0%), No Data, No Data, No Data, No Data].

To validate the soundness of our dataset, we provide an overall comparison with major rock types that are available in the GVP database, using only whole rock samples. Major Rock 1 for Kilauea volcano is Basalt/Picro-Basalt (GVP19, https://volcano.si.edu/volcano.cfm?vn=332010). For Taranaki volcano, the Major Rock 1 is Andesite/Basaltic Andesite (GVP19, https://volcano.si.edu/volcano.cfm?vn=241030). We do not expect to find the exact same result, since different compositional datasets are being used by GVP and this paper. GVP estimated rock types for each volcano based upon geological, geochemical and/or petrological studies published prior to 2010, with relative abundances estimated from eruptive histories and precedence given to rock types composing younger edifices20. In contrast, our dataset uses published analyses compiled within GEOROC downloaded in June 202126 following the procedures previously described, but does not weight samples according to eruptive history or mapped extent. Nevertheless, we found that the overall distribution of major rock types are similar (Fig. 4), and that the two datasets agree on the distribution of the first major rock among andesite, basalt, dacite and rhyolite.

Fig. 4
figure 4

Overall presentation of the GEOROC volcanic rock’s chemical composition of the global Holocene volcanoes plotted in sunburst diagram showing the rock names, with major rock 1 at the inner circle, followed by major rock 2 in the middle, and major rock 3 in the outer circle. To illustrate the comparison with GVP major rocks, we translated the 15 TAS classifications of the GEOROC following the GVP nomenclature20. Both GEOROC and GVP are based on whole rocks analyses. (a) The GVP major rock types covering 1416 Holocene volcanoes, and (b) The GEOROC major rock types covering 1044 volcanoes.