Background & Summary

Submarine mass movements are common occurrences on marine domains, from the shallow coasts to the deepest areas of the oceans1. The resulting landslides can be characterised by a variety of deposit features and morphologies, influenced by the mechanic properties of the original strata, the dynamics of the flow processes, regional geology and seismicity. Although singular massive deposits attract the attention for detailed studied, the regions where they occur can record geological evidence of hundreds or thousands of smaller-scale landslides, often poorly covered by available data and of limited focus of analysis.

Submarine landslides are a primary geohazard in marine environments. Tsunamis generated from landslides on the flank of subaerial topography flowing into the sea2, or from large collapses on fully submerged morphologic features3 are a major concern. Moreover, geotechnical installations and infrastructures resting on the seafloor such as submarine communication cables, pipelines or any purpose-build platform are sensible to mass movements4. Submarine landslides impact in marine biological communities, either by acting as habitat hotspots on their scars and remobilised elements or by disturbing and modifying seafloor ecology during emplacement5. Recognising submarine landslide extents has further political implication as these are used to set international ZEE boundaries under the definition of the UN Convention on Law of the Sea6. It is thus crucial to understand the distribution patterns and morphometric trends of submarine landslides according to the regional setting in which they occur, and aim to unravel insights on their causes and deposits1,7.

Efforts have been made to compile databases of submarine landslides to better understand their distribution and characteristics in marine settings around the world1,7. Regional compilations are available from the US Atlantic margin8,9, the Mediterranean Sea10, the Spanish margins11 or Australia12. Global data compilations have also allowed the comparison of landslides in distinct geological settings7,13,14. However, extensive submarine landslide characterisation is still lacking in many continental margins, and adequate characterisation depends on the quality of available data. Such is the case of the West and Southwest Iberian Margin, on the Northeast Atlantic Margin. This is an area of relevant geological risk, with frequent seismic activity resultant from the NW-ward collision of the African and European tectonic plates15,16. This has led to the occurrence of several high magnitude earthquakes (Mw > 7), of which the 1755 Lisbon Earthquake and tsunami is one of the major documented natural disasters17. Furthermore, the chains of large seamounts that occur in the area create major bathymetric features rising up to five kilometres above from the abyssal plain depths16, and are associated to intermediate to high seismicity, which is known to be a landslide trigger. Instability susceptibility studies conducted on the study area indicate that large extents of the continental slope and seamounts are prone to failure18,19. Yet, few submarine landslide studies exist, and these focused on specific case studies3,20,21,22.

It is thus crucial and timely to provide a broader perspective of the distribution and morphometric trends of submarine landslides offshore Iberia. This work presents the MAGICLAND (Marine Geo-hazards Induced by underwater Landslides in the SW Iberian Margin) database, which covers the geographical area from 33° 45′ N to 43° N and from 6° 22′ W to 16° 15′ W, and compiles geomorphological data of 1552 submarine landslides based on the interpretation of DEM bathymetric grids provided by EMODnet23 (Fig. 1). Our results are crucial to understand the broad distribution of geohazards in the area, and aim to contribute to global efforts to compile landslide information in different geological and oceanic settings. This dataset is openly available through the Open Science Framework data repository24 for the use and benefit of the international marine and geohazard community. Further contributions or analyses based on, and complementing the MAGICLAND database will be welcome.

Fig. 1
figure 1

Map of the study area offshore West and Southwest Iberia shown as a blend of the bathymetry and slope gradient rasters. The red lines trace the limit of morphological scars identified on the EMODnet 2018 DEM. The grey polygons adjacent to the scars depict the landslide area, but only major ones are discernible at the scale presented. Contour lines were calculated from the DEM using a spacing of 100 m.

Methods

This section describes the methodology workflow used for data acquisition and preparation. This was set in three main stages, namely the Digital Elevation Model (DEM) data loading, the manual digitizing of landslide features, and volume calculation procedures. The main steps for each stage are summarised in Fig. 2.

Fig. 2
figure 2

Workflow diagram of the submarine landslide mapping and analysis procedure used to compile the MAGICLAND database.

DEM loading and referencing

The mapping of scars and landslide features was based on the 2018 version of the EMODnet DTM (or DEM) for European seas covering the West and Southwest Iberian margins23 (Fig. 1), respectively referred to as WIM and SWIM. The EMODnet datasets result from the compilation of numerous bathymetric surveys made available by providers of 24 European countries, and include satellite derived bathymetry information derived from Landsat 8 imagery. Despite a general harmonization of the EMODnet data the density (and resolution) of its coverage is variable, a consequence of the data collection procedures and survey resources19. For this work, we used the XYZ data version of the EMODnet 2018 F3 DTM tile23 with a general 1/16 × 1/16 arc minutes grid, which at the latitude of Iberia corresponds to a resolution of approximately 115.6 m × 115.6 m. The XYZ data were loaded in the GIS software to produce DEM bathymetry and slope map rasters. Both rasters were reprojected from the WGS84 projection (EPSG:4326), used as default by EMODnet, to the WGS 84 UTM29N coordinate system (EPSG: 32629). All digitizing operations and morphometric measurements are based on the latter, with this also being the default projection system for the data provided in the repository24.

Landslide morphometric mapping

Mapping of the landslide morphological features observed on the DEMs was made using 2D and 3D visualisation perspectives on GIS software to delimit the scars and limits. Landslide morphometric mapping followed, as possible, established criteria1 for direct measurement features (Table 1 and Fig. 3), complemented by additional morphological and calculated parameters. Each landslides feature was identified with a unique reference ID (Scar_ID) to which all morphometric parameters recorded as point, line or polygon shapefiles were associated. Specifically, point features identify the location of each slide using an XYD reference; line features were used to trace the scar limit, and landslide length and width; and polygon features to delimit the landslide perimeter and the deposit section (when present or identifiable). The initial spatial association of these shapefiles used an automated proximity detection between the different features. The final merged shapefile was examined for consistency and for the correct match between the different elements, i.e., to make sure that all lines and polygons were associated to the same reference point and Scar_ID. This is a crucial quality check step as the automated process is prone to associate erroneous neighbour points in closely-spaced or overlapping features. The inaccurate records on line and polygon shapefiles derived from the automated merge were manually edited, and the corrected shapefiles were re-associated using the Scar_ID parameter. After the manual interpretation of the features on the DEM, automated processes were used to calculate additional parameters to populate the database. The parameter list and description are provided in Tables 1 and 2. In the instances where it was possible to identify the landslide deposit, this was delineated according to the morphological character displayed on the bathymetry and slope DEMs (Fig. 3). A second set of additional parameters was digitized for the deposit length, width, perimeter and area. The equivalent parameters were determined for the landslide evacuation section by subtracting the value of the deposit parameters from the total measurements.

Table 1 List of the morphometric parameters used in the MAGICLAND database.
Fig. 3
figure 3

Schematic diagram of analysed landslide parameters. (a) Profile view along the landslide run-out, including the subdivision between the evacuation and deposit sections. Horizontal length lines at the top represent measurements based on a map perspective, while the dashed red lines represent realistic length measurements projected to the DEM 3D relief. The Height value represents the vertical different between the minimum depth at the scar upslope limit and maximum depth at the termination. (b) Perspective diagram representing the areal morphometric parameters analysed.

Table 2 Metadata summary table for specific data items and parameters.

Volume calculation

For volume calculation, a new DEM raster was calculated to represent the pre-landslide morphology (Fig. 2). To produce this reconstructed surface, a copy of the bathymetry raster was created. Next, the raster was clipped using the landslide limit polygon to remove all morphological data derived from each event. The data gaps were filled using a multilevel b-spline interpolation, further resampled to a 50 m × 50 m grid. The low-frequency raster component derived from this calculation was used as the model for pre-landslide morphology. All other interpolation product rasters were discarded as they do not produce accurate data for this objective.

The landslide evacuation volume calculations used the original bathymetry and the reconstructed morphology DEMs as base and top limiting surfaces, respectively. This operation used the QGis Volume Calculation Tool plugin, which allows the assignment of individualised polygons to delimit areas of operation. This feature greatly optimised the volume calculation task, especially taking into account the sample size. By constraining calculations within each individual landslide limit, the volume calculated for each feature was immediately added to the corresponding Scar_ID in the attribute table. For this work, we kept the values representative of the volume evacuated (cut) by the landslide. The landslide deposit (fill) volume calculated from the two used DEMs was discarded as it cannot be reliably estimated without subsurface data to identify the correct base of the deposit.

Data preparation and visualisation

The final data was compiled in a spreadsheet (MagicLand-Data.xlsx) using the Scar_ID as the data merging attribute. Sequential gaps in the numeric order of this attribute on the data provided are due to the manual removal of faulty entries which either had erroneous parameters or were outside of the target area. These erroneous data would ultimately skew any statistical analysis based on the dataset. Preliminary data for eight representative morphometric parameters are presented in Fig. 4. Logarithmic Y scales were used, and are recommended, for a better visualisation of parameters with very large ranges of values.

Fig. 4
figure 4

Boxplots summarizing the observations from eight selected landslide parameters. Logarithmic scales were used to represent the y-axis of length, width, area and volume for a clearer assessment of the distribution ranges.

Data Records

The MAGICLAND dataset24, available through the Open Science Framework (https://doi.org/10.17605/OSF.IO/S96RW), includes a set of files with the shapefiles with the submarine landslide data (points, lines and polygons), relevant maps in GeoTIFF format, and data records in Microsoft Excel spreadsheets. The shapefiles relative to the landslide location points, scars and areas are included in the respective zip files. The GeoTIFF files correspond to the base bathymetry DEM (MagicLand-BathymetryXYZ.tiff), slope map (MagicLand-slopemap.tiff), reconstructed pre-landslide DEM (BathymReconstruct-Resample50x50_LowPassFilter.tiff) and the cover surface clipped to the landslide area (MagicLand-ReconstCover.tiff). Table 1 describing the morphometric parameters analysed and Table 2 with metadata descriptions are also included as in the dataset (MagicLand-Parameters.pdf and MagicLand-Metadata.pdf). The main data spreadsheet (MagicLand-Data.xlsx) includes the 1552 data entries and parameters. A subset corresponding to the morphometric properties of the deposit and evacuation regions is provided on a separate data file (MagicLand-DepositEvacData.xlsx).

Technical Validation

The dataset presented exhibits sources of uncertainty inherent to distinct steps of the data compilation. These can be attributed to the base dataset used, to the manual interpretation of landslide scars, perimeter, length and width, and to the measurement accuracy.

DEM resolution

The measurements derived from the DEM have an inherent data uncertainty derived from its resolution. Exact details for the spatial area covered in this analysis are not possible to be provided as the EMODnet data derives from the compilation of multiple surveys. This may increase inaccuracies and artifacts, a common issue with bathymetric data25. As the dataset used has a harmonised resolution of 115.6 m × 115.6 m, features smaller than the specified distances were not identifiable. The lower resolution areas of DEMs can also compromise the calculated slope values26 as no detailed morphologies are represented. On our dataset this issue has implications for the mapping and measurement of landslide parameters, particularly towards the western and southwestern limits. Here, the poorer data resolution is perceptible from the smoothed, less detailed contour lines (Fig. 1). Consequently, a lower number of landslides were mapped toward the western limits of the DEM.

Interpreter bias and data limitations

The manual interpretation of the morphometric features consists of digitising lines and polygons close to the perceived morphological limits of the landslide on the DEM. This process is prone to variation between different individuals and can be influenced by factors such as map resolution and visualisation scale. While major parameters such as length or height tend to lead to low variability, others such as width or the delimitation of the evacuation and deposition areas are prone to higher interpreter variability1. This is prone to happen during replication of our work for parameters defined as single-value landslide features, such as width, that effectively change along its length. Nevertheless, the high number of samples likely attenuates the interpreter-derived variability and minimises deviations from the major statistical trends (Fig. 4). The delimitation of the landslide deposit, when identifiable, is likely underestimated as the DEM only expresses seafloor morphologies. Consequently, when the landslide deposit is partially or fully buried, the deposit and full landslide real length may be higher than the values recorded.

Volume calculation

The accurate volume calculation of the 1552 landslides presented the biggest challenge as we tried to use a uniform method that is applicable to all landslides at once. Interpolated top surfaces across the landslide scar area is a method successfully used to reconstruct top pre-landslide morphologies in previous studies9, and this was suitable to use in our database objectives. While on longitudinal sections the reconstructed surface has adequate matches with the landslide limits on the bathymetry DEM, transverse sections clipped to the landslide limits may intersect the sidewall at points below its apex. Thus, absolute volume calculation can be underestimated. The reconstruction may also present limitations for smaller landslides in low slope gradient areas. However, this compromise is required to allow the swift volume computation for all elements identified. Furthermore, it should not significantly affect comparative analysis of relative landslide volume magnitude between distinct examples or locations.

Measurement usage in 3D

The standard procedure to digitise lines and polygons on GIS is based on 2D map projections, with subsequent spatial measurements being primarily derived from planimetric perspectives. This procedure was the base to obtain the landslide total length (Lt), width (Wm) or area (At) parameters. However, the effect of slope gradient on distance and area calculations is relevant and it is crucial to estimate it to obtain, for instance, the realistic submarine landslide runout length. To mitigate this issue, the lines and polygons of landslide features mapped using the planimetric perspective were projected and recalculated over the 3D DEM. Thus, an additional set of length, width and area parameters was obtained for the full landslide and for its deposit and evacuation subsections. The parameters derived from the projection on the 3D surface are identified as **-r (see Table 1 for the full listing) where the r stands for “realistic”, or at least close to it given the inherent data resolution and uncertainties.

Figure 5 illustrates the aforementioned effect of slope gradient on the length measurement of morphological features using the Lt/Lt-r ratio and the average gradient along the landslide runout vector. Very low slope angles will have minimal impact on the length measurements, but for angles of 15 degrees Lt can be 20% shorter than the Lt-r measurement. Towards the steeper slope values of our sample, of circa 30 degrees, this effect can lead to Lt measurements 35% shorter than the Lt-r. The same principle is valid for width, perimeter and area measurements.

Fig. 5
figure 5

Scatter plot representing the impact of slope gradient on the Lt and Lt-r measurement. The Lt/Lt-r ratio indicates how much the planimetric and 3D-projected length measurements differ, with values closer to 100% indicating a minimum or no difference. As shown, the lower the slope gradient, the lower the Lt diverges from the real topographic value Lt-r. The fringes adjacent to the plot axis represent the frequency of registered values.

Usage Notes

All researchers interested in submarine geomorphology, landslides and GIS are free to use the datasets provided at will, with appropriate acknowledgement of the source. The data provided in the repository allows an immediate reproducibility of the results and opens possibilities for further statistical analysis and integration with other databases – being that for individual research items or integration at wider scale. The majority of the work was produced using QGIS v3.14, but all items are importable to any GIS software of choice. Despite the high number of features mapped, there are many more occurrences of landslide and mass-movement features passive of being mapped. Further versions of the MAGICLAND database will make efforts to integrate subsurface information and higher detail metrics when higher resolution bathymetric data is available. Researchers are welcome to contribute to the development of this dataset as deemed fit, either by improving knowledge of the mapped features or adding new ones.