Introduction

With the advent of Open Data1, the Open Science movement2, and the FAIR Principles3 in the scholarly ecosystem, the need for storing, managing and sharing data has grown significantly in academia. The General Data Protection Regulation (GDPR) has been an essential step in European data management in recent years. At least initially, the GDPR was the main source of scientists’ fears that it would make their work as data scientists impossible. Indeed, one of the reasons for the introduction of the European Open Science Cloud (EOSC) has been to provide a safe European environment for data management compliant with the GDPR, thereby avoiding the risk of European scientists entrusting all their data to foreign-owned/registered data servers to bypass European laws4.

Since its inception, the EOSC has given strategic relevance to addressing the issues of implementing real interoperability among all the infrastructures, services and data researchers share in this ecosystem. Indeed, one of the most cited and used documents produced in projects implementing the EOSC is the EOSC Interoperability Framework, a report of the Interoperability Task Force of the EOSC Executive Board FAIR Working Group5. This report aimed to identify “the general principles that should drive the creation of the EOSC Interoperability Framework (EOSC IF), and organises them into the four layers [...]: technical, semantic, organisational and legal interoperability”5. In this framework, the objects identified as the key components for implementing semantic interoperability are named semantic artefacts.

A definition for semantic artefact

Previous studies used terms such as Knowledge Organization Systems (KOS)6 or knowledge artefact7 to refer to semantic artefacts. KOS has been adopted as a general term encompassing all schemes used to organise information and promote knowledge management, such as classification schemes, gazetteers, lexical databases, taxonomies, thesauri, and ontologies. These KOSs aim to make explicit the semantic structure of a domain and need to be exposed as web services to facilitate resource discovery and retrieval by either humans or machines.

Other recent works defined a semantic artefact as a machine-actionable and machine-readable formalisation of a conceptualisation enabling sharing and reuse by humans and machines5,8. Semantic artefacts may take a broad range of formalisations, including ontologies, terminologies, taxonomies, thesauri, vocabularies, metadata schemas, and standards5,8. Semantic artefact was also strongly advocated as an overarching term in the H2020 FAIRsFAIR project’s “FAIR semantics” task. Despite the different forms a semantic artefact may take, some works used blanket terms such as “ontologies” or “vocabularies and ontologies”9. Moreover, semantic artefacts are serialised using various digital representation formats, e.g. RDF or OWL expressed in XML, Turtle or JSON-LD8.

In the context of this article, we define a semantic artefact as a machine-actionable formalisation (represented using appropriate formats and serialisations, including RDF and non-RDF standards) of a conceptualisation, enabling sharing and reuse by humans and machines. According to David et al.10, “machine-actionability” is the property belonging to a type for which operations have been specified in a symbolic grammar. This entails that a machine-actionable definition is also machine-interpretable, i.e., it can be related to semantic artefacts in a given context and therefore has a defined purpose, and machine-readable, i.e., it is clearly defined by structural specifications. Semantic artefacts may have a wide range of formalisations, from loose sets of terms, taxonomies, thesauri, (meta)data schemas, term mappings, and schema crosswalks to higher-order logic constructs, vocabularies, and ontologies.
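As a concrete illustration of formats and serialisations, the following minimal sketch (in Python with the rdflib library, assuming rdflib ≥ 6 so that JSON-LD support is built in; the ex: namespace and its terms are purely hypothetical) parses a tiny SKOS vocabulary from Turtle and re-serialises it as JSON-LD, leaving the underlying conceptualisation unchanged:

```python
from rdflib import Graph

# A toy semantic artefact: a SKOS concept scheme with a single concept.
# The https://example.org/vocab/ namespace is illustrative only.
TTL = """
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix ex: <https://example.org/vocab/> .

ex:scheme a skos:ConceptScheme ;
    skos:prefLabel "Toy soil vocabulary"@en .

ex:soil a skos:Concept ;
    skos:inScheme ex:scheme ;
    skos:prefLabel "soil"@en .
"""

g = Graph()
g.parse(data=TTL, format="turtle")    # read the Turtle serialisation
print(g.serialize(format="json-ld"))  # emit the same graph as JSON-LD
```

The graph, i.e. the conceptualisation, is identical in both cases; only the representation format changes, which is precisely what allows the same semantic artefact to be shared and reused across serialisations.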

Where semantic artefacts are preserved

Often, these semantic artefacts are stored and shared using specific services called registries, libraries, repositories, catalogues, or terminology/vocabulary servers. Each provides a mixture of functionality — ranging from simple metadata descriptions to advanced content-based services — to facilitate the finding, accessing, understanding and re-use of such semantic artefacts and to enable their long-term preservation.

The notion of ontology library was introduced in11 and defined as “A library system that offers various functions for managing, adapting and standardizing groups of ontologies”. The same work11 highlighted the importance of making such libraries easily accessible and of offering efficient support for re-using existing relevant ontologies and standardizing them based on upper-level ontologies and ontology representation languages.

The terms “collection”, “listing”, and “registry” are also used to describe ontology libraries. All correspond to systems that help find or reuse ontologies by simply listing them (e.g., the DAML or DERI listings) or by offering structured metadata to describe them (e.g., FAIRsharing, BARTOC, Agrisemantics Map). Yet, those systems do not support additional services beyond the description of the items, e.g., a content analysis of the ontologies or a search index over the ontology content12. A new concept, introduced by Hartmann et al.13 to cover these aspects, is the ontology repository, with advanced features enabling searching, browsing, metadata management, customisation and mapping, as well as an application to query the contents of the ontologies. D’Aquin and Noy14 and Naskar and Dutta15 provide the latest reviews of ontology repositories.

By the end of the 2000s, the topic attracted considerable interest, as illustrated by the 2010 ORES workshop16 or the 2008 Ontology Summit (http://ontolog.cim3.net/wiki/OntologySummit2008). The Open Ontology Repository Initiative17 aimed to create, through collaboration, a joint infrastructure of ontology repositories. At the time, the initiative relied on the NCBO BioPortal technology18, which was the most advanced open-source technology for ontology management, although not yet available as a “virtual appliance” as it is today. Later, the initiative considered adopting the OntoHub19 technology for broader applications, but it has since been discontinued.

Other recent works refer to a catalogue (that may or may not contain semantic artefacts) with terms such as “repository”5,20 and “registry”5, as well as with hypernyms such as “infrastructure” and “service”5,21. Additionally, the FAIRsFAIR project employed the term “semantic registry”, defined as a “catalogue that contains metadata about semantic artefacts”8.

Concerning catalogues, two other works provide relevant insights. Ficarra et al.21 present the generic term open science infrastructures and clarify that they are “services, protocols, standards and software that the academic ecosystem needs to perform its functions during the research lifecycle”. Lin et al.20, instead, introduce the concept of trustworthy digital repositories and provide an operational definition for them as repositories “with a clear remit to actively preserve data in response to changes in both technology and stakeholder requirements”. Finally, the only works referring explicitly to semantic artefacts as the items contained in the catalogue are by Corcho et al.5 and, more recently, Jonquet et al.22.

Considering the state of the art presented above, there is a clear need to adopt an inclusive definition that, in principle, also considers as catalogues web pages (e.g. https://w3id.org/mobility) presenting, in human-readable form, descriptive metadata of the semantic artefacts they include. Thus, in the context of this article, we define a catalogue of semantic artefacts as a dedicated web-based system that fosters the availability, discoverability and long-term preservation and maintenance of semantic artefacts.

Work setting and research question

Two years ago, moved by the principles highlighted in the EOSC IF report, the EOSC Association (https://www.eosc.eu) promoted the creation of thirteen second-generation task forces (EOSC TFs, https://eosc.eu/eosc-task-forces), comprising selected experts from the members of the association, to create guidelines and tools enabling the development and deployment of the EOSC. These TFs have been grouped into four macro topics:

  • Metadata and data quality - focussed on providing guidelines and tools for the discoverability, understanding, and reusability of research objects, and aiming at developing models and benchmarks to assess data FAIRness and quality;

  • Research careers and curricula - dedicated to identifying the characteristics of the main stakeholders involved in the EOSC, i.e. researchers and data stewards, and possible rewarding systems to promote Open Science practices within the EOSC;

  • Technical challenges - directed at proposing approaches to foster authentication and authorisation procedures, augment the quality and FAIR compliance of research software, data and services, and ensure their long-term preservation;

  • Sustaining EOSC - aimed at producing proposals for the long-term financial sustainability of the EOSC and for the monitoring of the TFs.

The first macro topic resonates with three distinct dimensions, each implemented through a different TF. Two of them have been dedicated to FAIR Metrics and Data Quality (https://eosc.eu/advisory-groups/fair-metrics-and-data-quality) and to PID Policy and Implementation (https://eosc.eu/advisory-groups/pid-policy-implementation); they have worked, respectively, on proposing metrics for measuring the FAIRness and quality of data, and on investigating approaches to and the development of persistent identifiers (PIDs) for emerging resource types to be used in scientific knowledge graphs, according to the gaps listed in the EOSC Strategic Research and Innovation Agenda (https://eosc.eu/sria-mar/).

The third TF in the macro topic, which serves as the context of the present article, has been entirely dedicated to Semantic Interoperability (https://www.eosc.eu/advisory-groups/semantic-interoperability) and aims at ensuring “that the precise format and meaning of exchanged data and information is preserved and understood throughout exchanges between parties”5. The charter of this task force23 and the subsequent working activities held in this context have concerned several aspects of semantic interoperability, including the identification of Semantic Interoperability Profiles (SIPs, i.e. the list of resources used to address a specific semantic interoperability case study as defined by a community), mappings and crosswalks (for establishing relationships between elements in different models to achieve interoperability and data exchange), and the management and sharing of the primary tool for enabling semantic interoperability, i.e. semantic artefacts.

The work we describe in this article contributes to the ongoing effort by the EOSC Task Force on Semantic Interoperability to address interoperability challenges towards the vision of building the EOSC as a Europe-wide shared data infrastructure based on the FAIR ecosystem of data and services. In particular, our work addresses the need to identify dimensions to assess the maturity of catalogues of semantic artefacts. Catalogues are the critical systems that enable and maximise the availability and discoverability of semantic artefacts, thus acting as keyholders and crucial components for implementing semantic interoperability.

Understanding the maturity of such catalogues is a crucial aspect to consider for envisioning how to enable and improve the long-term preservation of the semantic artefacts that permit semantic interoperability of systems. Indeed, a maturity model for assessing such catalogues would provide recommendations for governance and processes for preserving and maintaining semantic artefacts and would help assess and address interoperability challenges. Additionally, improving these catalogues might help to enhance the research data life-cycle and other tools made available by research infrastructures. For example, a research infrastructure may warn researchers about a new version of a semantic artefact and help them find relevant mappings and adapt their research to the new version. Finally, the recent workshop organised by the Horizon Europe project FAIR-IMPACT on the governance of semantic artefacts24 has shown the importance of catalogues in the different governance models for semantic artefacts.

This article presents the outcomes of an extensive analysis done by a group within the EOSC Task Force on Semantic Interoperability aiming at answering the following research question:

Which dimensions and features can be used to assess the maturity of catalogues of semantic artefacts?

To answer this research question, we have gathered various definitions concerning catalogues storing and serving semantic artefacts (either at the metadata level, at the data/content level, or both). Then, by analysing the current literature on the topic, we have defined a model to measure, compare and evaluate available semantic artefact catalogues. We present this maturity model as composed of several dimensions against which catalogues can be compliant and/or enhanced. These dimensions facilitate the categorization of catalogues and the evaluation of their maturity. In addition, we analysed a collection of 26 semantic artefact catalogues, aiming, on the one hand, at completing the maturity model by adding features (or sub-criteria) to each of the dimensions identified and, on the other hand, at showing how existing catalogues comply with such dimensions and sub-criteria. These features serve as a specification of the different levels of compliance of a catalogue against the related dimension and provide a categorical view to assess the maturity of catalogues of semantic artefacts.

The target audience of our work and of the maturity model presented here includes at least the following users:

  • Semantic artefact providers may benefit from the model by understanding the criteria that make their artefacts more discoverable and usable.

  • Users of semantic artefacts can leverage the maturity model to assess and compare different catalogues, gaining insights into the reliability and relevance of the offered artefacts. Both providers and users can rely on the model to select catalogues that align with their specific needs and expectations, e.g. level of accessibility, technical specifications or long-term usability of semantic artefacts.

  • Developers of semantic artefact catalogues can use the maturity model as a roadmap for designing and improving their applications. Understanding the criteria critical to users helps developers prioritise features and improvements that enhance the user experience and the overall utility of the catalogue. It also enables developers to identify areas for continuous improvement through regular evaluation and benchmarking against industry standards and user expectations, ensuring they stay competitive and deliver a valuable resource for semantic artefact providers and users. Furthermore, it allows developers to better communicate the strengths of their catalogues to stakeholders, providers, and users, thus fostering trust and engagement.

The rest of the paper is structured as follows. Section ‘Results’ presents our analysis, including the definition of relevant dimensions and the assessment of the selected catalogues. Section ‘Discussion’ discusses the outcomes, outlines lessons learned, and sketches possible future developments. Finally, following the article guidelines of Scientific Data, Section ‘Methods’ describes the selected material and our analysis methods, leading to the results previously described.

Results

This section reports the dimensions and the related features we have identified by assessing 15 selected documents and 26 catalogues of semantic artefacts (Section ‘Dimensions and features’). We also characterise the selected catalogues of semantic artefacts according to the dimensions and their features (Section ‘Catalogues assessment’).

Dimensions and features

We have identified 12 dimensions that can be used to measure the maturity of catalogues of semantic artefacts. These dimensions, summarised in Table 1 with an indication of the documents that describe or refer to them, are listed in the following subsections. In addition, they are accompanied by several features (or sub-criteria) we have identified by analysing and harmonising the 26 selected catalogues of semantic artefacts.

Table 1 Documents introducing the definitions of catalogues (that either may or may not refer to semantic artefacts explicitly) we have presented in Section ‘Introduction’ and highlighting at least one maturity dimension identified by our analysis (Me: Metadata, Op: Openness, Qu: Quality, Av: Availability, St: Statistics, Pi: PID, Go: Governance, Co: Community, Su: Sustainability, Te: Technology, Tr: Transparency, As: Assessment).

Metadata (Me)

The identification of the minimal set of metadata to describe the catalogue and its semantic artefacts. Great importance is also given to the use of metadata standards and schemas (e.g., DCAT or Schema.org), the adoption of machine-readable formats, the associated documentation, and the licenses used to release the metadata. The six features identified in this dimension are listed below (a machine-readable metadata sketch follows the list):

  a) custom vocabulary - custom metadata is used to describe semantic artefacts;

  b) standard vocabulary - a well-known, widely shared or standard metadata vocabulary is used;

  c) primary metadata - the original semantic artefact metadata are preserved in the catalogue;

  d) version metadata - metadata for each distribution/version of the semantic artefact is available;

  e) human readable - metadata is visible in the user interface in a harmonised manner;

  f) machine readable - metadata is accessible via API or machine-supported formats.
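As an illustration of features b) and f), the following is a minimal sketch (in Python with rdflib; the artefact IRI, the ORCID and all metadata values are hypothetical placeholders) of a machine-readable catalogue record for a semantic artefact built with the standard DCAT and Dublin Core vocabularies:

```python
from rdflib import Graph, Literal, URIRef
from rdflib.namespace import DCAT, DCTERMS, RDF

g = Graph()
artefact = URIRef("https://example.org/artefacts/toy-vocab")  # hypothetical IRI

g.add((artefact, RDF.type, DCAT.Dataset))
g.add((artefact, DCTERMS.title, Literal("Toy soil vocabulary", lang="en")))
g.add((artefact, DCTERMS.description,
       Literal("A small SKOS vocabulary about soils.", lang="en")))
g.add((artefact, DCTERMS.license,
       URIRef("https://creativecommons.org/licenses/by/4.0/")))
g.add((artefact, DCTERMS.creator,
       URIRef("https://orcid.org/0000-0000-0000-0000")))  # placeholder ORCID
g.add((artefact, DCAT.keyword, Literal("soil")))

print(g.serialize(format="turtle"))  # also exposable as JSON-LD via an API
```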

Openness (Op)

The concept of being open from different perspectives. On the one hand, it concerns technical openness, referring to the metadata handled in the catalogue, the software used to run the catalogue, and the services and protocols used to access the metadata. This aligns with the EOSC perspective towards community-driven Open Science infrastructures, adhering to the UNESCO recommendation for Open Science2. On the other hand, openness also refers to the social attitude of enabling anyone interested to deposit content and help govern the catalogue. The four features identified in this dimension are:

  a) fully open source system (OSS) - based on open source software;

  b) customised open source system (OSS) - the catalogue is based on open source software, but the customised instance is not publicly available;

  c) open model - the metadata model/ontology used to document the semantic artefacts is openly available;

  d) open contribution - external or registered users can add/propose new semantic artefacts for inclusion.

Quality (Qu)

The possibility of having mechanisms to check the quality of the metadata provided and, thus, of the catalogue itself; in particular, whether processes and workflows are in place for peer reviewing new entries and curating the catalogue. The five features identified in this dimension are:

  a) curation by owner only - changes (or new submissions) to the semantic artefact can only be conducted by the catalogue owner;

  b) curation by maintainer - changes (or new submissions) to the semantic artefact can be made by the maintainers/curators of the artefact;

  c) certified maintainer - the maintainers/curators of the semantic artefacts are certified and assigned by the catalogue owner;

  d) metadata by editor - metadata is curated by a group of editors;

  e) metadata by system - metadata is generated/curated by an assessment system.

Availability (Av)

This dimension refers to the availability of the metadata and to whether methods are in place for guaranteeing privacy and restricting access to certain data due to legal or other contextual issues. The three features identified in this dimension are:

  a) no restriction - no authentication methods are provided; contents are freely available without restrictions;

  b) multilinguality - items are translated and available in several languages and/or the content of the artefact is accessible in multiple languages;

  c) moderated services - some functionalities for accessing and modifying semantic artefacts are available only to registered users and content creators.

Statistics (St)

The availability, over time, of statistics about the catalogue (number of semantic artefacts handled, number of users, etc.) to measure its usage and growth. The three features identified in this dimension are:

  a) catalogue statistics - basic metrics about the metadata catalogue;

  b) resource statistics - metrics on each semantic artefact;

  c) social metrics - social metrics for semantic artefacts, e.g., stars, likes, and the number of contributors.

PID (Pi)

The use of persistent identifiers (PIDs) that refer to the metadata of the various semantic artefacts described in the catalogue and to the semantic artefacts themselves. The two features identified in this dimension are listed below (a minimal PID resolution sketch follows the list):

  a) PID in metadata - PID used to identify the metadata values of the semantic artefact object included in the catalogue, e.g. ORCID for authors and ROR for research organizations;

  b) resource PID - PID used to identify the semantic artefact itself, e.g., DOI, PURL, w3id.
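A hedged sketch of what feature b) implies in practice: a resource PID is expected to resolve, typically via HTTP redirection, to the artefact or its metadata. The snippet below (Python; the w3id IRI is hypothetical, and content negotiation depends on how the redirect is configured) follows such a redirect and asks for a Turtle representation:

```python
import requests

pid = "https://w3id.org/example/toy-vocab"  # hypothetical resource PID

# Follow redirects and negotiate an RDF serialisation, if the server offers one.
resp = requests.get(pid, timeout=10, headers={"Accept": "text/turtle"})

print(resp.status_code)                  # 200 if the PID resolves
print(resp.url)                          # location the PID currently points to
print(resp.headers.get("Content-Type"))  # text/turtle if negotiation succeeded
```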

Governance (Go)

The rules that define the governance of the catalogue, its goals and its purpose, which should allow community input and ensure responsibility for the integrity of the metadata. The three features identified in this dimension are:

  a) 3rd party - governance is supported by a 3rd party tool or platform, e.g., GitHub;

  b) description - governance is described as part of the catalogue;

  c) rules - rules for proposing new semantic artefacts to the catalogue are specified.

Community (Co)

The mechanisms in place to involve the community in the catalogue, identifying and meeting target users’ expectations and attracting stakeholders with diverse experiences and viewpoints. The four features identified in this dimension are:

  a) read only - no direct involvement is possible; users can only communicate with the catalogue via a read-only user interface, API or other mechanisms such as email or a feedback form;

  b) read and write - curators and developers can use services to get information from the catalogue and be directly involved through the creation of their own records, to be added to the catalogue to increase their visibility;

  c) 3rd party - community features are delegated to a 3rd party tool or platform, e.g. GitHub issues;

  d) suggestion - a dedicated page for content suggestions (e.g. term changes and term proposals) is available.

Sustainability (Su)

The models in place to sustain the services financially and preserve the catalogue in the long run. The four features identified in this dimension are:

  a) organization - the catalogue is a service provided by an organization (a university, an institute or one of its research units);

  b) community - a community with members from various organizations or infrastructures maintains the catalogue;

  c) management board - a multidisciplinary community-driven service strongly sustained by an operational team;

  d) (research) project - sustained by funds coming from one or more time-limited projects.

Technology (Te)

The tools that the catalogue provides to offer users a better experience in exploring the data, such as REST APIs, web search interfaces and SPARQL endpoints. The four features identified in this dimension are listed below (a minimal access sketch is provided at the end of this subsection):

  a) REST API - a service to access semantic artefact information and/or metadata via a REST web service application programming interface;

  b) web search GUI - a service to access semantic artefact information and/or metadata via a web search or a graphical user interface;

  c) SPARQL endpoint - a service to access semantic artefact information and/or metadata via a SPARQL endpoint;

  d) alignment - a service that can be used within a catalogue to align (map) semantic artefacts and/or some of their parts (e.g. concepts, attributes, properties) to one another, or to handle existing alignments/mappings without generating them.

The identified features represent the aspects that can be directly assessed; several other technical aspects of the backend solutions are generally not disclosed by the catalogue owners.
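To make the first and third access modes concrete, the following minimal sketch (Python; both endpoint URLs, the API path and the query parameter are hypothetical, since every catalogue exposes its own interface) queries a catalogue once through a REST API and once through a SPARQL endpoint:

```python
import requests
from SPARQLWrapper import SPARQLWrapper, JSON

# REST API access (hypothetical endpoint, path and parameters).
resp = requests.get("https://catalogue.example.org/api/artefacts",
                    params={"q": "soil"},
                    headers={"Accept": "application/json"},
                    timeout=10)
for artefact in resp.json():  # assuming the API returns a JSON list
    print(artefact)

# SPARQL endpoint access (hypothetical endpoint; generic metadata query).
sparql = SPARQLWrapper("https://catalogue.example.org/sparql")
sparql.setQuery("""
    PREFIX dcterms: <http://purl.org/dc/terms/>
    SELECT ?artefact ?title
    WHERE { ?artefact dcterms:title ?title }
    LIMIT 10
""")
sparql.setReturnFormat(JSON)
for row in sparql.query().convert()["results"]["bindings"]:
    print(row["artefact"]["value"], row["title"]["value"])
```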

Transparency (Tr)

The processes behind the governance of the catalogue, from the election of new members of the various governing boards, curators, etc., to the clarity in exposing the fees for the services offered by the catalogue and its revenue model. The three features identified in this dimension are:

  a) documented curation - the data flow of curation is documented;

  b) automatic curation - the curation process happens automatically, based on a documented process flow;

  c) resource versioning - records of previous versions of items are available.

Assessment (As)

The presence of practices for assessing the catalogue against all these dimensions or additional ones, e.g. by adopting self-assessment exercises and/or by asking third parties to run an independent assessment of the catalogue. The two features identified in this dimension are:

  a) shared metrics - an assessment in terms of FAIRness is provided;

  b) custom metrics - an assessment against the catalogue’s own metrics is provided.

Catalogues assessment

The 26 selected catalogues of semantic artefacts, together with the result of our assessment against the dimensions and related features described in the previous section, are reported in Table 2. In addition to this assessment, we also tracked the catalogue type. Specifically, we identified the catalogues containing both data and metadata, which we classified as repositories according to the classification from Jonquet12. These catalogues are marked with an asterisk (*) next to their names in the header of the table.

Table 2 An overview of the dimensions and related features identified, accompanied by an assessment of 26 selected catalogues of semantic artefacts against such dimensions and features.

In addition, it is worth mentioning that the catalogues listed in Table 2 are not all the same type of system. Indeed, some are ontology lookup services/repositories (e.g. AgroPortal and the EBI Ontology Lookup Service), while others are wider registries of standards/databases/policies (e.g., FAIRsharing) that include, but are not limited to, ontologies and other semantic artefacts. As such, their original intended scope and functionalities inevitably differ. However, according to the definition provided in this work, they all fall within the concept of catalogues of semantic artefacts.

The raw data of Table 2 are available online37.
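For readers wishing to re-analyse the raw data, the following minimal sketch (Python with pandas) shows how compliance percentages like those discussed in the next section could be recomputed. It assumes a hypothetical CSV export with one row per catalogue and one 0/1 column per feature named "<Dimension>:<feature>"; the layout of the actual published data may differ:

```python
import pandas as pd

# Hypothetical export of Table 2: rows = catalogues, columns = features,
# cells = 1 if the catalogue exhibits the feature, 0 otherwise.
df = pd.read_csv("table2_assessment.csv", index_col="catalogue")

# Share of catalogues implementing each individual feature.
feature_coverage = df.mean().mul(100).round(1)
print(feature_coverage.sort_values(ascending=False))

# Share of catalogues covering each dimension (at least one feature present).
dims = df.columns.str.split(":").str[0]
dim_coverage = df.T.groupby(dims).any().T.mean().mul(100).round(1)
print(dim_coverage)
```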

Discussion

This section contains an analysis of the maturity dimensions based on the assessment results outlined in Table 2. Maturity dimensions are examined in the order they appear in the table to summarize the current status of implementation regarding each category.

Metadata (Me)

Adopting a standard vocabulary for metadata is an essential aspect of joining, comparing, and curating semantic artefacts; in addition, it fosters the interoperability of these items. Yet, only 19% (5 out of 26) of the catalogues use standard vocabularies. Further analysis is needed to confirm that the other catalogues really do not use standard vocabularies; indeed, they may use custom vocabularies built by extending standard ones.

Openness (Op) and Quality (Qu)

All the catalogues maintained by a community with members from different organizations/infrastructures (5 out of 26) are based on open-source tools and provide no authentication methods/restrictions to their contents. Furthermore, the catalogues that enable external/registered users to add/propose new semantic artefacts for inclusion (65%) delegate the quality control of these changes to the maintainers/curators of the artefacts. Overall, more than half of the analysed catalogues (65%) permit the curation of their data either by the owners of the catalogue or by the maintainers of the semantic artefacts; no catalogue provides both strategies. This is a positive sign, showing that most of the catalogues are concerned with guaranteeing a good quality of their data.

Availability (Av) and PID (Pi)

All catalogues, with only one exception, provide access with no restriction. However, 8 (i.e., around 31%) include functionalities, mostly APIs, accessible only to registered users. Only a tiny percentage of catalogues (i.e., 11.5%) use PIDs for identifying resources, while a more significant number (i.e., 30.8%) provide PIDs within the metadata of a given record; these are typically used to represent the creator, the contributor, or more specific aspects, e.g., the URIs of imported ontologies. In only one case does the catalogue provide PIDs for both the identification of resources and their metadata; most of the time, the catalogues do not assume the role of assigning and managing PIDs for their resources.

Statistics (St)

Half of the catalogues provide resource statistics, typically including the number of classes, properties and axioms included in their semantic artefacts. Most of these (around 35% of the total) also provide general statistics that aggregate information across all resources. In a few cases, social statistics are enabled by the catalogue or imported from another source, e.g., in the form of metrics taken from the GitHub repositories where the original resources are stored (e.g., number of received stars and contributors).

Governance (Go) and Community (Co)

Governance processes and/or structures are described by over half of the catalogues, 73% of which explicitly provide rules for contributors willing to propose new resources. In 7 cases, external 3rd-party solutions, particularly GitHub, are used as resource management tools. Some catalogues are more open to the community’s contribution than others: while almost one-third only allow communication with the catalogue through read-only APIs, just 8 out of 26 also support resource creation.

Sustainability (Su)

In 2 cases, the sustainability could not be assessed. In the remaining catalogues, the sustainability seems sufficiently stable since they are maintained by at least one organization, community, or management board. Furthermore, six of them are also supported by research projects.

Technology (Te)

Most catalogues (92%) provide at least a web search GUI. The catalogues without a SPARQL endpoint represent 61% (16) of the total; around 88% of these offer a web search GUI, and 56% combine it with REST APIs. Consequently, only 10 out of 26 provide a SPARQL endpoint, and 70% of these provide machine-readable metadata. Overall, the preferred technology among the catalogues offering machine-readable metadata is either REST APIs, a web search GUI, or both. Many catalogues (30%) incorporate all three technologies, i.e. REST API, web search GUI, and SPARQL endpoint, especially the catalogues based on the OntoPortal software22. These catalogues make their contents freely available, without authentication or restrictions, and for most of them external/registered users can add/propose new semantic artefacts for inclusion.

Transparency (Tr)

Only 10 catalogues (i.e., around 38%) explicitly document a workflow for data curation, which in two cases is mainly automated; 9 of these are open for modifications to be made to the semantic artefacts by external/registered users. In 11 cases, the catalogue also provides previous resource versions, enabling a versioning system useful for backward compatibility and documentation.

Assessment (As)

Most catalogues do not provide information on (self-)assessment against quality criteria. Among the few exceptions, AgroPortal9 includes an assessment in terms of a FAIR score, an evaluation of the satisfaction of each aspect of the FAIR principles. This FAIRness assessment methodology38 has recently been transposed to 6 other OntoPortal-based catalogues, of which only EcoPortal is included in our study. The FAIR score includes many questions specific to ontologies and semantic resources, and can be computed for each resource and for the whole catalogue. On the other hand, Archivo28 proposes a rating based on a set of automatically assessed criteria (whether the ontology is retrievable and parsed correctly, is provided with a clear and proper license statement, and is logically consistent).

Final remarks

The analysis showed the current state of the identified catalogues, and it can also serve as guidance for shaping future developments. The maturity model we created is a first attempt toward understanding catalogues of semantic artefacts, and the catalogue assessment we performed was a preliminary (and, we believe, successful) attempt to gauge the effectiveness of the maturity model.

The analysis will be complemented in the future with the ongoing work on minimum metadata sets and interoperability indicators currently carried out within the EOSC Task Force on Semantic Interoperability. By combining these efforts, the aim is to provide recommendations for governance and processes for preserving and maintaining semantic artefacts. This will involve identifying gaps and improvement areas in semantic interoperability approaches and developing strategies to address these challenges.

In the future, we aim to interlink aspects of the dimensions identified within the maturity model presented in this paper with recommendations of other EOSC Task Forces. For instance, the EOSC Task Force on FAIR Metrics and Data Quality has produced guidelines39,40,41 that may be used in the context of the Quality maturity dimension to provide even more in-depth specifications for measuring the FAIRness of semantic artefact catalogues. Similarly, the work under development in the EOSC Task Force on PID Policy and Implementation (https://www.eosc.eu/advisory-groups/pid-policy-implementation) may provide additional insights related to the PID maturity dimension highlighted in this article.

Another aspect that would deserve further study concerns the relation of our dimensions with those described in other maturity models available in the literature, such as some of those we used for defining the dimensions. Indeed, it would be essential to make maturity models semantically interoperable since it may also facilitate further analysis of the systems they measure. We will leave this investigation for future developments.

The maturity dimensions, the related features, and the list of assessed catalogues are open for extensions and refinements in the future, especially in light of feedback from the community. For instance, a few people suggested, via their reviews (https://prereview.org/preprints/doi-10.48550-arxiv.2305.06746), further perspectives to include in the maturity model, such as metrics measuring the ease of finding specific contextual information in a catalogue, the size of the community using it, its Technology Readiness Level (TRL), its time of existence, and the specification of the types of semantic artefacts handled by a catalogue, with appropriate dimensional features associated to each type. Others have already suggested including additional catalogues in the assessment. While these are all valid suggestions, further technical analyses are necessary for a future refinement/extension of the maturity model. Although there are currently no particular plans within the Task Force for adding more technical/operational features, the availability on Zenodo37,42 of the data and resources produced in this work will enable the community to create further versions of them if needed, even outside the framework of the EOSC Task Forces.

We believe that having catalogues of semantic artefacts compliant with the dimensions and features identified in our maturity model is a crucial step towards the broad adoption and reuse in research, by all the involved users (researchers, data stewards, etc.), of the semantic artefacts they provide. Indeed, without sound and well-governed catalogues, we cannot ensure the persistence, availability and maintenance of semantic artefacts in the long term; and, without guaranteeing these activities, it is challenging to build users’ trust in such artefacts, which is crucial to enable and push their broad adoption and reuse in the EOSC and, more generally, in research. However, we recognise a perceived need by the community to extend, in the future, the effort of the EOSC Task Force on Semantic Interoperability to focus on: (a) understanding how semantic artefacts should be used productively in and for research; (b) understanding how data stewards can efficiently and effectively use these artefacts in metadata specification and metadata curation workflows; and (c) determining how to bring data producers, publishers and researchers to adopt them.

Methods

This section presents the methodology followed to identify the dimensions and related features of the maturity model and analyse the catalogues of semantic artefacts. The process was divided into five steps, described in the following subsections.

Analysis of existing literature

We involved all the members of the EOSC Task Force on Semantic Interoperability, which included several experts on the topic with very heterogeneous backgrounds, to provide us with relevant material related to at least one of the two aspects of interest for our study, i.e., (1) definitions of catalogues of semantic artefacts and (2) dimensions that can be used to measure the maturity of such catalogues.

After gathering all such relevant documents, we asked the members of the Task Force to read them and highlight any passage in the text referring either to definitions related to catalogues of semantic artefacts or to maturity measures. We also asked that the reading and analysis of each document be performed, when possible, by a member who did not propose it, to avoid possible biases.

We found that 15 of the gathered documents contained relevant texts, as shown in Table 1. The raw data of this overview are available online37. The authors analysed all the highlighted quotations to gather maturity dimensions. In particular, the names of the dimensions were compiled from the analysed literature, thus reusing terminology already shared within the scientific community, albeit heterogeneously distributed across different documents. The 12 extracted dimensions, which we used as a starting point for identifying additional features characterising them, are summarised in Table 1.

Collection of catalogues

A preliminary search was conducted to collect potential catalogues of semantic artefacts, identified through the direct knowledge of the co-authors and of the other members of the Task Force.

The authors then screened the resulting list of potential catalogues to remove duplicates and those irrelevant to the study. In particular, we decided to keep only those catalogues that mainly refer to semantic artefacts. This exclusion criterion was applied to filter out (i) generic repositories that may also contain semantic artefacts, even though these are not the primary resource types they handle (e.g., Zenodo), and (ii) general-purpose search engines (e.g., Google). The resulting set included 26 selected catalogues. With this list, our goal was not to be exhaustive but to cover multiple application domains well and to obtain a good representation of the underlying technologies used to build the catalogues (e.g., OntoPortal, OLS, SKOSMOS).

Setup of catalogues assessment

The identified catalogues were evaluated based on their relevance to semantic artefacts. A spreadsheet was created with the selected catalogues listed on rows and the 12 maturity dimensions on columns. Additional columns were dedicated to the authors’ names (acting as reviewers) and comments. Each reviewer was assigned some catalogues to review.

Twelve separate tabs in the spreadsheet described the possible features characterising each maturity dimension. These tabs contained the name of the dimension and a set of particular features, each with a number, a short name, a description, the name of the reviewer that proposed it, and whether the group had validated it. In addition, there was a column for possible comments.

The main table in the first tab was extended with two additional columns. The first column allowed us to specify whether the catalogue under analysis stores a copy of the semantic artefacts it describes. The second column referred to the software or tool used to implement the catalogue to distinguish if the software adopted was generic (i.e., can be used to deploy multiple catalogues) and open-source.

Analysis

Each reviewer evaluated a first small set of 2-3 catalogues against the 12 dimensions identified before, which we considered fixed. For each catalogue and each dimension, the reviewer had to select which features of the dimension applied to the catalogue. If, for a given dimension, a particular feature was not already present, the reviewer could add it, providing a name and a description and making it available to the following reviewers. Several features (from 3 to 7) were added for each dimension during this step, for a total of 63 potential features.

After the first analysis, early results and issues were discussed by all reviewers. The analysis was then extended by assessing other potential catalogues, i.e., an additional five for each reviewer, to gain a clear view of aspects that did not arise from the initial analysis. In addition, we decided to invite other experts (external to the Task Force) to join the study only afterwards, ideally once all the potential catalogues had been assessed; the perceived risk was being biased by a particular point of view while still in a preliminary phase of the analysis.

Harmonization and summarization

After completing the catalogue descriptions, a final review and summarization were conducted. Each reviewer was tasked with analyzing 2 to 4 dimensions, reviewing the corresponding features provided by other reviewers, and suggesting potential edits such as merging similar features, removing irrelevant ones, or splitting them into separate aspects. As a result, 20 of the 63 identified features were removed from the analysis or merged with others, leaving 43 final features. The main table was updated accordingly to reflect any changes made by the reviewers.

A final meeting was held to review and harmonize the catalogue dimensions and features to ensure a consistent set of features that could be used to compare and analyze the different catalogues. During this meeting, any remaining inconsistencies or ambiguities in the features were discussed, and final decisions were made on harmonising them across all catalogues.