Submitted:
26 July 2024
Posted:
29 July 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Principles and Recommendations for Data Interoperability and Management
1.1.1. Findable Accessible Interoperable and Reusable (FAIR) Principles
1.1.2. The European Interoperability Framework
1.1.3. GEO Data Sharing and Data Management Principles
- data, metadata and products will be shared as Open Data by default, by making them available as part of the GEOSS Data Collection of Open Resources for Everyone (Data-CORE) without charge or restrictions on reuse, subject to the conditions of registration and attribution when the data are reused;
- where international instruments, national policies or legislation preclude the sharing of data as Open Data, data should be made available with minimal restrictions on use and at no more than the cost of reproduction and distribution;
- all shared data, products and metadata will be made available with minimum time delay.
1.2. Established Interoperability and Standardisation Organisations and Initiatives
1.3. Software Interoperability Standards
- Enterprise - Business requirements of the system (purpose, scope policies);
- Information - Information managed by the system and the structure and content type of the supporting data (semantics and information processing);
- Computational - Functionality provided by the system and its functional decomposition (objects which interact at interfaces);
- Engineering - distribution of processing performed by the system to manage the information and provide the functionality (mechanisms and functions required to support distributed interactions between objects in the system);
- Technology - technologies chosen to provide the processing, functionality and presentation of information.
1.4. Data Spaces Concept
- New services relying on enhanced transparency and data sovereignty;
- Level playing field for data sharing and exchange;
- A new user behaviour and digital culture, including a higher awareness of digital related ethics. Users will likely become more mindful about treating their data as an asset;
- Availability of large pools of data;
- Infrastructure to use and exchange data;
- Appropriate governance mechanisms.
1.4.1. Data Spaces beyond Technical Features
1.4.2. Main Projects and Initiatives for Data Spaces
- BDVA: knowledge and general understanding for data usage
- FIWARE Foundation: components for digital twin data exchange, decentralized Identity and Access Management relying on existing trust frameworks, and data services publication/trading
- Gaia-X: Global cross-dataspace governance based on European values
- IDSA: Data Space Connectors, Usage Contract Negotiation, and general creation of a data space
- Low Entry Barrier: for both data providers and data consumers.
- System of Systems: the GDDS is designed as a System of Systems (SoS) to interconnect many independent, autonomous systems, frequently of large dimensions, to satisfy a global goal (i.e., the GDDS Digital Ecosystem service) while keeping them autonomous.
- Standardization and Mediation: the GDDS will rely on interoperability standards, developed at community level, complementing them with mediation/brokering to enable cross-domain interoperability.
- Data as entry point: the GDDS focuses on the sharing and use of data, independently of how the data is generated (e.g., off-line, on-the-fly, etc.).
- Loose-coupling: the GDDS Digital Ecosystem is enabled by a set of APIs which can be used by data consumers to leverage (and enrich) the GDDS resources.
- Interoperability/Security Orthogonality: the GDDS security architecture is orthogonal to the GDDS interoperability architecture, so that the two issues can be tackled independently.
- Enterprise (purpose, scope and policies governing the activities of the specified system within the organization of which it is a part);
- Information (concerned with the kinds of information handled by the system and constraints on the use and interpretation of that information);
- Computational (functional decomposition of the system into a set of objects that interact at interfaces - enabling system distribution);
- Engineering (infrastructure required to support system distribution)
- Technology (technology to support system distribution).
- authentication (process of verifying a claim that a system entity or system resource has a certain attribute value);
- access control (protection of system resources against unauthorized access);
- confidentiality (data is not disclosed to system entities unless they have been authorized to know the data);
- integrity (data has not been changed, destroyed, or lost in an unauthorized or accidental manner);
- non-repudiation (protection against false denial of involvement in an association).
- Anchored to specific use cases: Simpl will ensure that data sets and their infrastructure can be seamlessly interconnected and made interoperable.
- Smart and modular: Simpl will allow the replacement or addition of components without affecting the rest of the system.
- Open source: Simpl will be publicly accessible, and allow anyone to see, modify and distribute it.
- Green, scalable and agile: Simpl will allow the monitoring of its environmental performance, and the addition of new users without affecting performance.
- Secure and interoperable: Simpl will ensure trust, confidence and compliance with regulations are built into the system. This implies an effortless sharing of resources between participants, regardless of their data processing environment. It creates an abstraction layer that enables data to flow across multiple providers and Member States.
- MIM1 – Context – Context Information Management;
- MIM2 – Data Modules – Shared Data Models;
- MIM3 – Contracts – Ecosystem Transaction Management;
- MIM4 – Trust – Personal Data Management;
- MIM5 – Transparency – Fair Artificial Intelligence;
- MIM7 – Places – Geospatial Information Management;
- MIM6 – Security – Security Management;
- MIM8 – Indicators – Ecosystem Indicator Management;
- MIM9 – Analytics – Data Analytics Management;
- MIM10 – Resources – Resource Impact Assessment.
2. Methodology
2.1. Finding a Consensus over the Revised Data Spaces Building Blocks
3. Results
3.1. A Revised Data Spaces Building Block Stack
3.1.1. Review and Comparison of Data Spaces-Related Initiatives and Blueprints
3.1.2. Identification of a Initial Baseline Suite of Building Blocks
3.1.3. Improved Mapping to Available Solutions through Increased Building Block Granularity

3.2. Building Blocks Integration Aligning to Agreed International Principles
-
Data Model used, which should be based on standards, typically specifying the used profile (i.e., the subset of a more comprehensive data model, if applicable) and possible extensions. using standardised documentation of them and machine-readable encodings to possibly support data validation as well. Data models and profiles should specify:
- -
- the semantic and structural description of the data,
- -
- the use of geometry and any related aspect, such as the kind of representation used (as solids, as surfaces, polygons, lines, raster), kind of geometry stored, any topology representation required, level of detail or resolution, accuracy, and so on.
- Data Exchange – Encodings, related to the syntax.
- Data description (metadata) (moved from the ‘Data value creation’ category after splitting from the related service).
- Provenance and Traceability extends the attribute ‘provenance’ or ‘lineage’ of typical "discovery" metadata allowing the visibility of underlying data supply chains critical to a suitable understanding of, and subsequent reusability of the data.
- Data Licences (moved from the ‘Data Sovereignty and Trust category’ after splitting from the related control service), indicating the conditions for use of the data and reusability of derived outputs.

3.2.1. Building Blocks Refinement with Input from Experts and Green Deal Data Spaces Projects

3.2.2. Rediscussed Building Blocks Validation

3.3. Available Standards for Data Spaces Building Blocks
3.4. Measuring Standards Quality and Uptake
| Relevance and Scope |
Fit for Purpose: Ensure that the standards align with the specific needs and objectives of the domain. Adoption within a domain is an obvious indicator of quality, but should be qualified by any activities to improve such standards based on experience of usage. Scope Coverage: Evaluate whether the standards cover all necessary areas without significant gaps. Overlapping standards should be assessed to see if they offer complementary benefits or if they introduce redundancy. Alignment: have alignments between related standards in use or proposed been published and available for re-use (noting that transformations of data are a significant overhead in most re-use scenarios, availability of tested transformation mechanisms make it easier to combine different standards in practice). |
| Flexibility and Extensibility |
Modularity: Building Block composition mechanisms must be explicit and supported by tooling to realise the principle of reuse. Such mechanisms must include explicit traceability of interoperability design, through transparent and standardised description of building block dependencies. Furthermore, such building blocks need to be composable into larger building blocks that use the same composition mechanisms, to allow the rich metadata required to evaluate and reuse resources to be assembled and understood. This has been a critical weakness of formal standardisation processes - leading to many alternative ad-hoc approaches to standardisation in application profiles, Adaptability: Standards should be flexible enough to accommodate future changes and extensions. Avoid overly rigid standards that may hinder innovation or adaptation to new requirements Special vs. General Solutions: Prefer standards that provide flexible, general solutions over those offering special case solutions, unless the special case is critical and cannot be addressed adequately by the general standard. |
| Interoperability and Integration |
Compatibility: Standards should work well together and integrate smoothly, minimizing the need for custom adapters or significant modifications. Interdependencies: Assess the interdependencies between standards. Strongly interdependent standards should be adopted together only if they provide a cohesive, integrated solution. Transparency: Machine readable declarations of compatibility and interdependencies allows for cost-effective and scalable testing and reuse of integrated suites of standards |
| Simplicity and Clarity |
Simplicity: Choose standards that support simplification through encapsulation – the ability to test and compose arbitrarily complex complete solutions from simple components Ease of Understanding: Standards should be clear, well-documented, and easy to understand. Complex standards can lead to misinterpretation and implementation errors. Examples: Standards should be supported by clear examples to allow practitioners to easily understand scope of standards. Examples should be tested to conform to standards, as discrepancies are common and cause significant confusion. |
| Support and Ecosystem |
Tool Support: Consider the availability of software tools and libraries that support the standards. Robust tool support can significantly ease implementation and maintenance. Community and Vendor Support: Evaluate the level of community and vendor support. Widely adopted standards with active communities and strong vendor backing are often more reliable and future-proof. |
| Maturity and Stability |
Proven Track Record: Mature standards that have been widely adopted and tested in various contexts are usually more reliable. Stability: Prefer standards that are stable and have a clear roadmap for updates and maintenance. Frequent changes can disrupt development and integration processes. |
| Compliance and Security |
Regulatory Compliance: Ensure that the standards comply with relevant regulations and industry best practices. Security: Assess the security implications of the standards. They should support the implementation of secure systems and not introduce vulnerabilities. |
| Cost and Resource Considerations |
Implementation Cost: Consider the cost of implementing and maintaining the standards, including licensing fees, if any. Resource Availability: Ensure that the necessary skills and resources are available for adopting and maintaining the standards. |
- Requirements Analysis: Define the specific needs and objectives the standards must meet.
- Standards Identification: Identify potential standards and gather detailed information about each.
- Evaluation and Comparison: Apply the principles in Table X to evaluate and compare the standards. A matrix will be proposed in future work to assess each standard against these criteria.
- Integration Assessment: Examine how the standards will work together, considering interdependencies and potential conflicts.
- Pilot Implementation: Conduct a pilot implementation to test the chosen standards in a real-world scenario.
- Decision and Adoption: Based on the evaluation and pilot results, make an informed decision and proceed with the adoption of the selected standards.
3.5. Discussion
3.6. Conclusion
Author Contributions
Funding
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Mapping of Available Standards to Data Spaces Building Blocks
Appendix A.1. Technical Building Blocks – Data Specification Enabling FAIRness
| Ref | Specification / implementation(s) recommended |
|---|---|
| W3C | SSN-SOSA |
| OGC | CityGML/CityJSON, LandInfra, IndoorGML, Indoor Mapping Data Format (IMDF), MUDDI, PipelineML, WaterML, Augmented Reality ML (ARML), SensorThings API data model, SWE common Data Model, SensorML, Semantic sensor Network (SSN), STAplus, Time Ontology in OWL, TimeseriesML, WaterML, GeoPose, Geoscience Markup Language (GeoSciML), Zarr (https://portal.ogc.org/files/100727), GroundwaterML, network Common Data Form (netCDF) standards suite, Observations, Measurements and Samples |
| GEO et al. | Essential Variables (https://www.earthdata.nasa.gov/learn/backgrounders/essential-variables#:~:text=Essential%20variables%20(EV)%20are%20variables,facet%20of%20the%20Earth%20system). Topics/domains: Climate; Ocean; Biodiversity; Geodiversity; Agriculture |
| INSPIRE | INSPIRE Themes and UML model (https://inspire.ec.europa.eu/Themes/Data-Specifications/2892) |
| OASC | MIM2 – Data Models - Smart data Models; NGSI-LD compliant data models for aspects of the smart city have been defined by organisations and projects, including OASC, FIWARE, GSMA and the SynchroniCity project and there is an ongoing joint activity of TM Forum and FIWARE to specify more. Existing data models and ontologies, e.g. the SAREF (Smart Applications REFerence ontology) standard by ETSI/oneM2M, can be mapped for use with NGSI-LD by identifying what are entities, properties and relationships, which can be managed and requested by the NGSI-LD API. oneM2M base ontology (that is compatible with SAREF). Additionally, oneM2M provides the means to instantiate ontologies as a means to provide semantic descriptions of the data exchanged (through the use of metadata). The extension SAREF4Cities provides an ontology focused on smart cities. Core vocabularies of ISA like Core Public Service Vocabulary Application Profile used as the basis for the Single Digital Gateway Regulation that touches local governments, Core Person, Core Organization etc. DTDL is the Digital Twin Definition Language developed by Microsoft. This language is based on top of JSON-LD and the existing FIWARE data models are converted in this format. MIM7 - Places |
| DSBA | SmartDataModels |
| IDSA | IDS RA – Functional Layer – Ecosystem of Data – Vocabularies; Information Layer - Hexagon of concerns |
| FAIR principles | EIF |
|---|---|
|
R1.3: |
Recommendation 4 (Openness): Give preference to open specifications, taking due account of the coverage of functional needs, maturity and market support and innovation. |
| Ref | Specification / implementation(s) recommended |
|---|---|
| ISO | SQL, JSON |
| W3C | RDF (RDF/XML, Turtle, JSON-LD), SPARQL, OWL |
| OGC | 3D Tiles, Cloud Optimised GeoTIFF, CoverageJSON, GML in JPEG2000, GeoPackage, GeoSPARQL, GML, GeoTiff, I3S, Hierarchical Data Format Version 5 (HDF5), KML, LAS, Moving Features, SWE Service Model Implementation Standard, Sensor Observation Service SOS, WKT CRS, Simple Features, OpenGeoSMS, GeoXACML |
| FAIR principles | DMP | EIF |
|---|---|---|
|
I1: |
DMP-3 (Usability). Data will be structured using encodings that are widely accepted in the target user community and aligned with organizational needs and observing methods, with preference given to non-proprietary international standards. | Recommendation 9 (Technological neutrality and data portability): Ensure data portability, namely that data is easily transferable between systems and applications supporting the implementation and evolution of European public services without unjustified restrictions, if legally possible. |
| Ref | Specification / implementation(s) recommended |
|---|---|
| ISO | ISO 19115, ISO 15836 Dublin core |
| W3C | DCAT |
| OGC | GeoDCAT – 3dP, EO Dataset Metadata GeoJSON(-LD) Encoding Standard (EO-GeoJSON) |
| INSPIRE | INSPIRE metadata (based on ISO19115, ISO19119 and ISO 15836 (Dublin Core) |
| OASC | MIM1 - Context, MIM7 - Places |
| IDSA | IDS Reference Architecture – Functional Layer – Ecosystem of Data – Data source description; Information Layer |
| Gaia-X | Federated catalogue – Self description |
| SIMPL | Self-description (ID SECAV-FUNC-002-FUNC-001) (https://futurium.ec.europa.eu/en/simpl/l2-detailed-requirement/attributes-self-description-dataset) |
| FAIR principles | DMP |
|---|---|
|
F2: Data are described with rich metadata R1: (Meta)data are richly described with a plurality of accurate and relevant attributes R1.3: (Meta)data meet domain-relevant community standards I1: (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation I2: (Meta)data use vocabularies that follow the FAIR principles I3: (Meta)data include qualified references to other (meta)data |
DMP-4 (Usability). Data will be comprehensively documented, including all elements necessary to access, use, understand, and process, preferably via formal structured metadata based on international or community-approved standards. To the extent possible, data will also be described in peer-reviewed publications referenced in the metadata record. |
| Q - quality: Data should be of sufficient quality for the user’s task (extension to the FAIR principles [7]). | DMP-6 (Usability). Data will be quality-controlled and the results of quality control shall be indicated in metadata; data made available in advance of quality control will be flagged in metadata as unchecked. |
|
F1: (Meta) data are assigned globally unique and persistent identifiers. F3: Metadata clearly and explicitly include the identifier of the data they describe |
DMP-10 (Curation). Data will be assigned appropriate persistent, resolvable identifiers to enable documents to cite the data on which they are based and to enable data providers to receive acknowledgement of use of their data. |
| Ref | Specification / implementation(s) recommended |
|---|---|
| ISO | ISO19131 on Data Product Specification, ISO/IEC25012 on Data Quality Model (https://iso25000.com/index.php/en/iso-25000-standards/iso-25012) |
| OGC | Schema used by the Data Exchange Toolkit [21]. It addresses a complementary part of ISO19131, to support data requirements definition and semantic validation. Other experiments are trying to address the issue starting from profiling the PROV vocabulary, originally intended to represent provenance information (https://github.com/ogcincubator/prov-cwl/tree/master) |
| Committee on Earth Observation Satellites (CEOS) (https://ceos.org/ard/) | Analysis Ready Data Framework, currently planned to be extended for being applied to geospatial data within OGC (https://www.ogc.org/press-release/ogc-forms-new-analysis-ready-data-standards-working-group/) |
| SIMPL | Quality dimension and quality rules (ID SEGOA-FUNC-012-FUNC-001) (https://futurium.ec.europa.eu/en/simpl/l2-detailed-requirement/quality-dimension-and-quality-rules-0) |
| Ref | Specification / implementation(s) recommended |
|---|---|
| ISO | ISO19115 geospatial lineage model |
| W3C | PROV-O (https://www.w3.org/TR/prov-o/) |
| OGC | OGC Provenance chains (https://ogcincubator.github.io/bblock-prov-schema/build/generateddocs/slate-build/ogc-utils/prov/index.html?json#examples), W3C PROV-O extension |
| OASC | MIM5 - Transparency |
| IDSA | Part of the information layer – hexagon of concerns |
| Others (reported by DS4SSCC) | ETSI-CIM; DACT-AP – property ‘provenance’ |
| FAIR principles | DMP |
|---|---|
| R1.2: (Meta)data are associated with detailed provenance | DMP-5 (Usability). Data will include provenance metadata indicating the origin and processing history of raw observations and derived products, to ensure full traceability of the product chain. |
Appendix A.2. Technical Building Blocks – Data Sovereignty and Trust
| Ref | Specification / implementation(s) recommended | |
|---|---|---|
| W3C | W3C Open Digital Rights Language (ODRL) (https://www.w3.org/TR/odrl-model/), W3C Verifiable Credentials. | |
| OGC | RAINBOW for licences; GeoXACML | |
| OASC | MIM3 - Contracts | |
| IDSA | RA – Functional layer – Security | Data Sovereignty – Usage Policies |
| Gaia-X | Identity and Trust – Federated Access; Sovereign Data Exchange – Policies and |
|
| Others (reported by DS4SSCC) | Standards: OASIS XACML (https://docs.oasis-open.org/xacml/3.0/xacml-3.0-core-spec-os-en.html) Policy Definition Language; Industry Body Specifications: Rego, Open Policy Agent, JSON-LD; Implementations: i4Trust, Prometheus-X | |
| Others | Creative Commons |
| FAIR principles | DMP |
|---|---|
| R1.1: (Meta)data are released with a clear and accessible data usage licence. | DMP-1b (Discoverability). [...] and data access and use conditions, including licences, will be clearly indicated. |
| Ref | Specification / implementation(s) recommended | |
|---|---|---|
| W3C | W3C Web Access Control (WAC) (https://www.w3.org/wiki/WebAccessControl) | |
| OGC | OGC APIs rely on the access control from the underlying OpenApi mechanisms (https://docs.ogc.org/is/19-072/19-072.html#rc_oas30-security), included in service model (STA+) | |
| OASC | MIM3 - Contracts | |
| IDSA | RA – Functional layer – Security | Data Sovereignty – |
| Gaia-X | Identity and Trust – Federated Access; Sovereign Data Exchange – |
|
| Others (reported by DS4SSCC) | Industry Body Specifications: Rego, Open Policy Agent, JSON-LD Implementations: i4Trust, Prometheus-X | |
| SIMPL | Federated Authentication (ID SEGOA-FUNC-001) (https://futurium.ec.europa.eu/en/simpl/l1-high-level-requirement/federated-authentication) |
| Ref | Specification / implementation(s) recommended |
|---|---|
| W3C | W3C Decentralized Identifiers (DID) (https://www.w3.org/TR/did-core/) |
| OASC | MIM4 - Trust, MIM6 - Security |
| IDSA | RA – functional layer – Trust – Identity Management + user certification; RA – Functional layer – Security & Data Sovereignty – Authentication & Authorisation; RA – Security perspective + Certification perspective |
| Gaia-X | Identity and Trust – Federated Identity Management; Sovereign Data Exchange – logging service; Compliance – Onboarding and certification |
| Others (reported by DS4SSCC) | Standards: LDAP OAUTH2 X.500 X.509; Industry body specifications: CEF eID, OpenID Connect; SAML 2.0; SOLID |
| FAIR principles |
|---|
| A1.2: The protocol [for metadata publication] allows for an authentication and authorisation procedure where necessary |
| Ref | Specification / implementation(s) recommended | |
|---|---|---|
| W3C | Verifiable Credentials (https://www.w3.org/TR/vc-data-model-2.0/) | |
| OGC | OGC Web Services Security OASC | MIM4 - Trust, MIM6 - Security |
| IDSA | RA – Functional layer – Security & Data Sovereignty – Trustworthy communication; + Security by design; + Technical certification; RA – Security perspective + Certification perspective | |
| Gaia-X | Identity and Trust – Trust Management; Compliance – Relation between Providers and consumers + Rights and obligations of participants | |
| Others (reported by DS4SSCC) | Standards: EUDI; Industry body specifications: EBSI; Reference implementations: European Blockchain, i4Trust |
| EIF |
|---|
| Recommendation 15 (Security and privacy): Recommendation 15 (Security and privacy): Define a common security and privacy framework and establish processes for public services to ensure secure and trustworthy data exchange between public administrations and in interactions with citizens and businesses. |
| Ref | Specification / implementation(s) recommended |
|---|---|
| OASC | MIM5 - Transparency |
| Others (reported by DS4SSCC) | ETSI-CIM; DACT-AP – property ‘provenance’ |
Appendix A.3. Technical Building Blocks – Data Value Enhancement
| Ref | Specification / implementation(s) recommended |
|---|---|
| W3C | ? |
| OGC | ? |
| OASC | ? |
| IDSA | ? |
| ... | ? |
| Ref | Specification / implementation(s) recommended |
|---|---|
| W3C | W3C APIs (https://api.w3.org/doc) |
| OGC | OGC APIs (https://ogcapi.ogc.org/), OGC Web Services. |
| OASC | MIM1 – context; MIM7 - Places |
| IDSA | Connectors |
| Others (reported by DS4SSCC) | NGSI-LD; LDES MQTT JSON-LD |
| Ref | Specification / implementation(s) recommended |
|---|---|
| W3C | ? |
| OGC | OGC Catalogue Service (https://www.ogc.org/standard/cat/), OGC API Records, Cat:ebRIM App Profile: Earth Observation Products (https://www.ogc.org/standard/cat2eoext4ebrim/) |
| OASC | MIM1 - Context, MIM3 - Contracts |
| IDSA | IDS Reference Architecture – Functional Layer – Ecosystem of Data – Brokering |
| Gaia-X | Federated Catalogue – Catalogue Management Functions |
| Others (reported by DS4SSCC) | ICT Innovation Network reference architecture, DCAT-AP, JSON-LD |
| SIMPL | Catalogues of Data/Applicaton/Infrastructure (ID SECAV-FUNC-001) (https://futurium.ec.europa.eu/en/simpl/l1-high-level-requirement/catalogues-dataapplicationinfrastructure) |
| FAIR principles | DMP | EIF |
|---|---|---|
| F4: (Meta)data are registered or indexed in a searchable resource | DMP-1a (Discoverability). Data and all associated metadata will be discoverable through catalogues and search engines | |
| A2: Metadata should be accessible even when the data is no longer available | ||
| A1: (Meta)data are retrievable by their identifier using a standardised communication protocol | ||
| A1.1: The protocol is open, free and universally implementable | ||
| DMP-2 (Accessibility). online services, including, at minimum, direct download but preferably user-customizable services for visualization and computation. | ||
| Recommendation 5 (Transparency): Ensure internal visibility and provide external interfaces for European public services. |
| Ref | Specification / implementation(s) recommended |
|---|---|
| W3C | ADMS |
| ISA/ISA2/SEMIC | ADMS-AP |
| OGC | Coordinate transformation Service, GeoAPI, LocationService (OpenLS), Open Model Interface (OpenMI), RAINBOW, Filter Encoding, Styled Layer Description, Symbology Encoding, Geospatial User Feedback (GUF) |
| OASC | MIM3 Basic Data Marketplace Enablers SynchroniCity_D2.4.pdf |
| IDSA | RA – functional layer – Data markets – Clearing and billing |
| SIMPL | UI and API for defining data quality rules (ID SEGOA-FUNC-012) (https://futurium.ec.europa.eu/en/simpl/l1-high-level-requirement/ui-and-api-defining-data-quality-rules), Data quality assessment (ID SEARE-FUNC-017) (https://futurium.ec.europa.eu/en/simpl/l1-high-level-requirement/data-quality-assessment) |
| EIF |
|---|
| Recommendation 6 (Reusability): Reuse and share solutions, and cooperate in the development of joint solutions when implementing European public services. |
Appendix A.4. Technical Building Blocks – Services FAIRness
| Ref | Specification / implementation(s) recommended |
|---|---|
| OGC | OGC API Processes, Web Processing Service, Web Coverage Processing Service |
| Gaia-X | Gaia-X Labels |
| SIMPL | Attributes of a self-description for an application (ID SECAV-FUNC-002-FUNC-001) (https://futurium.ec.europa.eu/en/simpl/l2-detailed-requirement/attributes-self-description-application) |
References
- Heath, T.; Bizer, C. Linked data: Evolving the web into a global data space (Vol. 1), 3rd ed.; Morgan & Claypool Publishers, 2011. [Google Scholar]
- Halevy, A.; Franklin, M.; Maier, D. Principles of dataspace systems. In Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 2006; pp. 1–9. [Google Scholar]
- Kasamani, S.B.; Lukandu, A.I.; Gregory, W. Modelling Dataspace Entity Association Using Set Theorems. Computer Technology and Application 2012, 3. [Google Scholar]
- Zuiderwijk, A.; Janssen, M.T. Open data policies, their implementation and impact: A framework for comparison. Government information quarterly 2014, 31, 17–29. [Google Scholar] [CrossRef]
- Braunschweig, K.; Eberius, J.; Thiele, M.; Lehner, W. The state of open data. Limits of current open data platforms 2012, 1, 72. [Google Scholar]
- Huijboom, N.; Van den Broek, T. Open data: an international comparison of strategies. European journal of ePractice 2011, 12, 4–16. [Google Scholar]
- Tandy, J.; van den Brink, L.; Barnaghi, P.; Homburg, T. Spatial Data on the Web Best Practices; OGC, W3C, 2023. Available online: https://www.w3.org/TR/2023/DNOTE-sdw-bp-20230919/ (accessed on 8 July 2024).
- Abhayaratna, J.; Daemen, E.; Janowicz, K.; Parsons, E.; Smith, R.; Verschoor, F. The Responsible Use of Spatial Data; OGC, W3C, 2021. Available online: https://www.w3.org/TR/responsible-use-spatial/ (accessed on 8 July 2024).
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Mons, B. The FAIR Guiding Principles for scientific data management and stewardship. Scientific data 2016, 3, 1–9. [Google Scholar] [CrossRef] [PubMed]
- EU. New European Interoperability Framework – Promoting seamless services and data flows for European public administrations; ISBN 978-92-79-63756-8. [CrossRef]
- Lin, D.; Crabtree, J.; Dillo, I.; Downs, R.R.; Edmunds, R.; Giaretta, D.; De Giusti, M.; L’Hours, H.; Hugo, W.; Jenking, R; Khodiyar, V.; Martone, M.E.; Mokrane, V.N.; Petters, J.; Sierman, B.; Sokolova, D.V.; Stockhause, M.; Westbrook, J. The TRUST Principles for digital repositories. Sci Data 2020, 7, 144. [Google Scholar] [CrossRef] [PubMed]
- Otto, B.; Steinbuß, S.; Teuscher, A.; Lohmann, S.; et al. Reference architecture model; Version 3; International Data Spaces Association, 2019; Available online: https://internationaldataspaces.org/wp-content/uploads/IDS-Reference-Architecture-Model-3.0-2019.pdf (accessed on 8 July 2024).
- Grothe, M. Exploring data space initiatives; Geonovum, 2023. Available online: https://www.geonovum.nl/uploads/documents/Exploring%20data%20space%20initiatives%20v0.52_EN%20publication%20version%20Geonovum.pdf (accessed on 8 July 2024).
- Ulrich, A.; et al. Design principles for Data Spaces, Version 1.0.; OpenDEI, International Data Spaces Association, 2021. (accessed on 8 July 2024). [CrossRef]
- Farrell, E.; Minghini, M.; Kotsev, A.; Soler-Garrido, J.; Tapsall, B.; Micheli, M.; Posada, M.; Signorelli, S.; Tartaro, A.; Bernal, J.; Vespe, M.; Di Leo, M.; Carballa-Smichowski, B.; Smith, R.; Schade, S.; Pogorzelska, K.; Gabrielli, L.; De Marchi, D. European Data Spaces: Scientific insights into data sharing and utilisation at scale, 3rd ed.; Publications Office of the European Union: Luxembourg, 2023; JRC129900. [Google Scholar] [CrossRef]
- Gaia-X. Gaia-X - Architecture document. 2022. Available online: https://gaia-x.eu/wp-content/uploads/2022/06/Gaia-x-Architecture-Document-22.04-Release.pdf (accessed on 8 July 2024).
- Gronlier, P.; Hierro, J.; Steinbuss, S. Technical convergence – Discussion Document; Data Spaces Business Alliance, 2023. Available online: https://data-spaces-business-alliance.eu/wp-content/uploads/dlm_uploads/Data-Spaces-Business-Alliance-Technical-Convergence-V2.pdf (accessed on 8 July 2024).
- DSSC. Blueprint, Version 0.5; 2023. Available online: https://dssc.eu/space/BPE/179175433/Data+Spaces+Blueprint+%7C+Version+0.5+%7C+September+2023 (accessed on 8 July 2024).
- Santoro, M.; Mazzetti., P. GREAT D3.2 Final Blueprint of the GDDS Reference Architecture. 2024. Available online: https://www.greatproject.eu/wp-content/uploads/2024/04/D3.2-Final-Blueprint-of-the-GDDS-Reference-Architecture.pdf (accessed on 8 July 2024).
- DSSC. Blueprint, Version 1.0; 2024. Available online: https://dssc.eu/space/BVE/357073006/Data+Spaces+Blueprint+v1.0 (accessed on 8 July 2024).
- Noardo, F.; Atkinson, R.; Simonis, I.; Villar, A.; Zaborowski, P. OGC Data Exchange Toolkit: Interoperable and Reusable 3D Data at the End of the OGC Rainbow. In Recent Advances in 3D Geoinformation Science - Proceedings of the 3D Geoinfo Conference; Kolbe, T.H., Donaubauer, A., Beil, C., Eds.; Springer Nature Switzerland, 2024; pp. 761–779. [Google Scholar]
- Chignard, S. A brief history of open Data. ParisTech Review. 2013. Available online: http://www.paristechreview.com/2013/03/29/brief-history-open-data/ (accessed on 8 July 2024).
- Directive 2007/2/EC of the European Parliament and of the council of 14 March 2007 establishing an Infrastructure for Spatial Information in the European Community (INSPIRE). Available online: https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://knowledge-base.inspire.ec.europa.eu/publications/directive-20072ec-european-parliament-and-council-14-march-2007-establishing-infrastructure-spatial_en&ved=2ahUKEwivsMPsuv6FAxWxg_0HHU7TBKMQjBB6BAgKEAE&usg=AOvVaw3qAi9koTOasyha4tbMdIUd (accessed on 8 July 2024).
- Directive 2003/98/EC of the European Parliament and of the Council of 17 November 2003 on the re-use of public sector information. Available online: https://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2003:345:0090:0096:en:PDF (accessed on 8 July 2024).
- WIS Timelines. Available online: https://community.wmo.int/en/activity-areas/wis/wis-timelines (accessed on 8 July 2024).
- World Wide Web Consotium - W3C. Available online: https://www.w3.org (accessed on 8 May 2024).
- Open Geospatial Consortium - OGC. Available online: https://en.wikipedia.org/wiki/Open_Geospatial_Consortium; https://www.ogc.org (accessed on 8 May 2024).
- Group on Earth Observations. Available online: https://earthobservations.org; https://en.wikipedia.org/wiki/Group_on_Earth_Observations#:~:text=GEO%20was%20established%20formally%20in,Observation%20Summit%20in%20Washington%2C%20DC (accessed on 8 July 2024).
- International Data Spaces Association - IDSA. Available online: https://internationaldataspaces.org (accessed on 9 May 2024).
- USAGE Deliverable 3.2 - Data space prototype and report - first version. Available online: https://drive.google.com/file/d/1FyvNkWkAWkuKKHh-3l529VUu5LIKyu5E/view?usp=drive_link (accessed on 8 July 2024).
- Urban Data Space for Green deal - USAGE project. funded within the HORIZON Europe programme (GA. 101059950). Available online: https://www.usage-project.eu (accessed on 9 May 2024).
- Common European Data Spaces. Available online: https://digital-strategy.ec.europa.eu/en/policies/data-spaces (accessed on 8 July 2024).
- International Standardisation Organisation - ISO. Available online: https://www.iso.org/home.html (accessed on 8 July 2024).
- W3C-OGC Spatial Data on the Web Working Group Charter. Available online: https://www.w3.org/2021/10/sdw-charter.html (accessed on 8 July 2024).
- Global Earth Observation System of Systems - GEOSS. Available online: https://old.earthobservations.org/geoss.php (accessed on 8 July 2024).
- FAIR principles. Available online: https://www.go-fair.org/fair-principles/ (accessed on 8 July 2024).
- GEOSS Data Management Principles. Available online: https://old.earthobservations.org/documents/dswg/201504_data_management_principles_long_final.pdf (accessed on 8 July 2024).
- CARE principles. Available online: https://en.wikipedia.org/wiki/CARE_Principles_for_Indigenous_Data_Governance (accessed on 8 July 2024).
- ISO 25012. Available online: https://iso25000.com/index.php/en/iso-25000-standards/iso-25012 (accessed on 8 July 2024).
- ISO 25010. Available online: https://www.iso25000.com/index.php/en/iso-25000-standards/iso-25010 (accessed on 8 July 2024).
- OSI model. Available online: https://en.wikipedia.org/wiki/OSI_model (accessed on 8 July 2024).
- DSSC Glossary. Available online: https://dssc.eu/space/Glossary/55443460/DSSC+Glossary+%7C+Version+1.0+%7C+March+2023 (accessed on 8 July 2024).
- DSSC starter Kit. Available online: https://dssc.eu/space/SK/29523973/Starter+Kit+for+Data+Space+Designers+%7C+Version+1.0+%7C+March+2023 (accessed on 8 July 2024).
- OpenDEI project. Available online: https://www.opendei.eu (accessed on 8 July 2024).
- Support Centre for Data Sharing deliverables. Available online: https://dssc.eu/space/DC/408059908/Support+Centre+for+Data+Sharing (accessed on 8 July 2024).
- Data Sharing Canvas - A stepping stone towards cross-domain data sharing at scale. Available online: https://coe-dsc.nl/wp-content/uploads/2024/02/data-sharing-canvas.pdf (accessed on 8 July 2024).
- Gaia-X. Available online: https://gaia-x.eu/ (accessed on 8 July 2024).
- FIWARE. Available online: https://www.fiware.org/about-us/ (accessed on 8 July 2024).
- Big Data Value Association - BDVA. Available online: https://www.bdva.eu (accessed on 8 July 2024).
- DSSC Endorsement. Available online: https://dssc.eu/page/Endorsements (accessed on 8 July 2024).
- Data Spaces Support Centre - DSSC. Available online: https://dssc.eu. https://www.egi.eu/project/dssc/. https://dssc.eu/space/DDP/117211137/DSSC+Delivery+Plan+-+Summary+of+assets+publication (accessed on 8 July 2024).
- Data Space for Smart and Sustainable Cities and Communities - DS4SSCC. Available online: https://inventory.ds4sscc.eu (accessed on 8 July 2024).
- GREAT project. Available online: https://www.greatproject.eu. https://www.epos-eu.org/great (accessed on 8 July 2024).
- GREAT technical blueprint. Available online: https://www.greatproject.eu/wp-content/uploads/2023/10/D3.1-Initial-Blueprint-of-the-GDDS-Reference-Architecture_web.pdf (accessed on 8 July 2024).
- A European Strategy for Data. Available online: https://digital-strategy.ec.europa.eu/en/policies/strategy-data (accessed on 8 July 2024).
- Simpl. Available online: https://digital-strategy.ec.europa.eu/en/policies/simpl (accessed on 8 July 2024).
- Simpl requirements. Available online: https://futurium.ec.europa.eu/en/simpl/pages/about (accessed on 8 July 2024).
- Open and Agile Smart Cities - OASC. Available online: https://oascities.org (accessed on 8 July 2024).
- DSSC data Sovereignty and Trust. Available online: https://dssc.eu/space/BVE/357075333/Data+Sovereignt y+and+Trust (accessed on 8 July 2024).
- DSSC Data Exchange. Available online: https://dssc.eu/space/BVE/357075193/Data+Exchange (accessed on 8 July 2024).
- DSSC Access and Usage Policies Enforcement. Available online: https://dssc.eu/space/BVE/357075567/Access+%26+Usage+Policies+Enforcement (accessed on 8 July 2024).
- DSSC Identity and Attestation Management. Available online: https://dssc.eu/space/BVE/357075352/Identity+and+Attestation+Management (accessed on 8 July 2024).
- DSSC Data Services and Offerings Descriptions. Available online: https://dssc.eu/space/BVE/357075789/Data%2C+Services+and+Offerings+Descriptions (accessed on 8 July 2024).
- DSSC Publication and Discovery. Available online: https://dssc.eu/space/BVE/357076320/Publication+and+Discovery (accessed on 8 July 2024).
- DSSC Value Added Services. Available online: https://dssc.eu/space/BVE/357076468/Value-Added+Services (accessed on 8 July 2024).
- OGC Location building Blocks. Available online: https://blocks.ogc.org (accessed on 8 July 2024).
- Common Assessment Methods Standards and Specifications - CAMSS. Available online: https://joinup.ec.europa.eu/collection/common-assessment-method-standards-and-specifications-camss (accessed on 8 July 2024).
- The Open Data Institute - ODI. Available online: https://theodi.org/about-the-odi/our-vision/ (accessed on 8 July 2024).
- ODI - How to choose an Open Standard. Available online: https://docs.google.com/document/d/1E5uARrZf5AJUIF_DJz-42_793EY_Dwk7n7B3bMn3x5A/edit#heading=h.xbuzggui7nk0; https://standards.theodi.org/find-existing-standards/how-to-choose-an-open-standard/ (accessed on 8 July 2024).
- Reference Model of Open Distributed Processing (RM-ODP). Available online: https://en.wikipedia.org/wiki/RM-ODP (accessed on 19 July 2024).
- ISO/IEC10746 Information technology - Open Distributed Processing (RM-ODP). Available online: https://committee.iso.org/sites/jtc1sc7/home/projects/flagship-standards/isoiec-10746.html (accessed on 19 July 2024).



| Findable |
|
| Accessible |
|
| Interoperable |
|
| Reusable |
|
| DSSC blueprint aspect | Extension | Reason for extension | Definition |
|---|---|---|---|
| Data Interoperability category | Data Specification enabling FAIRness | Building blocks support different FAIR aspects. All of these contribute to an effective description of data characteristics (be it published or required) | A complete and standards compliant specification of all aspects of data FAIRness. |
| Data Sovereignty and Trust category | Unchanged | - | Technical enablers to guarantee reliability and authenticity of participants’ information, to establish trust among them when interacting and performing data transactions. Common standards and agreed policies should prevent lock-in effects for users, and support FAIR principles, verification and authentication mechanisms, ensuring interoperability and security. |
| Data Value Creation Enablers | Data Value enhancement | The intrinsic value of data is not created, however, the tools contained in this category unlock the value of those data by making them available for a wider audience and facilitating their use. | To leverage the value of data, users must be supported in the retrieval and access to the data they need, and have the possibility to apply the necessary processing to adapt them to specific needs (e.g. data transformation, data visualisation, etc.). |
| - | New Services FAIRness category | Although data spaces are focused on data, specialised Services may be required to enable applications to exploit that data. Therefore, similar challenges apply to identify the necessary services and their possible use. | Symmetrically to the category “Data specification enabling FAIRness”, this category supports a good description of services to facilitate services retrieval according to the use cases workflows needs. |
| Data models | Unchanged | - | The model provides semantics and a shared vocabulary, as well as a structure for the data (hierarchies and relationships). |
| Data Exchange or Data Models | Data Exchange - Encodings | In previous versions of the DSSC building blocks stack, the encodings of data, or formats were included in the Data Models building block and, in the current version, they also have relationships with the Data Exchange building block. However, it is important to specify them separately, because dedicated standards are available, and they are independent from the options for representing data models or data exchange mechanisms. | Encoding is the format in which the data are encoded. |
| Data Exchange | Data Exchange (APIs) | Minor change in the title. Moved from the “Data interoperability” to the “Data value enhancement” category | The data exchange building block focuses on data transmission once the conditions for interchange authorisation are met. |
| Provenance and Traceability | Data Provenance model | The original building block is intended to address both (a) the standards to be used to represent and document provenance in the data and (b) mechanisms to track the data throughout their lifecycle. However, two different kinds of standards and services are available for the two scopes. Moreover, one complements the data description, while the other is intended to support data sovereignty. It is therefore reasonable to address them separately. | Standards intended to represent and document provenance and lineage of data. |
| Provenance and Traceability | Sharing Traceability | See row above. Moved from the “Data Interoperability” to the “Data Sovereignty and Trust” category | Standards and services intended to keep track of the data processing and sources along their lifecycle. |
| Access and usage policies and enforcement | Data Policies | The DSSC building block aims to specify how to define and enforce access and usage policies within a data space and how participants define their policies in data spaces. However, we consider that the policies specifying access and usage conditions and policies schemas follow rules and definitions which are in practice separated from the generic mechanisms used to ensure that such conditions are respected. Therefore in this proposal, it is split in two respective building blocks. | A policy is defined by the DSSC Blueprint v1.0 as “either an offer from the data provider or an agreement between the data provider and the data recipient. A policy is comprised of rules that specify the rights and duties of the parties: Access Rules: whether access to a resource is allowed or not; Usage Rules: how a resource might or may not be used; Consent Rules: whether usage of a resource, for which consent might be required from third parties, is allowed or not.”. Policies include licence details. |
| Access and usage policies and enforcement | Access and usage control and management | The DSSC building block aims to specify how to define and enforce access and usage policies within a data space and how participants define their policies in data spaces. However, we consider that the policies specifying access and usage conditions and policies schemas follow rules and definitions which should be separated from the mechanisms used to ensure that such conditions are respected. Therefore in this proposal, it is split in two respective building blocks. | Mechanisms in place to ensure access and usage policies related to certain data are respected. |
| Identity and attestation management | Unchanged | - | Information provided on the relevant entities must be verifiable to enable the onboarding and offboarding processes. The trustworthiness of information is linked to the trustworthiness of the Trust Anchors (or Trust Service Providers, specifically for identities), who are entitled to issue the respective attestations. |
| Trust Framework | Unchanged | - | Mechanisms and standards enabling a trust environment to be implemented within which data can be securely exchanged. |
| Data, services and offering descriptions | Data descriptions (metadata) | Separated by the original building block, because specific schemas are needed to describe datasets (metadata). Moved from the “Data Value Creation Enablers” to the “Data Specification enabling FAIRness” category | Metadata, technical guidance and schemas for describing datasets, exchanged within a data space between providers and recipients. Metadata allow consistent data retrieval, generation or reuse, ensuring reliability in the results for which data are used as input and related decision making process. |
| Data, services and offering descriptions | Services descriptions (metadata) | Separated by the original building block, because specific schemas are needed to describe services. Moved from the “Data Value Creation Enablers” to the “Services FAIRness” category | Metadata, technical guidance and schemas for describing services chosen as components of a data space architecture. |
| Data, services and offering descriptions | Offerings descriptions? | Separated by the original building block as part not covered by existing or planned data and services technical description. We remain with the doubt about the category under which it could fall, since several elements from different categories are useful to define the offering. | Offerings refer to a combination of descriptions and conditions attached to the data made available in the data space. However, a sharper definition is hard to find in the DSSC building block description. |
| Publication and discovery | Metadata publication and discovery | Minor change in the title. Checking the DSSC blueprint, it refers to metadata publication rather than to data publication itself. Therefore “metadata” was added to the title, to avoid any confusion with data publication systems (e.g. by means of APIs). | |
| Value added services | Unchanged | - | Included in the Blueprint v1.0, any kind of processing, as service, is included |
| - | New Data Requirements and quality Schemas | In “Data Specification enabling FAIRness” category | Data requirements specification is essential for a successful data retrieval or generation, leading to reliable results for use cases. Referring to standardised schemas to specify data requirements allows interoperability between systems. |
| Data models | New Vocabularies | Data models are often described as "vocabularies", however there are many forms of terminology needed to describe the structure and semantics of data, as well as constrain the values that are used to convey information within these structures. Data spaces will face issues of abstraction, where one domain may model a concept in detail, where for another domain this is simply a classification attribute - e.g. a "cat" or an "animal, type=cat". | Vocabularies are defined sets of terms. In the case of data models, these terms may be formally described in terms of relationships to other terms, as ontologies or taxonomies, but they may exist a lists of terms with defintions. |
| - | New Vocabularies services | In “Data Value Enhancement” category. Services may be used to cross-reference or match related terms from different domains to enhance "findability" in particular. Publishing cross-walks with expert curation can add significant value to data for domains needing this semantic clarity. | Services intended to manage or augment vocabularies for the related functionalities. |
| - | New data requirements and quality services (definition+validation) | In “Data Value Enhancement” category | Services intended to support specification of standard-based data requirements (based on the building blocks in the category “Data Specifications for FAIRness”) as well as data validation against such requirements. |
| - | New Licenses for services | In “Services FAIRness” category | The kinds of licenses available for services (to be known when planning and implementing a data space architecture). |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
