Semantic Overlay Architecture (SOyA) is a platform for authoring and publishing data models that also provides validation and transformation functionality. It builds on the W3C Resource Description Framework (RDF) and related semantic web technologies to provide a lightweight approach to data integration and exchange. At the core of SOyA is a YAML-based data model that describes data structures as bases with optional overlays, which provide additional information and context.
The anonymisation process supports GDPR-compliant handling of personal data by applying a configurable, ontology-driven approach. It begins by fetching a JSON-LD configuration from a knowledge graph, which defines the anonymisation type and data type for each attribute using SPARQL queries. The input data is then restructured by attribute, and for every attribute a matching anonymizer (e.g., masking, generalization, or randomization, depending on available implementations) is instantiated and applied. Generalization, for example, assigns values to buckets to reduce identifiability, randomization introduces controlled noise, and masking hides values entirely. This modular process keeps the service flexible and extensible, and the entire service is accessible via a documented API. For more details, visit the GitHub repository: https://github.com/OwnYourData/anonymisation-service.
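The three anonymizer types described above can be illustrated with a minimal Python sketch. This is not the service's actual implementation: the function names, the `ANONYMIZERS` registry, the plain dictionary standing in for the JSON-LD configuration, and the choice of equal-width buckets and Gaussian noise are all assumptions made for illustration. The sketch also assumes that every record carries the same attributes and that attributes without a configured anonymizer are passed through unchanged.

```python
import random
from collections import defaultdict


def restructure_by_attribute(records):
    """Turn a list of records into {attribute: [values, ...]} columns."""
    columns = defaultdict(list)
    for record in records:
        for attribute, value in record.items():
            columns[attribute].append(value)
    return dict(columns)


def mask(values):
    """Masking: hide every value entirely."""
    return ["*****" for _ in values]


def generalize(values, bucket_count=3):
    """Generalization: replace each number with the range of its bucket.

    Equal-width buckets are an assumption of this sketch.
    """
    low, high = min(values), max(values)
    width = (high - low) / bucket_count or 1  # avoid zero width if all values are equal
    result = []
    for value in values:
        index = min(int((value - low) / width), bucket_count - 1)
        start = low + index * width
        result.append((start, start + width))
    return result


def randomize(values, scale=1.0, rng=None):
    """Randomization: add Gaussian noise (hypothetical noise model)."""
    rng = rng or random.Random()
    return [value + rng.gauss(0, scale) for value in values]


# Hypothetical registry mapping an anonymisation type to its implementation.
ANONYMIZERS = {
    "masking": mask,
    "generalization": generalize,
    "randomization": randomize,
}


def anonymise(records, config):
    """Apply the configured anonymizer to each attribute column."""
    columns = restructure_by_attribute(records)
    anonymised = {}
    for attribute, values in columns.items():
        anonymizer = ANONYMIZERS.get(config.get(attribute))
        # Assumption: attributes without a matching anonymizer stay unchanged.
        anonymised[attribute] = anonymizer(values) if anonymizer else values
    # Rebuild the row-oriented records from the anonymised columns.
    return [
        {attribute: column[i] for attribute, column in anonymised.items()}
        for i in range(len(records))
    ]
```

Keeping the anonymizers in a registry keyed by type mirrors the modular design described above: supporting a further method would only require registering one more function.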
Follow these steps to anonymise your dataset using the Anonymisation Service:

- Define an OverlayClassification (see the Classification section in the tutorial) to specify the anonymisation method for each attribute.
- Provide the data set and a reference to the SOyA structure in the body of a POST request to the API endpoint.

This website is a frontend for the underlying data anonymisation technology. You can also use the service via a REST API by calling the following endpoint:

    POST https://anonymizer.go-data.at/api/anonymise

For example, using curl:

    cat input.json | curl -H 'Content-Type: application/json' -d @- -X POST https://anonymizer.go-data.at/api/anonymise

Example file input.json:

    { "configurationURL": "https://soya.ownyourdata.eu/AnonymisationDemo", "data": [...] }
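The same request can be sketched in Python using only the standard library. The endpoint URL, the `Content-Type` header, and the `configurationURL` and `data` fields are taken from the curl example; the function names are hypothetical, and treating the response body as JSON is an assumption (the Swagger documentation of the service is the authoritative reference for the response format).

```python
import json
import urllib.request

API_URL = "https://anonymizer.go-data.at/api/anonymise"


def build_request(configuration_url, data, api_url=API_URL):
    """Build the POST request without sending it."""
    body = json.dumps(
        {"configurationURL": configuration_url, "data": data}
    ).encode("utf-8")
    return urllib.request.Request(
        api_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def anonymise(configuration_url, data, timeout=30):
    """Send the request and parse the response (assumed to be JSON)."""
    request = build_request(configuration_url, data)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage, where `records` is your own list of data records:
# result = anonymise("https://soya.ownyourdata.eu/AnonymisationDemo", records)
```

Separating `build_request` from `anonymise` makes it possible to inspect or log the payload before anything is sent over the network.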
The Swagger API documentation for this service is available at https://anonymizer.go-data.at/swagger-ui/index.html
A Docker image for local deployment is available at https://hub.docker.com/r/oydeu/anonymizer
This service is a proof of concept that demonstrates anonymisation using the overlay capabilities of SOyA: it showcases a simple yet machine-readable format for describing datasets and uses SOyA's built-in mechanisms for anonymisation.
We encourage everyone to report issues or submit pull requests on the public GitHub repository.