HDTcrypt: Efficient Compression and Encryption of RDF Datasets

Javier D. Fernández1, 2, Sabrina Kirrane1, Axel Polleres1, 2, and Simon Steyskal1, 3

1. Vienna University of Economics and Business (Austria)
2. Complexity Science Hub Vienna (Austria)
3. Siemens AG Österreich, Vienna (Austria)

This work has been submitted to the Semantic Web Journal. In the following, we provide a brief overview of the proposal and the source code of its prototype. Please see the “RDF/HDT” project website for further details on HDT.

1. Introduction
2. Source Code

1. Introduction

The publication and interchange of RDF datasets online has experienced significant growth in recent years, promoted by different but complementary efforts, such as Linked Open Data, the Web of Things and RDF stream processing systems.

However, the Web of Data is still not mature enough to support the next generation of eBusiness applications. On the one hand, data publishers need a means to tightly couple access policies with sensitive data (e.g. personal data, health data, financial data). On the other hand, the infrastructure needs to compress RDF graphs to minimise the amount of data transferred between data publishers and data consumers.

In this paper, we present a framework which combines encryption with HDT compression techniques for RDF datasets. In particular, we demonstrate how careful integration allows compressed and encrypted datasets, for multiple users with different access policies, to be efficiently published, exchanged and consumed using the existing web infrastructure.
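
To make the idea of serving several users with different access policies from one published dataset more concrete, the following is a minimal sketch and not the HDTcrypt implementation. It assumes the graph has already been split into one partition per access policy, uses zlib as a stand-in for HDT compression, and uses a toy keystream cipher as a stand-in for a real cipher such as AES. The function names (publish, consume, xor_keystream) and the policy labels are hypothetical and chosen only for illustration.

```python
import hashlib
import secrets
import zlib


def xor_keystream(key: bytes, nonce: bytes, data: bytes) -> bytes:
    """Toy stream cipher for illustration only.

    NOT secure and NOT what HDTcrypt uses; a real system would use an
    authenticated cipher. Applying it twice with the same key and nonce
    returns the original data.
    """
    stream = hashlib.shake_256(key + nonce).digest(len(data))
    return bytes(a ^ b for a, b in zip(data, stream))


def publish(partitions: dict, keys: dict) -> dict:
    """Compress, then encrypt, each policy partition under its own key."""
    blobs = {}
    for policy, triples in partitions.items():
        plain = "\n".join(sorted(triples)).encode("utf-8")
        compressed = zlib.compress(plain)  # stand-in for HDT compression
        nonce = secrets.token_bytes(16)
        blobs[policy] = (nonce, xor_keystream(keys[policy], nonce, compressed))
    return blobs


def consume(blobs: dict, user_keys: dict) -> set:
    """Recover only the partitions for which the user holds a key."""
    triples = set()
    for policy, key in user_keys.items():
        if policy not in blobs:
            continue
        nonce, ciphertext = blobs[policy]
        plain = zlib.decompress(xor_keystream(key, nonce, ciphertext))
        triples.update(plain.decode("utf-8").splitlines())
    return triples


# Hypothetical example data: two partitions with different access policies.
partitions = {
    "public": ['<ex:alice> <ex:name> "Alice" .'],
    "health": ['<ex:alice> <ex:bloodType> "A+" .'],
}
keys = {policy: secrets.token_bytes(32) for policy in partitions}
blobs = publish(partitions, keys)

# A user holding only the "public" key sees only the public triples.
public_view = consume(blobs, {"public": keys["public"]})
# A user holding both keys sees the union of both partitions.
full_view = consume(blobs, keys)
```

Compression is applied before encryption in this sketch because encrypted bytes are close to random and no longer compress well; every consumer downloads the same published blobs and decrypts only the parts their keys unlock.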

2. Source Code

We provide a first alpha version of the HDTcrypt source code. Although this build has been extensively tested, it may still contain bugs and is subject to further optimization.

HDTcrypt is licensed under the GNU Lesser General Public License (LGPL).

The LUBM and DBpedia triple pattern queries used in our evaluation, as well as the LUBM and DBpedia RDF graphs (in HDT format), are also available.