
Prof. Dr. Felix Naumann
Hasso-Plattner-Institut
für Softwaresystemtechnik
Prof.-Dr.-Helmert-Str. 2-3
D-14482 Potsdam, Germany
Tool voidGen released
As part of our winning submission at the 2010 Billion Triple Challenge at the International...
Additional resources
- Ontology for classes in BTC dataset (extracted from data, zipped .nt file)
- Ontology for classes in BTC dataset (lookup on URIs, zipped .nt file)
For the approach introduced in our submission, we used a combination of these two ontologies.
Results
- Ontology coverage data
- Frequency and coverage of properties of some common types.
Schema of the CSV file: type;property;frequency;domain flag. The domain flag is y(es) if the BTC ontology (see above) defines the type as the rdfs:domain of the property, and n(o) otherwise.
- Rule data
- Frequent set format: | set values | frequency
- e.g. | birthPlace, birthDate | 1045
- Rule format: | antecedent values -> consequent values | frequency | confidence
- e.g. | birthPlace -> birthDate | 1045 | 97%
- Positive association rules between predicates of mo:MusicArtist (0.2% support and 80% confidence; maximum rule size of 3)
- Negative association rules between predicates of mo:MusicArtist (0.2% support and 80% confidence; maximum rule size of 3)
- Negative and positive association rules between predicates of foaf:OnlineAccount (0.2% support and 80% confidence; rules of all sizes)
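The ontology coverage file described above can be read with a few lines of Python. The following is a minimal sketch, not the tooling used for the submission. It assumes that the file has no header row, that the frequency column is numeric, and that the last column starts with "y" or "n". The sample rows and the function names read_coverage and domain_share are invented for illustration and are not taken from the published results.

```python
import csv
import io
from collections import defaultdict

# Hypothetical rows in the documented layout: type;property;frequency;domain flag.
# The values are made up for illustration only.
SAMPLE = """\
foaf:Person;foaf:name;120;y
foaf:Person;dc:title;30;n
mo:MusicArtist;foaf:name;50;n
"""

def read_coverage(handle):
    """Yield (type, property, frequency, in_domain) from a semicolon-separated file."""
    for row in csv.reader(handle, delimiter=";"):
        if len(row) != 4:
            continue  # skip blank or malformed lines
        rdf_type, prop, freq, flag = (field.strip() for field in row)
        # Assumption: the flag is written as "y"/"yes" or "n"/"no".
        yield rdf_type, prop, float(freq), flag.lower().startswith("y")

def domain_share(rows):
    """Per type, the share of total frequency on properties whose rdfs:domain is that type."""
    total = defaultdict(float)
    covered = defaultdict(float)
    for rdf_type, _prop, freq, in_domain in rows:
        total[rdf_type] += freq
        if in_domain:
            covered[rdf_type] += freq
    return {t: covered[t] / total[t] for t in total if total[t] > 0}

shares = domain_share(read_coverage(io.StringIO(SAMPLE)))
print(shares)
```

To use it on a downloaded file, replace io.StringIO(SAMPLE) with an open file handle, for example open(path, newline="", encoding="utf-8").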

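The two line formats of the rule data can be parsed as sketched below. This is an illustrative reading of the formats listed above, using the two example lines from this page. The names Rule, parse_frequent_set and parse_rule are hypothetical, and the sketch assumes that items are comma-separated and that confidence is always written as a percentage. It does not interpret any negation markers that the negative rule files may contain.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Rule:
    antecedent: tuple
    consequent: tuple
    frequency: int
    confidence: float  # stored as a fraction in [0, 1]

def _fields(line):
    """Split a pipe-delimited line into trimmed fields, ignoring outer pipes."""
    return [field.strip() for field in line.strip().strip("|").split("|")]

def _items(text):
    """Split a comma-separated item list into a tuple of trimmed names."""
    return tuple(item.strip() for item in text.split(",") if item.strip())

def parse_frequent_set(line):
    """Parse '| set values | frequency' into (items, frequency)."""
    items, freq = _fields(line)
    return _items(items), int(freq)

def parse_rule(line):
    """Parse '| antecedent -> consequent | frequency | confidence' into a Rule."""
    body, freq, conf = _fields(line)
    left, right = body.split("->")
    return Rule(_items(left), _items(right), int(freq), float(conf.rstrip("%")) / 100.0)

# The two example lines given in the format description.
frequent_set = parse_frequent_set("| birthPlace, birthDate | 1045")
rule = parse_rule("|birthPlace -> birthDate| 1045 | 97%")
print(frequent_set)
print(rule)
```

Keeping the confidence as a fraction rather than a percentage string makes it straightforward to filter rules against a threshold such as the 80% used for the files above.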

