Hasso-Plattner-Institut für Softwaresystemtechnik
Stratosphere

Prof. Dr. Felix Naumann

Hasso-Plattner-Institut
für Softwaresystemtechnik
Prof.-Dr.-Helmert-Str. 2-3
D-14482 Potsdam, Germany

Stratosphere

Stratosphere is a joint DFG project conducted by the Technische Universität Berlin, Humboldt Universität Berlin, and the Hasso-Plattner-Institut. It explores how the elasticity of clouds can be exploited for processing analytic queries massively in parallel. Unlike most traditional DBMS, Stratosphere will inherently support text-based and semi-structured data.


The sub-project at the HPI focuses on data quality improvements of linked open data. We define a declarative data cleansing language, implement the underlying basic operations, and develop cost estimations for the operations. Furthermore, we provide test data sets and example queries to evaluate the efficiency and effectivity of the data cleansing process.


Official Project Site

Please contact Felix Naumann or Arvid Heise for further questions.