Open Data Projects at HPI
This page gathers open-data-related activities at HPI, in particular in the areas of linked open data and the semantic web. Contributing groups are:
- Internet technologies and systems (Christoph Meinel and Harald Sack)
- Information systems (Felix Naumann)
Research Projects
- GovWILD
GovWILD gathers and integrates government data from diverse government data sources, presents them in a Web interface and provides the data for download.
Received the IBM Scalable Data Analytics Award
- ProLOD
Profiler for Linked Open Data (ProLOD) is a Web-based tool for performing data profiling tasks on datasets such as DBpedia. (NTII 2010 paper)
- Yovisto
Yovisto is a semantic video search engine specialized in academic content. Yovisto's search index combines automated content-based video analysis with user-generated collaborative annotation (collaborative tagging, discussions, and comments) and Linked Open Data resources such as DBpedia. In contrast to traditional video search engines, Yovisto enables pinpoint access within video data by providing fine-grained, time-dependent semantic metadata that is published as Linked Open Data (SAMT 2010 paper / SemSearch 2010 paper).
- Mediaglobe
Cultural Memory provides an ever-increasing amount of information. Only a negligibly small portion of this content is already accessible on the World Wide Web. Mediaglobe is funded by the Federal Ministry of Economics and Technology within the context of the THESEUS research program 'New technologies for the internet of services'. It aims to take media archives into the digital future by making the growing stock of audiovisual documents on German contemporary history available and usable. Mediaglobe offers state-of-the-art services for multimedia analysis and semantic multimedia search based on mappings to Linked Open Data resources on the web.
- Stratosphere
The DFG Research Unit (Forschergruppe) is developing a cloud-based data management system. One of the use cases is to query and cleanse linked open data.
- iPopulator
With iPopulator, we have introduced a system that automatically populates infoboxes of Wikipedia articles by extracting attribute values from the article's text. In contrast to prior work, iPopulator detects and exploits the structure of attribute values to independently extract value parts. We have published a set of extracted facts from Wikipedia. (CIKM 2010 paper / technical report)
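The idea of extracting value parts independently, as described for iPopulator above, can be illustrated with a minimal Python sketch. It is not iPopulator's actual method, which this page does not describe in detail. The attribute ("employees"), the value structure `{count} ({year})`, the regular expressions, and the function names are all hand-written assumptions chosen for illustration.

```python
import re

# Hypothetical value structure for an "employees" infobox attribute:
# a count followed by a year in parentheses, e.g. "5,000 (2009)".
TEMPLATE = "{count} ({year})"

# One hand-written pattern per value part (assumptions, not learned rules).
PART_PATTERNS = {
    "count": re.compile(r"(\d{1,3}(?:,\d{3})+|\d+)\s+employees"),
    "year": re.compile(r"\b(?:in|as of)\s+((?:19|20)\d{2})\b", re.IGNORECASE),
}


def extract_parts(text):
    """Search for each value part on its own; parts not found are left out."""
    parts = {}
    for name, pattern in PART_PATTERNS.items():
        match = pattern.search(text)
        if match:
            parts[name] = match.group(1)
    return parts


def assemble(parts):
    """Fill the value template only if every part was found."""
    if set(parts) != set(PART_PATTERNS):
        return None
    return TEMPLATE.format(**parts)


article = "As of 2009, the company had 5,000 employees worldwide."
value = assemble(extract_parts(article))
print(value)
```

Because each part has its own pattern, the count and the year may appear anywhere in the article text and in any order. The sketch returns `None` instead of a partial value when a part is missing.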
Publications
- RDF Ontology (Re-)Engineering through Large-scale Data Mining
Johannes Lorey, Ziawasch Abedjan, Felix Naumann, and Christoph Böhm.
Finalist @ Billion Triple Challenge, International Semantic Web Conference (ISWC), 2011
- Creating voiD Descriptions for Web-scale Data
Christoph Böhm, Johannes Lorey, and Felix Naumann.
Journal of Web Semantics: Science, Services and Agents on the World Wide Web 9(3):339-345, 2011
- Extracting Structured Information from Wikipedia Articles to Populate Infoboxes (short paper)
Dustin Lange, Christoph Böhm, and Felix Naumann.
Proceedings of the 19th Conference on Information and Knowledge Management (CIKM) 2010, Toronto, Canada
(Extended version available as technical report)
- Profiling Linked Open Data with ProLOD
Christoph Böhm, Felix Naumann, Ziawasch Abedjan, Dandy Fenz, Toni Grütze, Daniel Hefenbrock, Matthias Pohl, and David Sonnabend.
Workshop New Trends in Information Integration (NTII) 2010, Long Beach, USA
- Linking Open Government Data: What Journalists Wish They Had Known
Christoph Böhm, Felix Naumann, Markus Freitag, Stefan George, Norman Höfler, Martin Köppelmann, Claudia Lehmann, Andrina Mascher, and Tobias Schmidt.
Honorable Mention at Linked Data Triplification Challenge 2010 @ I-Semantics, Graz. (see GovWILD project)
- J. Waitelonis, N. Ludwig, H. Sack:
Use What You Have -- Yovisto Video Search Engine Takes a Semantic Turn, in Proc. of 5th Int. Conf. on Semantic and Digital Media (SAMT 2010), December 1-3, 2010, DFKI Saarbrücken.
- J. Waitelonis, H. Sack:
Exploratory Video Search with yovisto, 4th IEEE International Conference on Semantic Computing (ICSC 2010), September 22-24, 2010, Carnegie Mellon University, Pittsburgh, PA, USA.
- J. Waitelonis, H. Sack, Z. Kramer, J. Hercher:
Semantically Enabled Exploratory Video Search, in Proc. of Semantic Search Workshop (SemSearch10) at the 19th Int. World Wide Web Conference (WWW2010), April 26-30, 2010, Raleigh, NC, USA.
- J. Waitelonis, H. Sack:
Towards Exploratory Video Search by Using Linked Data, in Proc. of 2nd IEEE International Workshop on Data Semantics for Multimedia Systems and Applications (DSMSA2009), in conjunction with IEEE International Symposium on Multimedia (ISM2009), December 14-16, 2009, San Diego, California, USA, pp. 540-545 (ISBN 978-0-7695-3890-7).
- J. Waitelonis, H. Sack:
Augmenting Video Search with Linked Open Data, in Proc. of International Conference on Semantic Systems 2009 (i-semantics 2009), September 2-4, 2009, Graz, Journal of Universal Computer Science, Verlag der TU Graz, Austria, 2009 (ISBN 978-3-85125-060-2).
- M. Quasthoff, H. Sack, Ch. Meinel:
Can Software Developers Use Linked Data Vocabulary?, in Proc. of International Conference on Semantic Systems 2009 (i-semantics 2009), September 2-4, 2009, Graz, Journal of Universal Computer Science, Verlag der TU Graz, Austria, 2009 (ISBN 978-3-85125-060-2).
Data Sets
- GovWILD data set (JSON, SQL, RDF)
- iPopulator: Extracted facts from Wikipedia (CSV, N3)
Courses
- Bachelor project: A Cloud Platform for On-Demand Access to Open Data
- Master seminar: Large-Scale Data Analysis in the Cloud (see also Billion Triple Challenge 2010 paper and website)
- Bachelor project: Midas: Extreme Web Data Integration for Government Data (see GovWILD project)
- Master Seminar: Linked Data Profiling (see NTII 2010 paper)
- Master Seminar: Large Scale Processing for Semantic Web Technologies (WS 2010/11)
- Bachelor Seminar: Linked Open Data Application Engineering (WS 2009/10)
- Master Seminar: Scalable Data Analysis Algorithms using DBpedia data sets (WS 2011/12)
Master's Theses
- Learning to Extract Structured Information from Wikipedia Articles to Populate Infoboxes (Dustin Lange, 2009) (see also CIKM 2010 paper and tech report)
- Wikipedia cross-lingual Concept Identification and Infobox Alignment (Daniel Rinser, 2010)