
Contact
Prof. Dr. Christoph Meinel
Hasso-Plattner-Institut
an der Universität Potsdam
Tel: +49 0331/5509-222
Fax: +49 0331/5509-325
Mobil: +49 176 10010727
meinel"at"hpi.uni-potsdam.de
Blogs
Contact
Dr. Michael G. Noll
- Personal homepage: http://www.michael-noll.com/
- Email: michael.noll[AT]hpi.uni-potsdam.de
- Chair: Internet Technologies and Systems
Hasso-Plattner-Institut für Softwaresystemtechnik
Universität Potsdam
Prof.-Dr.-Helmert-Str. 2-3
D-14482 Potsdam
Germany
Project
The research and development project SAFER INTERNET aims at providing a technical solution for a more protected use of the Internet for families, schools, and Internet users in general. The main goal is to tackle harmful and objectionable content such as pornography, violence or racism based on individual user preferences and without imposing forced censorship on users.
The Safer Internet Project is a joint binational project between SES ASTRA, Luxembourg; the Hasso Plattner Institute at the University of Potsdam, Germany; and the University of Luxembourg & LIASIT.
Publications
- Ching-Man Au Yeung, Michael G. Noll, Nicholas Gibbins, Christoph Meinel, Nigel Shadbolt
SPEAR: Spamming-resistant Expertise Analysis and Ranking in Collaborative Tagging Systems
International Journal of Computational Intelligence, Wiley-Blackwell, 2010 (to appear) - Michael G. Noll, Ching-Man Au Yeung, Nicholas Gibbins, Christoph Meinel, Nigel Shadbolt
Telling Experts from Spammers: Expertise Ranking in Folksonomies
SIGIR '09: Proc. of 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, July 2009, pp. 612-619, ISBN 978-1-60558-483-6 (ACM Link, BibTeX)
» Read the Technology Review article on this work - Ching-Man Au Yeung, Michael G. Noll, Nicholas Gibbins, Christoph Meinel, Nigel Shadbolt
On Measuring Expertise in Collaborative Tagging Systems
WebSci '09: Proc. of 1st Web Science Conference, Athens, Greece, March 2009 (BibTeX - Michael G. Noll, Christoph Meinel
The Metadata Triumvirate: Social Annotations, Anchor Texts and Search Queries
WI '08: Proc. of 7th IEEE/WIC/ACM International Conference on Web Intelligence, IEEE CS Press, Sydney, Australia, December 2008, pp. 640-647, ISBN 978-0-7695-3496-1 (IEEE Link, BibTeX) - Michael G. Noll, Christoph Meinel
Building a Scalable Collaborative Web Filter with Free and Open Source Software
SITIS '08: Proc. of 4th IEEE International Conference on Signal-Image Technology & Internet-based Systems, IEEE CS Press, Bali, Indonesia, November 2008, pp. 563-571, ISBN 978-0-7695-3493-0 (IEEE Link, BibTeX) - Michael G. Noll, Christoph Meinel
Exploring Social Annotations for Web Document Classification
SAC '08: Proc. of 23rd International ACM Symposium on Applied Computing, Fortaleza, Ceará, Brazil, March 2008, pp. 2315 - 2320, ISBN: 978-1-59593-753-7 (ACM Link, BibTeX) - Michael G. Noll, Christoph Meinel
Web Search Personalization via Social Bookmarking and Tagging
ISWC '07: Proc. of 6th International Semantic Web Conference & 2nd Asian Semantic Web Conference, Springer LNCS 4825, Busan, South Korea, November 2007. pp. 367 - 380, ISBN: 978-3-540-76297-3 (SpringerLink, BibTeX) - Michael G. Noll, Christoph Meinel
Authors vs. Readers: A Comparative Study of Document Metadata and Content in the WWW
DocEng '07: Proc. of 7th International ACM Symposium on Document Engineering, Winnipeg, Canada, August 2007, pp. 177 - 186, ISBN: 978-1-59593-776-6 (ACM Link, BibTeX) - Michael G. Noll, Christoph Meinel
Design and Anatomy of a Social Web Filtering Service (best paper award)
CIC '06: Proc. of 4th International Conference on Cooperative Internet Computing, Hong Kong, China, October 2006, pp. 35-44, ISBN 962-367-541-0 (WSP Link, BibTeX) - Michael G. Noll, Christoph Meinel
Web Page Classification: An Exploratory Study of the Usage of Internet Content Rating Systems
HACK '05: Proc. of HACK conference, Luxembourg, October 2005, ISBN 978-2-9599708-0-1 (BibTeX) - Michael G. Noll
Evaluation of Content Management Systems (Diploma Thesis)
University of Trier, Trier, Germany, 2004 - C. Meinel, T. Engel, E.-G. Haffner, G. Müllenheim, M. G. Noll, T. Wagner
Die Lock-Keeper Architektur
IT-Services/Institut für Telematik, Trier/Luxembourg, 2003 - M. G. Noll, T. Wagner
Lock-Keeper - doppelter Datendurchsatz bei halber Laufzeit
(freely translated: "Lock-Keeper - double data throughput at half runtime")
In: C. Meinel (eds.): Tätigkeitsbericht 2002 des Institut für Telematik, Trier, Germany, 2003 - M. Schmitt, M. G. Noll, G. Müllenheim, B. Lentes, M. Vieten, T. Engel, C. Meinel
Firewalls und Intrusion-Detection-Systeme: Technologien und Produkte
(translated: "Firewalls and Intrusion Detection Systems: Technologies and Products")
Study for the German Ministry of Science, Education, Research, and Culture, Institut für Telematik, Trier, Germany, 2002 - C. Meinel, G. Müllenheim, M. G. Noll, T. Wagner
Lock-Keeper - eine patentierte Schleusen-Technologie für höchste Sicherheitsansprüche
(freely translated: "Lock-Keeper - a patented lock-based technology for the highest demands in network security")
In: Meinel, C. (eds.): Tätigkeitsbericht 2001 des Institut für Telematik, Trier, Germany, 2002
SPEAR Algorithm
We proposed the graph-based SPEAR algorithm (Spamming-resistant Expertise Analysis and Ranking), which is a new technique to measure the expertise of users by analyzing their behavior and activities in online communities. The focus of the algorithm is on the ability of users to find new, high quality information in the Internet. At the same time, SPEAR has been shown to be very resistant to spamming attacks, i.e. malicious attempts to manipulate its rankings.
We have created a dedicated homepage for the SPEAR algorithm, which provides further details and a reference implementation of the algorithm:
http://www.spear-algorithm.org/
Published research data sets
- CABS120k08 (published 2008) - a large research data set about Web metadata based on a sample of 120,000 web documents with data retrieved from the Open Directory Project, the AOL Search query log corpus AOL500k, Google PageRank, Delicious.com/Yahoo!, and anchor text from incoming hyperlinks
- DMOZ100k06 (published 2007) - a large research data set about document metadata based on a random sample of 100,000 web documents from the Open Directory combined with data retrieved from Delicious.com/Yahoo!, Google, and ICRA
Tutorials
In my PhD project, I have used Hadoop quite a lot. Hadoop is a Yahoo-sponsored open source framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File System and of MapReduce. Here are some tutorials to get you started.










