Final Thesis: A Storage Solution for the WOM

Abstract: Most wikis store their pages (articles) as text blobs in an SQL database. A formal parser for MediaWiki’s markup format which drives Wikipedia and many other wiki instances has been developed by the SWEBLE project. This parser produces a structured document as Wiki Object Model (WOM) which unlocks the information stored in each article and enables refactoring and other technologies to the wiki world. These technologies require a storage solution for fast document access and preferably low memory usage. The goal of this thesis is to investigate available storage options for XML documents and implement an interface which supports quick, arbitrary searches over all articles and provides methods for refractoring the WOM documents.

Keywords: Wikitext, Swebe, WOM

PDFs: Thesis unavailable; Work Description

Reference: Thomas Land. A Storage Solution for the WOM. Bachelor Thesis, Friedrich-Alexander-Universität Erlangen-Nürnberg: 2015.