Schedule
- 2010-09-20 – Got the first idea
- 2010-10-01 – Started this page
- 2010-10-19 – Finished preliminary schedule and table of contents
- October – research different methods, software, and materials
- November – hands-on work: set up the development environment, start implementing (after choosing the method), possibly start writing about the methods
- December – implementation continues, with more weight on writing
- January – first presentation?
- January – implementation mostly finished
- February – do some tests? More writing
- March – finish writing, give the real presentation
- April – tidy up the document
Table of contents
- Introduction
- Methods for handling morphology (with respect to searching)
- Stemming methods
- Uninflecting words
- Search stems
- Software and data
- Lucene
- MediaWiki's search integration(s)
- Dump of Wikipedia (or other) database
- Note about the implementation
- e.g. the user interface, input processing...
- Evaluation and results
- Speed (how many languages?)
- Summary
- What was made and what it is useful for
- Wikimedia's requirements for search
- How to continue the work