User:Nike/thesis

From translatewiki.net

Schedule

  • 2010-09-20 – Got the first idea
  • 2010-10-01 – Started this page
  • 2010-10-19 – Finished preliminary schedule and table of contents
  • October – research different methods, software and materials
  • November – "hands on": set up the development environment, start implementing (after choosing the method), possibly start writing about the methods
  • December – implementation continues, but with more weight on writing
  • January – first presentation?
  • January – implementation mostly finished
  • February – do some tests? More writing
  • March – finish writing, give the real presentation
  • April – tidy up the document

Table of contents

  1. Introduction
  2. Methods for handling morphology (with respect to searching)
    1. Stemming methods
    2. Uninflecting words
    3. Search stems
  3. Software and data
    1. Lucene
    2. MediaWiki's search integration(s)
    3. Dump of Wikipedia (or other) database
  4. Note about the implementation
    1. E.g. the user interface, input processing...
  5. Evaluation and results
    1. Speed (how many languages?)
  6. Summary
    1. What was made and what it is useful for
    2. Wikimedia's requirements for search
    3. How to continue the work
  • Sources