Search index should ignore punctuation

Search index should ignore punctuation

I just wasted some time trying to locate Vector-simplesearch-containing ("containing..."). The text appearing in the search box on hu.wikipedia is tartalmazza… so I figured searching for "tartalmazza" in the MediaWiki namespace would yield the message but apparently it doesn't because the indexer thinks the ellipsis character (U+2026) is part of the word (searching for tartalmazza… works). This can be quite annoying for translators.

Tgr07:53, 6 June 2010