MediaWiki:Cirrussearch-morelikethis-settings/en


 * 1) This message lets you configure the settings of the "more like this" feature.
 * 2) Changes to this take effect immediately.
 * 3) The syntax is as follows:
 * 4)   * Everything from a "#" character to the end of the line is a comment.
 * 5)   * Every non-blank line is the setting name followed by a ":" character followed by the setting value
 * 6) The settings are:
 * 7)   * min_doc_freq (integer): Minimum number of documents (per shard) that need a term for it to be considered.
 * 8)   * max_doc_freq (integer): Maximum number of documents (per shard) that have a term for it to be considered.
 * 9)                   High frequency terms are generally "stop words".
 * 10)   * max_query_terms (integer): Maximum number of terms to be considered. This value is limited to $wgCirrusSearchMoreLikeThisMaxQueryTermsLimit (100).
 * 11)   * min_term_freq (integer): Minimum number of times the term appears in the input to doc to be considered. For small fields (title) this value should be 1.
 * 12)   * percent_terms_to_match (float 0 to 1): The percentage of terms to match on. Defaults to 0.3 (30 percent).
 * 13)   * min_word_len (integer): Minimal length of a term to be considered. Defaults to 0.
 * 14)   * max_word_len (integer): The maximum word length above which words will be ignored. Defaults to unbounded (0).
 * 15)   * fields (comma separated list of values): These are the fields to use. Allowed fields are title, text, auxiliary_text, opening_text, headings and all.
 * 16)   * use_fields (true|false) : Tell the "more like this" query to use only the field data. Defaults to false: the system will extract the content of the text field to build the query.
 * 17) Examples of good lines:
 * 18) min_doc_freq:2
 * 19) max_doc_freq:20000
 * 20) max_query_terms:25
 * 21) min_term_freq:2
 * 22) percent_terms_to_match:0.3
 * 23) min_word_len:2
 * 24) max_word_len:40
 * 25) fields:text,opening_text
 * 26) use_fields:true
 * 1) use_fields:true