huashan8 posted on 2016-12-16 10:51:56

Using WordDelimiterFilter and autoGeneratePhraseQueries in Solr

WordDelimiterFilter splits a word into subwords.
For example:
- split on intra-word delimiters (by default, all non-alphanumeric characters)
    - "Wi-Fi" -> "Wi", "Fi"
- split on case transitions
    - "PowerShot" -> "Power", "Shot"
- leading and trailing intra-word delimiters on each subword are ignored
    - "//hello---there, 'dude'" -> "hello", "there", "dude"
- trailing "'s" is removed from each subword
    - "O'Neil's" -> "O", "Neil"
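As a result, if a document contains "Wi-Fi", searching for either "wifi" or "wi fi" will match (assuming the analyzer also lowercases terms and, for "wifi", that the index-time filter is configured with catenateWords="1").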
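
To make the rules above concrete, here is a rough Python sketch of them. It is not Lucene's implementation, just an approximation for illustration: the function names split_subwords and index_terms are made up, the split at letter/digit boundaries is an assumption modelled on the filter's splitOnNumerics option, and the catenation step imitates catenateWords="1".

```python
import re


def split_subwords(token):
    """Approximate the WordDelimiterFilter splitting rules (illustrative only)."""
    # Drop a possessive "'s" that ends a subword, e.g. "O'Neil's" -> "O'Neil".
    token = re.sub(r"'[sS](?![A-Za-z0-9])", "", token)
    subwords = []
    # Split on runs of non-alphanumeric characters; the empty pieces produced by
    # leading or trailing delimiters are skipped.
    for part in re.split(r"[^A-Za-z0-9]+", token):
        if not part:
            continue
        # Split on lower-to-upper case transitions and (assumption) at
        # letter/digit boundaries.
        subwords.extend(re.findall(r"[A-Z]+[a-z]*|[a-z]+|[0-9]+", part))
    return subwords


def index_terms(token):
    """Lowercased subwords plus their catenation (imitates catenateWords="1")."""
    parts = [p.lower() for p in split_subwords(token)]
    terms = set(parts)
    if len(parts) > 1:
        terms.add("".join(parts))
    return terms


if __name__ == "__main__":
    for sample in ["Wi-Fi", "PowerShot", "//hello---there, 'dude'", "O'Neil's"]:
        print(sample, "->", split_subwords(sample))
    print(index_terms("Wi-Fi"))
```

With these assumptions, index_terms("Wi-Fi") contains "wi", "fi" and "wifi", which is why both the "wifi" and the "wi fi" search can find the document.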

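The attribute autoGeneratePhraseQueries="true" turns the several terms that analysis produces from a single search word into a phrase query. Note that "true" is the default only for schema versions below 1.4; from schema version 1.4 onward the default is "false", so it has to be set explicitly on the fieldType.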
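For example, WordDelimiterFilter analyzes "pdp-11" and, with this attribute enabled, the query becomes the phrase "pdp 11" rather than the two separate terms pdp OR 11.
autoGeneratePhraseQueries="true" works poorly for languages whose words are not separated by whitespace.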

This way, searching for "Wi-Fi" is equivalent to searching for the phrase "Wi Fi".
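
The difference between the two settings can be sketched in a few lines of Python. This only imitates the behaviour described above and is not how Solr's query parser is written: analyze and build_query are hypothetical names, the analysis is reduced to splitting on non-alphanumerics plus lowercasing, and the non-phrase case assumes the default OR operator.

```python
import re


def analyze(chunk):
    """Very reduced query-time analysis: split into letter/digit runs, lowercase."""
    return [t.lower() for t in re.findall(r"[A-Za-z]+|[0-9]+", chunk)]


def build_query(chunk, auto_generate_phrase_queries):
    """Build a query string for one whitespace-separated chunk of user input."""
    terms = analyze(chunk)
    if len(terms) <= 1:
        # A single term is never turned into a phrase.
        return "".join(terms)
    if auto_generate_phrase_queries:
        # Several terms from one chunk become a phrase query.
        return '"' + " ".join(terms) + '"'
    # Otherwise the terms are combined as independent clauses (OR assumed here).
    return " OR ".join(terms)


if __name__ == "__main__":
    print(build_query("pdp-11", True))
    print(build_query("pdp-11", False))
    print(build_query("Wi-Fi", True))
```

With the flag on, "pdp-11" only matches documents where "pdp" is immediately followed by "11"; with it off, a document containing either term can match.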