lb5645284 发表于 2016-12-14 10:54:13

solr 同义词搜索

  solr 同义词搜索
solr-synonym功能(同义词)
  1) 配置 
  ==========================schema.xml START=================================================
  <fieldType name="textMaxWord" class="solr.TextField" >^M

      <analyzer type="index">
        <tokenizer class="com.chenlb.mmseg4j.solr.MMSegTokenizerFactory" mode="max-word"/>^M
        <filter class="solr.StopFilterFactory" ignoreCase="false" words="stopwords.txt"/>
        <filter class="solr.StandardFilterFactory"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
      </analyzer>

      <analyzer type="query">
        <tokenizer class="com.chenlb.mmseg4j.solr.MMSegTokenizerFactory" mode="max-word"/>^M
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.StopFilterFactory" ignoreCase="false" words="stopwords.txt"/>
        <filter class="solr.StandardFilterFactory"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
      </analyzer>
    </fieldType>
  ……
  <field name="title" type="textMaxWord" indexed="true" stored="true" termVectors="true"/>
  ……
  ==========================schema.xml END=================================================
  ==========================conf/synonyms.txt START=================================================
  # Some synonym groups specific to this example
GB,gib,gigabyte,gigabytes
MB,mib,megabyte,megabytes
Television, Televisions, TV, TVs
中国,美国,德国,法国
==========================conf/synonyms.txt END=================================================
  2)使用
  搜索title:中国  结果包括美国、中国的结果都会出现
  引自:http://hi.baidu.com/liwei_8/blog/item/6e0197dfd376700a63279886.html
页: [1]
查看完整版本: solr 同义词搜索