Solr: synonyms.txt and stopwords.txt

This article is a repost. 2014-12-09 16:33

Premise: the factories are applied at both index time and query time.

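1. With StandardTokenizerFactory, synonyms.txt at query time can only contain single words, or a mapping such as 植物》動物 (plant to animal), but not "英雄》植物" (hero to plant). The reason is that StandardTokenizerFactory splits Chinese text character by character by default, so you can simply tune the degree of matching directly. If you need word-level segmentation, use IK.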

2. With WhitespaceTokenizerFactory, on the other hand, "英雄》植物" works without any problem (and tokenization should be fine as well).

3. For stopwords.txt: if you add an entry such as "一" (one), every search will ignore that term.

4. Details (sample synonyms.txt):

    # blank lines and lines starting with pound are comments.

    #Explicit mappings match any token sequence on the LHS of "=>"
    #and replace with all alternatives on the RHS. These types of mappings
    #ignore the expand parameter in the schema.
    #Examples:

    #-----------------------------------------------------------------------
    #some test synonym mappings unlikely to appear in real input text
    aaafoo => aaabar
    bbbfoo => bbbfoo bbbbar
    cccfoo => cccbar cccbaz
    fooaaa,baraaa,bazaaa

    # Some synonym groups specific to this example
    GB,gib,gigabyte,gigabytes
    MB,mib,megabyte,megabytes
    Television, Televisions, TV, TVs
    #notice we use "gib" instead of "GiB" so any WordDelimiterFilter coming
    #after us won't split it into two words.
    飛利浦刮胡刀,飛利浦剃須刀

    # Synonym mappings can be used for spelling correction too
    pixima => pixma
    a\,a => b\,b
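To make the notes above a bit more concrete, here is a rough Python sketch (not Solr code) of how rules in this file format can be read and why the tokenizer matters. Everything in it is an assumption made for illustration: the names `parse_synonyms`, `analyze`, `char_tokens` and `whitespace_tokens` are hypothetical, the rule `英雄 => 植物` is written with `=>` on the assumption that this is what the "》" shorthand stands for, `char_tokens` is only a stand-in for the character-by-character splitting described in point 1, and the matcher only handles single-token left-hand sides, which is much simpler than what Solr's synonym filter really does.

```python
import re

# A few rules in synonyms.txt format. "英雄 => 植物" is a hypothetical rule
# modeled on the example in point 1; the rest come from the sample file.
SAMPLE = r"""
# blank lines and lines starting with pound are comments.
aaafoo => aaabar
cccfoo => cccbar cccbaz
GB,gib,gigabyte,gigabytes
Television, Televisions, TV, TVs
英雄 => 植物
pixima => pixma
a\,a => b\,b
"""

def split_unescaped(text):
    """Split on commas not preceded by a backslash, then unescape '\\,'."""
    parts = re.split(r"(?<!\\),", text)
    return [p.replace("\\,", ",").strip() for p in parts]

def parse_synonyms(text, expand=True):
    """Return {term: [replacements]} for explicit mappings and groups."""
    rules = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank lines and '#' lines are comments
        if "=>" in line:
            # Explicit mapping: every LHS term is replaced by all RHS alternatives.
            lhs, rhs = line.split("=>", 1)
            targets = split_unescaped(rhs)
            for term in split_unescaped(lhs):
                rules[term] = targets
        else:
            # Equivalent group: with expand=True each term maps to the whole
            # group, otherwise every term maps to the first entry only.
            group = split_unescaped(line)
            for term in group:
                rules[term] = group if expand else group[:1]
    return rules

def char_tokens(text):
    """Stand-in for character-by-character splitting of Chinese text."""
    return [ch for ch in text if not ch.isspace()]

def whitespace_tokens(text):
    """Split on whitespace only, so '英雄' stays one token."""
    return text.split()

def analyze(text, tokenize, rules, stopwords):
    """Tokenize, drop stop words, then replace single tokens via the rules."""
    out = []
    for tok in tokenize(text):
        if tok in stopwords:
            continue  # stop words are ignored entirely
        out.extend(rules.get(tok, [tok]))
    return out

rules = parse_synonyms(SAMPLE)
print(analyze("英雄 一 pixima", whitespace_tokens, rules, {"一"}))  # ['植物', 'pixma']
print(analyze("英雄", char_tokens, rules, {"一"}))                  # ['英', '雄']
```

With whitespace splitting, 英雄 reaches the rule table as one token and is replaced, and the stop word 一 is dropped. With character-by-character splitting, the two-character left-hand side never lines up with the single-character tokens in this simplified matcher, so the rule does not fire.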
