设为首页 收藏本站
查看: 474|回复: 0

[经验分享] Faceted Search with Solr

[复制链接]

尚未签到

发表于 2015-7-17 08:14:11 | 显示全部楼层 |阅读模式
  Faceted search has become a critical feature for enhancing findability and the user search experience for all types of search applications. In this article, Solr creator Yonik Seeley gives an introduction to faceted search with Solr.
  


By Yonik Seeley  




What is Faceted Search?
  Faceted search is the dynamic clustering of items or search results into categories that let users drill into search results (or even skip searching entirely) by any value in any field. Each facet displayed also shows the number of hits within the search that match that category. Users can then “drill down” by applying specific contstraints to the search results. Faceted search is also called faceted browsing, faceted navigation, guided navigation and sometimes parametric search.
  It’s easiest to understand what faceted search is through an example, appropriately from CNET Reviews, the first website to use Solr even before it had been contributed to Apache by CNET.


  This example is actually faceted browsing because it started with all digital cameras and not a user search.Manufacturer is a facet of these search results, and the facet constraints or facet values for this facet includeCanon USA, Sony, and Nikon. A previous screen also included Price and Digital camera type facets, but they are no longer displayed because we already selected the $400-$500 and SLR constraints from those facets. Due to those constraints, the resulting facet counts and camera listings reflect only digital SLR cameras between $400 and $500 in price. Any of the displayed facet constraints can be clicked to drill down and further narrow the search results by that constraint. Applied constraints can be removed by clicking on them within the breadcrumb trail.
  Faceted search provides an effective way to allow users to refine search results, continually drilling down until the desired items are found. The benefits include


  • Superior feedback – users can see at a glance a summary of the search results and how those results break down by different criteria.
  • No surprises or dead ends – users know how many results match before they click. Values with zero counts are normally removed to reduce visual noise and eliminate the possibility of a user accidentally selecting a constraint that would lead to no results.
  • No selection hierarchy is imposed – users are generally free to add or remove constraints in any order.
  




Implementing Faceting with Solr
  It’s relatively simple to get faceting information from Solr, as there are few prerequisites. Solr offers the following types of faceting, all of which can be requested with no prior configuration:


  • Field faceting – retrieve the counts for all terms, or just the top terms in any given field. The field must be indexed.
  • Query faceting – return the number of documents in the current search results that also match the given query.
  • Date faceting – return the number of documents that fall within certain date ranges.
  Faceting commands are added to any normal Solr query request, and the faceting counts come back in the same query response. If you are not familiar with the details of making requests in Solr, see the tutorial.




Implementing the Manufacturer facet
  To implement the Manufacturer facet, I send a field faceting command to Solr. This example assumes a field named “manu” exists in the schema and that it has the manufacturer indexed as a single token. The “string” type in the Solr schema is an appropriate field type to meet these needs.
  Let’s assume for a moment that the user typed “camera” into the search box. The Solr query to retrieve the top “camera” matches would be:

http://localhost:8983/solr/query?q=camera  
  To retrieve facet counts for the “manu” field, we would simply add the following parameters to that query request:

&facet=true &facet.field=manu  Any number of facet commands can be added to a single query request. To facet on both the “manu” field and the “camera_type” field, we would add the following parameters:

&facet=true &facet.field=manu &facet.field=camera_type  
  The query response will now contain facet count information for the given fields in addition to the top matches for the query.

"facet_fields" : {   "manu" : [     "Canon USA" , 25,     "Olympus" , 21,     "Sony" , 12,     "Panasonic" , 9,     "Nikon" , 4 ],   "camera_type" : [     "Compact" , 17,     "Ultracompact" , 11,     "SLR" , 9,     "Full body" , 8 ] }   Facet counts returned are always in the context of the current query. For example, there may be 100 cameras manufacturerd by Canon in the search index, but only 17 that match the current search results. Now it’s up to the presentation layer to display this information to the user in the form of clickable links with counts displayed next to them.




Implementing the Price facet
  If we request field faceting on the “price” field, we get back counts for individual prices. However, we want price ranges, not individual prices. One workaround is to index another field that contains the ranges that the prices fall into (for example 100_200, 200_300, 300_400) and use field faceting on that field. A more flexible solution is to utilize query facets that provide the ability to retrieve facet counts for arbitrary queries.
  Let’s assume that we have an indexed “price” field and we want to get the facet counts for the following ranges of prices: $100 or less, $100-$200, $200-$300, $300-$400, $400-$500, and $500 or over. We simply add a facet.query command to our query request for each desired range:

&facet=true     &facet.query=price:[* TO 100]     &facet.query=price:[100 TO 200]     &facet.query=[price:200 TO 300]     &facet.query=price:[300 TO 400]     &facet.query=[price:400 TO 500]     &facet.query=price:[500 TO *]  In addition to the standard query results and any field faceting counts requested, the query response will also contain a facet count for each facet query that was specified.

"facet_queries" : {   "price:[* TO 100]" : 28,   "price:[100 TO 200]" : 54,   "price:[200 TO 300]" : 98,   "price:[300 TO 400]" : 84,   "price:[400 TO 500]": 73,   "price:[500 TO *]" : 56 }   




Applying Constraints
  Now that we’ve learned how to retrieve facet counts, how do we allow the user to drill down and narrow the search results with any of those constraints? The answer is standard Solr filter queries, where search results are filtered by any number of arbitrary queries.
  Let’s assume again that the user typed “camera” into the search box and we have queried solr for the top matching documents, and also requested some facet counts to be displayed to the user.

http://localhost:8983/solr/query?q=camera   &facet=on   &facet.field=manu   &facet.field=camera_type   &facet.query=price:[* TO 100]   &facet.query=price:[100 TO 200]   &facet.query=[price:200 TO 300]   &facet.query=price:[300 TO 400]   &facet.query=[price:400 TO 500]   &facet.query=price:[500 TO *]  Now let’s assume that the user wants to drill down on the constraint $400-$500 from the Price facet to get a new set of results that include only cameras in that price range. For this we use the fq (filter query) parameter, which allows one to filter by a query. We’ll also send the relevant faceting commands again since we also want to update the facet counts.

http://localhost:8983/solr/query?q=camera   &facet=on&facet.field=manu&facet.field=camera_type   &fq=price:[400 to 500]  The fq command can appear anywhere in the query request – parameter order does not matter. Notice that we no longer request the facet counts for the prices since we are constraining results to one price range and thus already know that the counts for other ranges would be zero.
  Now that we’ve received the Solr response and updated the facet display and results listing in our web frontend (including adding “$400-$500″ breadcrumb to allow the user to remove it if desired), let’s assume that the user clicks on “SLR” in the camera_type facet. To filter our search results further, we’ll simply add an additional fqparameter.

http://localhost:8983/solr/query?q=camera   &facet=on&facet.field=manu   &fq=price:[400 to 500]   &fq=camera_type:SLR  When the Solr response to this request is received by the web front-end, we can add an “SLR” breadcrumb, and once again update our facet counts and top search results.

运维网声明 1、欢迎大家加入本站运维交流群:群②:261659950 群⑤:202807635 群⑦870801961 群⑧679858003
2、本站所有主题由该帖子作者发表,该帖子作者与运维网享有帖子相关版权
3、所有作品的著作权均归原作者享有,请您和我们一样尊重他人的著作权等合法权益。如果您对作品感到满意,请购买正版
4、禁止制作、复制、发布和传播具有反动、淫秽、色情、暴力、凶杀等内容的信息,一经发现立即删除。若您因此触犯法律,一切后果自负,我们对此不承担任何责任
5、所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其内容的准确性、可靠性、正当性、安全性、合法性等负责,亦不承担任何法律责任
6、所有作品仅供您个人学习、研究或欣赏,不得用于商业或者其他用途,否则,一切后果均由您自己承担,我们对此不承担任何法律责任
7、如涉及侵犯版权等问题,请您及时通知我们,我们将立即采取措施予以解决
8、联系人Email:admin@iyunv.com 网址:www.yunweiku.com

所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其承担任何法律责任,如涉及侵犯版权等问题,请您及时通知我们,我们将立即处理,联系人Email:kefu@iyunv.com,QQ:1061981298 本贴地址:https://www.yunweiku.com/thread-87434-1-1.html 上篇帖子: Apache Solr 介绍 下篇帖子: [ solr扩展 ] Solr Spellchecker internals (now with tests!)
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

扫码加入运维网微信交流群X

扫码加入运维网微信交流群

扫描二维码加入运维网微信交流群,最新一手资源尽在官方微信交流群!快快加入我们吧...

扫描微信二维码查看详情

客服E-mail:kefu@iyunv.com 客服QQ:1061981298


QQ群⑦:运维网交流群⑦ QQ群⑧:运维网交流群⑧ k8s群:运维网kubernetes交流群


提醒:禁止发布任何违反国家法律、法规的言论与图片等内容;本站内容均来自个人观点与网络等信息,非本站认同之观点.


本站大部分资源是网友从网上搜集分享而来,其版权均归原作者及其网站所有,我们尊重他人的合法权益,如有内容侵犯您的合法权益,请及时与我们联系进行核实删除!



合作伙伴: 青云cloud

快速回复 返回顶部 返回列表