Practical Apache Lucene 8: Uncover the Search Capabilities of Your Application

Sharma, Atri

  • 出版商: Apress
  • 出版日期: 2020-11-01
  • 售價: $1,420
  • 貴賓價: 9.5$1,349
  • 語言: 英文
  • 頁數: 103
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1484263448
  • ISBN-13: 9781484263440
  • 相關分類: 全文搜尋引擎 Full-text-search
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Gain a thorough knowledge of Lucene's capabilities and use it to develop your own search applications. This book explores the Java-based, high-performance text search engine library used to build search capabilities in your applications.

Starting with the basics of Lucene and searching, you will learn about the types of queries used in it and also take a look at scoring models. Applying this basic knowledge, you will develop a hello world app using basic Lucene queries and explore functions like scoring and document level boosting.

 

Along the way you will also uncover the concepts of partial searching and matching in Lucene and then learn how to integrate geographical information (geospatial data) in Lucene using spatial queries and n-dimensional indexing. This will prepare you to build a location-aware search engine with a representative data set that allows location constraints to be specified during a search. You'll also develop a text classifier using Lucene and Apache Mahout, a popular machine learning framework.

 

After a detailed review of performance bench-marking and common issues associated with it, you'll learn some of the best practices of tuning the performance of your application. By the end of the book you'll be able to build your first Lucene patch, where you will not only write your patch, but also test it and ensure it adheres to community coding standards.

What You'll Learn

 

 

  • Master the basics of Apache Lucene
  • Utilize different query types in Apache Lucene
  • Explore scoring and document level boosting
  • Integrate geospatial data into your application

 

Who This Book Is For

 

Developers wanting to learn the finer details of Apache Lucene by developing a series of projects with it.

 

商品描述(中文翻譯)

深入了解Lucene的功能,並使用它來開發自己的搜尋應用程式。本書探討了這個基於Java的高效能文字搜尋引擎程式庫,用於在應用程式中建立搜尋功能。

從Lucene和搜尋的基礎開始,您將學習其中使用的查詢類型,並瞭解評分模型。應用這些基本知識,您將使用基本的Lucene查詢開發一個hello world應用程式,並探索評分和文件級別增強等功能。

在此過程中,您還將揭示Lucene中的部分搜尋和匹配概念,然後學習如何使用空間查詢和n維索引將地理資訊(地理空間資料)整合到Lucene中。這將使您能夠建立一個具有代表性資料集的位置感知搜尋引擎,該引擎允許在搜尋期間指定位置限制。您還將使用Lucene和流行的機器學習框架Apache Mahout開發文本分類器。

在詳細檢查性能基準和相關常見問題之後,您將學習調整應用程式性能的一些最佳實踐。通過本書的結尾,您將能夠建立您的第一個Lucene補丁,其中您不僅會編寫補丁,還會測試它並確保其符合社區的編碼標準。

您將學到什麼:

- 掌握Apache Lucene的基礎知識
- 利用Apache Lucene中的不同查詢類型
- 探索評分和文件級別增強
- 將地理空間資料整合到您的應用程式中

本書適合對Apache Lucene的細節有興趣的開發人員,通過開發一系列專案來學習。

作者簡介

Atri is a distributed systems engineer with expertise in building and scaling large data oriented systems, and an Apache Lucene/Solr committer. He has worked for Microsoft, where he was responsible for scaling the storage and query engines for Azure CosmosDB. He is also a long time PostgreSQL contributor and an Apache committer and PMC member for HAWQ, MADLib, and Apex.

 

作者簡介(中文翻譯)

Atri 是一位分散式系統工程師,擅長建立和擴展大型數據導向系統,並且是 Apache Lucene/Solr 的貢獻者。他曾在 Microsoft 工作,負責擴展 Azure CosmosDB 的存儲和查詢引擎。他也是長期的 PostgreSQL 貢獻者,並且是 HAWQ、MADLib 和 Apex 的 Apache 貢獻者和 PMC 成員。