  • High Speed Ingestion into Solr with Custom Talend Component Developed by T/DG

    By: Dattatraya Patil | March 11, 2016

    In this blog I will explain how to use the High Speed Talend-Solr Ingestion components, released by T/DG as open source, to ingest documents into Solr, and what benefits they offer. T/DG released 3 custom Talend components, which

  • SOLR Security with ManifoldCF

    By: Srinivasa Sarma | February 9, 2016

    Introduction: This article explains how to implement Solr “document-level security” using the Manifold Connector Framework. ManifoldCF is an open source framework for pulling content out of a repository and sending it on to targets

  • Building Docker image with Solr

    By: Omi Tewary | February 8, 2016

    There are two ways to build a Docker image: (1) running an image, modifying it, and committing it, which requires access to a live container; (2) writing a Dockerfile and building from it. Let’s take an example of creating a Docker image

  • Understanding and Configuring Solr’s PingRequestHandler

    By: Vijay Mhaskar | October 30, 2015

    In this blog I talk about a simple yet very useful Solr handler, PingRequestHandler. We can use PingRequestHandler as an endpoint for an HTTP load balancer (like HAProxy) to use when checking the

  • Solr LocalParams and Security

    By: Vijay Mhaskar | August 7, 2015

    Introduction: Local parameters are often called LocalParams. Using them, we can “localize” information about an argument that is being sent to Solr in a query. It's another way of adding extra information about

  • SolrCloud – 2 Nodes Solr, 1 Node ZK Setup

    By: Susheel Kumar | August 3, 2015

    Here I am going to talk about a basic SolrCloud setup on 2 separate machines (or 2 Solr nodes sitting on different machines) with 1 ZooKeeper instance for development purposes. I could hardly find any article

  • Using Solr and TikaOCR to search text inside an image.

    By: Vijay Mhaskar | July 17, 2015

    Tesseract is probably the most accurate open source OCR engine available, and with Apache Tika 1.7 you can now use the awesome Tesseract OCR parser within Tika! Solr 5.x has support for Tika 1.7

  • Solr vs ElasticSearch

    By: Dikshant Shahi | July 17, 2015

    “Which one should I choose, Solr or ElasticSearch?” This question is asked quite frequently by anyone who is building a search engine or learning one of the two. And why shouldn’t they ask? After all, both Solr

  • Grouping Results with Solr

    By: Dattatraya Patil | July 10, 2015

    Grouping Results: Imagine a situation where your data set is divided into different categories, subcategories, price ranges, and things like that. What if you would like to not only get information about counts

  • Using Solr’s ComplexPhraseQueryParser

    By: Vijay Mhaskar | July 10, 2015

    Introduction: ComplexPhraseQuery allows complex phrase query syntax, e.g. “canc* treat*”. It performs multiple passes over the query text to parse any nested logic in PhraseQueries. The first pass takes any PhraseQuery content between quotes and

  • Faceted Search using Solr

    By: Dattatraya Patil | July 3, 2015

    Faceting: Faceted search (also called faceted navigation, guided navigation, or parametric search) breaks up search results into multiple categories, typically showing counts for each category, and allows the user to “drill down” or further

  • Measuring Search Relevance using NDCG

    By: Vijay Mhaskar | June 28, 2015

    Normalized Discounted Cumulative Gain (NDCG) is a popular method for measuring the quality of a set of search results. It asserts the following: Very relevant results are more useful than somewhat relevant results, which are more

  • Spatial Search with Solr

    By: Dattatraya Patil | June 26, 2015

    In this article we will see how Solr supports spatial search. Spatial Search: Solr supports location data for use in spatial/geospatial searches. Using spatial search, you can: index points or other shapes; filter

  • Solr Terms Component usage

    By: Dattatraya Patil | June 19, 2015

    In this article we will see how Solr's Terms Component can be used for building an auto-suggest feature and a browse-index feature. Terms Component: The Terms Component returns information about indexed terms in a field

  • Solr 5.3: Execute SQL queries

    By: Dikshant Shahi | June 19, 2015

    SQL is the most widely used language for querying data and is the natural choice of data analysts. Its acceptance is so wide that projects like Apache Hive were born for the purpose, which

  • Query Rescoring in Solr

    By: Vijay Mhaskar | June 19, 2015

    Introduction: Sometimes relevance requirements are very complex and create performance issues during execution. A very nice feature introduced in Solr 4.9, called “Query Reranking/Rescoring” (SOLR-6088), allows us to run our query with a less

  • Solr HyperLogLog

    By: Dikshant Shahi | June 12, 2015

    Solr 5.2 introduces HyperLogLog, a probabilistic approach for counting distinct values. Solr already had a provision to count distinct values using the unique facet function or the countDistinct LocalParam in the stats component. But this approach doesn’t scale well, as

  • Understanding Solr Explain

    By: Dattatraya Patil | June 12, 2015

    In this article, I will explain how to read the information in Solr's explain output. When we search documents in Solr, the documents in the result are in descending order of their scores. If we want