• Using Solr and TikaOCR to search text inside an image.

    Vijay Mhaskar

      Tesseract is probably the most accurate open-source OCR engine available, and as of Apache Tika 1.7 you can use the Tesseract OCR parser from within Tika. Solr 5.x supports Tika 1.7.

  • Using Solr’s ComplexPhraseQueryParser

    Vijay Mhaskar

    Introduction: ComplexPhraseQuery allows complex phrase query syntax, e.g. “canc* treat*”. It performs multiple passes over the query text to parse any nested logic in PhraseQueries. The first pass takes any PhraseQuery content between quotes and
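
    As a rough illustration of the wildcard-phrase syntax mentioned in this excerpt, the sketch below builds a Solr select URL that routes the query through the complex phrase parser via local params. The host, core name (`mycore`), field name (`text`), and the helper `complex_phrase_url` are assumptions made for the example; they do not come from the post.

```python
from urllib.parse import urlencode, parse_qs, urlsplit


def complex_phrase_url(base_url, field, phrase, in_order=True):
    """Build a select URL that uses Solr's complexphrase query parser.

    base_url, field and this helper's name are hypothetical; adjust them
    to your own Solr core and schema.
    """
    # The {!complexphrase ...} local params select the parser;
    # inOrder controls whether the terms must match in the given order.
    q = '{{!complexphrase inOrder={}}}{}:"{}"'.format(
        str(in_order).lower(), field, phrase
    )
    # urlencode percent-escapes the braces, quotes and wildcards.
    return base_url + "?" + urlencode({"q": q, "wt": "json"})


# Assumed local Solr instance and core name.
url = complex_phrase_url(
    "http://localhost:8983/solr/mycore/select", "text", "canc* treat*"
)
decoded_q = parse_qs(urlsplit(url).query)["q"][0]
print(decoded_q)
```

    The resulting URL can be fetched with any HTTP client. Building the query string with `urlencode` rather than by hand avoids escaping mistakes with the braces and quotes.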