Posts

  • Indexing Rich Documents in Solr

    Tags: | |
    This week I integrated Apache Tika into Moodle to support indexing of Rich Documents like .PDF, .DOC, .PPT etc. Solr’s ExtractingRequestHandler uses Tika allowing users to upload binary files to Solr and have Solr extract text from it and then index it, making them searchable. One has to send the...
    Read more...
  • Global Search API

    Tags: |
    This week I started off starting the Search API functions for Global Search. The idea is to code 3 functions for each module. These will be written in the module’s lib.php file. _get_iterator($from=0) _search_get_documents($id) _search_access($id) The former two functions are used while indexing records while the last one is used...
    Read more...
  • Deleting Solr Index in Global Search

    Tags: |
    I implemented the functionality of allowing the admin to delete solr index recently. The code can be seen here. Solr provides a simple way of deleting indexing using SolrClient::deleteByQuery. I have provided two types of deleting index: Deleting the complete index in one-go. Deleteing index modularily. (For example, deleting index...
    Read more...
  • Updating Solr Index in Global Search

    Tags: |
    Previous week, I started coding the admin page for Global Search. Here are the three indexing configurations that I’ve planned to implement: Adding new documents. (This will be written such that the indexing is resumed from a previous run). Deleting index. Updating index for the updated records For updating index...
    Read more...
  • Handling security in Global Search

    Tags: |
    Handling security issues will be an integral part of Global Search. Last thing we want is users getting access to prohibited records through search. It will be a huge blow to the project if users get access to documents that they are not premissible to view. For this, the solution...
    Read more...