
====== Crawler Plugins ======

Crawler plugins hook into the [[project_info:crawling_process|crawling process]] in order to add advanced functionality.

==== What can crawler plugins do? ====

Some examples:

  * Modify the result of preparators
    * by specifying default values if the chosen preparator does not fill in a certain field (''onBeforePrepare'')
    * by overriding or modifying the results of whatever preparator was chosen (''onAfterPrepare'')
  * Modify how those results are stored in the Lucene index
  * Perform an action at every start or end of the crawling process (e.g. inform the administrator via email)

==== How to create a crawler plugin ====

To create a crawler plugin, write a class that implements the interface ''CrawlerPlugin'', package it as a .jar, and drop it into the ''plugins'' directory.

==== Existing Plugins ====

  * Create thumbnails of indexed documents
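
==== Example: a minimal plugin sketch ====

The two preparator hooks named above, ''onBeforePrepare'' and ''onAfterPrepare'', can be illustrated with a small self-contained sketch. This page does not show the real method signatures of ''CrawlerPlugin'', so the nested interface below is only a simplified stand-in: the ''Map''-based parameters, the class names ''PluginSketch'' and ''DefaultFieldsPlugin'', the ''process'' helper, and the field names ''author'' and ''title'' are all assumptions made for illustration. A real plugin would implement the actual ''CrawlerPlugin'' interface instead.

```java
import java.util.HashMap;
import java.util.Map;

public class PluginSketch {

    /**
     * Simplified stand-in for the real CrawlerPlugin interface.
     * ASSUMPTION: the real interface's method signatures are not shown
     * on this page and will differ from these Map-based ones.
     */
    public interface CrawlerPlugin {
        void onBeforePrepare(Map<String, String> fields);
        void onAfterPrepare(Map<String, String> fields);
    }

    /** Hypothetical plugin: supplies a default author and trims the title. */
    public static class DefaultFieldsPlugin implements CrawlerPlugin {
        @Override
        public void onBeforePrepare(Map<String, String> fields) {
            // Set a default value; a preparator that fills this field
            // afterwards simply overwrites it.
            fields.putIfAbsent("author", "unknown");
        }

        @Override
        public void onAfterPrepare(Map<String, String> fields) {
            // Modify whatever the preparator produced.
            fields.computeIfPresent("title", (key, value) -> value.trim());
        }
    }

    /** Simulates one document passing through both hooks around a fake preparator. */
    public static Map<String, String> process(CrawlerPlugin plugin) {
        Map<String, String> fields = new HashMap<>();
        plugin.onBeforePrepare(fields);
        // Fake preparator output: fills "title" but leaves "author" alone.
        fields.put("title", "  Annual Report  ");
        plugin.onAfterPrepare(fields);
        return fields;
    }

    public static void main(String[] args) {
        System.out.println(process(new DefaultFieldsPlugin()));
    }
}
```

In this sketch the fake preparator leaves ''author'' unset, so the default from ''onBeforePrepare'' survives, while the ''title'' it does produce is cleaned up in ''onAfterPrepare''. To deploy a real plugin, the compiled class would be packaged as a .jar and dropped into the ''plugins'' directory, as described above.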

components/crawler_plugins.1311851227.txt.gz · Last modified: 2024/09/18 08:31 (external edit)