Main Page | Namespace List | Class Hierarchy | Class List | File List | Namespace Members | Class Members | File Members | Related Pages

lemur::parse::IndriTextHandler Class Reference

#include <IndriTextHandler.hpp>

Inheritance diagram for lemur::parse::IndriTextHandler:

lemur::api::TextHandler List of all members.

Public Member Functions

 IndriTextHandler (const string &name, int memory, const lemur::api::Parser *p)
 ~IndriTextHandler ()
char * handleDoc (char *docno)
 Signal the start of a document.

void handleEndDoc ()
 Handle end of document.

char * handleWord (char *word, const char *original, PropertyList *list)
 Handles a word to be indexed.

char * handleBeginTag (char *tag, const char *orig, PropertyList *props)
 Handle a begin tag.

char * handleEndTag (char *tag, const char *orig, PropertyList *props)
 Handle an end tag.


Protected Attributes

indri::api::IndexEnvironment env
 The indexer.

indri::api::ParsedDocument document
 Document properties object to be reused over repeated calls.

indri::parse::MetadataPair docid
 Handle the document id.

char * curdocno
 the current document id

int docbegin
 the beginning byte offset into file of this documents

const lemur::api::Parserparser
 We need a handle to the parser to get file byte locations.


Constructor & Destructor Documentation

lemur::parse::IndriTextHandler::IndriTextHandler const string &  name,
int  memory,
const lemur::api::Parser p
 

Create an IndriIndex (repository) with the given name Pass in the parser being used at the beginning of the chain

lemur::parse::IndriTextHandler::~IndriTextHandler  ) 
 


Member Function Documentation

char * lemur::parse::IndriTextHandler::handleBeginTag char *  tag,
const char *  orig,
PropertyList props
[virtual]
 

Handle a begin tag.

Reimplemented from lemur::api::TextHandler.

char * lemur::parse::IndriTextHandler::handleDoc char *  docno  )  [virtual]
 

Signal the start of a document.

Reimplemented from lemur::api::TextHandler.

void lemur::parse::IndriTextHandler::handleEndDoc  )  [virtual]
 

Handle end of document.

Reimplemented from lemur::api::TextHandler.

char * lemur::parse::IndriTextHandler::handleEndTag char *  tag,
const char *  orig,
PropertyList props
[virtual]
 

Handle an end tag.

Reimplemented from lemur::api::TextHandler.

char * lemur::parse::IndriTextHandler::handleWord char *  word,
const char *  original,
PropertyList list
[virtual]
 

Handles a word to be indexed.

Reimplemented from lemur::api::TextHandler.


Member Data Documentation

char* lemur::parse::IndriTextHandler::curdocno [protected]
 

the current document id

int lemur::parse::IndriTextHandler::docbegin [protected]
 

the beginning byte offset into file of this documents

indri::parse::MetadataPair lemur::parse::IndriTextHandler::docid [protected]
 

Handle the document id.

indri::api::ParsedDocument lemur::parse::IndriTextHandler::document [protected]
 

Document properties object to be reused over repeated calls.

indri::api::IndexEnvironment lemur::parse::IndriTextHandler::env [protected]
 

The indexer.

const lemur::api::Parser* lemur::parse::IndriTextHandler::parser [protected]
 

We need a handle to the parser to get file byte locations.


The documentation for this class was generated from the following files:
Generated on Tue Jun 15 11:03:06 2010 for Lemur by doxygen 1.3.4