Main Page | Namespace List | Class Hierarchy | Class List | File List | Namespace Members | Class Members | File Members | Related Pages

lemur::parse::PropIndexTH Class Reference

#include <PropIndexTH.hpp>

Inheritance diagram for lemur::parse::PropIndexTH:

lemur::api::TextHandler List of all members.

Public Member Functions

 PropIndexTH ()
 PropIndexTH (const string &filename, int bufferSize, bool countStopWds)
 ~PropIndexTH ()
char * handleDoc (char *docno)
 Pushes the doc into the InvFPPushIndex.

char * handleWord (char *word, const char *original, PropertyList *list)
 Pushes the word into the InvFPPushIndex.

void setDocManager (const string &mgrID)
 passes mgrID onto index


Protected Member Functions

void endDoc ()
 Ends a document in the collection.

void endCollection ()

Protected Attributes

lemur::index::PushIndexindex
 The indexer.

DocumentPropsdp
 Document properties object to be reused over repeated calls.

lemur::index::InvFPTermterm
 Term object to be reused over repeated calls.

int docLength
 Keeps track of document length.

bool countStopWds
 Indicates whether or not to count stop words in doc length.

bool first

Detailed Description

PropIndexTH builds a positional indexer. This class is a destination TextHandler. It expects the position of the token to be given (not counted) and will index certain properties if passed in the PropertyList. PropertyList *must* have one property called "position" with the token position. Any and ALL other properties of type STRING will be indexed at the same position as the token, but added after the token. This is usually the last object in TextHandler chain. If a property should NOT be indexed, remove it before it gets here.


Constructor & Destructor Documentation

lemur::parse::PropIndexTH::PropIndexTH  )  [inline]
 

lemur::parse::PropIndexTH::PropIndexTH const string &  filename,
int  bufferSize,
bool  countStopWds
 

Create a PropIndexTH with index name filename (minus the .ifp extension) and specified buffer size.

lemur::parse::PropIndexTH::~PropIndexTH  ) 
 


Member Function Documentation

void lemur::parse::PropIndexTH::endCollection  )  [protected]
 

void lemur::parse::PropIndexTH::endDoc  )  [protected]
 

Ends a document in the collection.

char * lemur::parse::PropIndexTH::handleDoc char *  docno  )  [virtual]
 

Pushes the doc into the InvFPPushIndex.

Reimplemented from lemur::api::TextHandler.

char * lemur::parse::PropIndexTH::handleWord char *  word,
const char *  original,
PropertyList list
[virtual]
 

Pushes the word into the InvFPPushIndex.

Reimplemented from lemur::api::TextHandler.

void lemur::parse::PropIndexTH::setDocManager const string &  mgrID  ) 
 

passes mgrID onto index


Member Data Documentation

bool lemur::parse::PropIndexTH::countStopWds [protected]
 

Indicates whether or not to count stop words in doc length.

int lemur::parse::PropIndexTH::docLength [protected]
 

Keeps track of document length.

DocumentProps* lemur::parse::PropIndexTH::dp [protected]
 

Document properties object to be reused over repeated calls.

bool lemur::parse::PropIndexTH::first [protected]
 

Indicates whether object is before first doc true = yes, false = foundDoc has been called once.

lemur::index::PushIndex* lemur::parse::PropIndexTH::index [protected]
 

The indexer.

lemur::index::InvFPTerm* lemur::parse::PropIndexTH::term [protected]
 

Term object to be reused over repeated calls.


The documentation for this class was generated from the following files:
Generated on Tue Jun 15 11:03:06 2010 for Lemur by doxygen 1.3.4