Main Page | Namespace List | Class Hierarchy | Class List | File List | Namespace Members | Class Members | File Members | Related Pages

indri::parse Namespace Reference

File input, parsing, stemming, and stopping classes. More...


Classes

class  AnchorTextAnnotator
class  AnchorTextHarvester
class  AnchorTextWriter
struct  AttributeValuePair
class  Combiner
struct  Combiner::strcompst
struct  Combiner::strhash
struct  Combiner::url_entry
class  Conflater
struct  Conflater::attribute_pattern
struct  Conflater::tag_pattern
struct  conflation_pair
struct  ConflationPattern
class  DateParse
class  DocumentIterator
class  DocumentIteratorFactory
struct  FileClassEnvironment
class  FileClassEnvironmentFactory
struct  FileClassEnvironmentFactory::Specification
 Parsing information for a file class. Used to create a FileClassEnvironment. More...

class  HTMLParser
class  KrovetzStemmer
struct  KrovetzStemmer::cacheEntry
 Two term hashtable entry for caching across calls. More...

struct  KrovetzStemmer::dictEntry
 Dictionary table entry. More...

struct  KrovetzStemmer::eqstr
class  KrovetzStemmerTransformation
class  LessTagExtent
class  MboxDocumentIterator
struct  MetadataPair
class  MetadataPair::key_equal
class  NormalizationTransformation
class  NumericFieldAnnotator
class  ObjectHandler
class  OffsetAnnotationAnnotator
struct  OffsetAnnotationAnnotator::ReadAnnotationTag
class  OffsetMetadataAnnotator
class  PageRank
class  pagerank
struct  pagerank::pagerank_greater
class  Parser
class  ParserFactory
class  PDFDocumentExtractor
class  Porter_Stemmer
class  PorterStemmerTransformation
class  prEntry
struct  prEntry::prEntry_greater
class  RawTextParser
class  StemmerFactory
class  StopperTransformation
struct  StopperTransformation::eqstr
class  Tag
struct  TagEvent
struct  TagExtent
struct  TagExtent::lowest_end_first
class  TaggedDocumentIterator
class  TaggedTextParser
struct  TaggedTextParser::tag_properties
class  TagList
struct  TagList::tag_entry
struct  TermExtent
class  TextDocumentExtractor
class  TextParser
class  TextTokenizer
struct  TokenizedDocument
class  Tokenizer
class  TokenizerFactory
class  Transformation
struct  UnparsedDocument
class  URLTextAnnotator
class  UTF8CaseNormalizationTransformation
class  UTF8Transcoder
class  WARCDocumentIterator
class  WARCRecord

Typedefs

typedef indri::parse::FileClassEnvironmentFactory::Specification Specification

Enumerations

enum  OffsetAnnotationIndexHint { OAHintDefault, OAHintOrderedAnnotations, OAHintSizeBuffers, OAHintNone }

Variables

const char * exceptions []
const struct conflation_pair conflations []
const char *const  headwords []


Detailed Description

File input, parsing, stemming, and stopping classes.

Typedef Documentation

typedef indri::parse::FileClassEnvironmentFactory::Specification indri::parse::Specification
 


Enumeration Type Documentation

enum indri::parse::OffsetAnnotationIndexHint
 

Enumeration values:
OAHintDefault 
OAHintOrderedAnnotations 
OAHintSizeBuffers 
OAHintNone 


Variable Documentation

const struct conflation_pair indri::parse::conflations[] [static]
 

const char* indri::parse::exceptions[] [static]
 

const char* const indri::parse::headwords[] [static]
 


Generated on Tue Jun 15 11:03:03 2010 for Lemur by doxygen 1.3.4