News
Features
The Lemur Toolkit
Indri Search Engine
Lemur Query Log Toolbar
Lemur Wiki
Download
People
Discussion
Archived Forums
Tutorials
Sign Up

 
CMU - Language Technologies Institute
Carnegie Mellon University
CIIR, University of Massachusetts Amherst
University of Massachusetts
 

The Lemur Project is sponsored by the Advanced Research and Development Activity in Information Technology (ARDA) under its Statistical Language Modeling for Information Retrieval Research Program and by the National Science Foundation.


Note: These tutorials are out of date, please see the Lemur Wiki instead.


Lemur Project Tutorials:
Starting Out

Retrieval: Batch Retrieval


Contents

  1. IndriRunQuery
  2. RetEval

IndriRunQuery

IndriRunQuery is a command-line interface for batch retrieval via the Indri query interface. To issue a query via the IndriRunQuery application, you need to create a parameter file, much like one that was created to build an index, and is run by executing "IndriRunQuery <parameter_file>"

At the most basic, an IndriRunQuery parameter file should consist of a memory parameter, an index path, and a query. As an example:

<parameters>
	<memory>256M</memory>
	<index>/path/to/the/index</index>
	<query>the query to issue</query>
</parameters>
	

If you do not want to create a parameter file, the various paramters can be issued from the command line. To issue the same parameters from the example above:
IndriRunIndex -memory=256M -index=/path/to/the/index -query="the query to issue".

For a full listing of available parameters to use with IndriRunQuery, see the API documentation.

RetEval

Much like IndriRunQuery above, the RetEval application is a command-line interface for batch retrieval, but permits querying via other methods such as TF/IDF, Okapi and KL-Divergence.

To run RetEval, issue the command "RetEval <parameter_file>" where the <parameter_file> is the path to the parameter file used for your queries. The parameter file's structure and options are below:

retModel

This is the retrieval model to use. It currently supports six different models:

Other common parameters (for all retrieval methods) are:

Model-specific parameters are: