Language Technologies Institute, Carnegie Mellon University (CMU)
CIIR, University of Massachusetts Amherst
 

The Lemur Project is sponsored by the Advanced Research and Development Activity in Information Technology (ARDA) under its Statistical Language Modeling for Information Retrieval Research Program and by the National Science Foundation.

Lemur Language Modeling Toolkit (4.8)

Older versions:
1.0, 1.1, 1.9, 2.0, 2.1, 2.2, 3.0, 3.1,
4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7

Below are links to various pieces of documentation and examples of using the Lemur Toolkit. For further discussion and more technical details, please see the Lemur Wiki Pages.

Introduction to the Lemur Toolkit and Language Modeling

Overview An overview of what Lemur is, what it has, and how people use it.
Beginner's Guide to Indexing A Q/A beginner's guide to indexing, what Lemur can index, and what kind of indexes Lemur has.
Language Modeling Background A summary of recent research on language modeling and information retrieval.

Getting Started with the Lemur Toolkit and Indri Search Engine

Installing and Running Lemur Step-by-step instructions on how to compile, configure and test the toolkit.
Basic Components A description of the organization and design of the system, including the major classes and how they interact within the indexing, retrieval, and language modeling framework.
Working with offset annotations A guide to working with offset annotations in the Lemur Toolkit.
Installing and Running Lemur CGI and GUI Instructions on how to compile and use the CGI web interface and the stand-alone retrieval GUI, with tips on which indexes to use and how to handle query processing.
Indri More about Indri, Lemur's latest search engine that is also available on its own when all you need is a search engine. Indri has an index capable of indexing very large collections and a structured query language that supports fields and passages.
Lemur Tutorials A set of tutorials and trails to help you get started working with the Lemur Toolkit.

Lemur Reference Material

Lemur Modules and Applications General descriptions for the major modules in Lemur plus documentation for how to use all the Lemur applications. (Parsing and Preprocessing, Building/Adding to an index, General Retrieval and Evaluation, User Interfaces, Distributed IR and Query-based Sampling, Structured Query Language, Summarization, Clustering)
Basic Lemur Indexing and Retrieval API Documentation for the Application Program Interface, which allows users to build customized retrieval applications using language modeling or other methods.
API Programming Examples Examples that illustrate how to add new methods to the Lemur Toolkit while adhering to the API, and how to write programs that use the Lemur API.
Source Code/API Documentation Automatically generated documentation for all of the source code.

Lemur Support

Download Archive Download past versions of the Lemur Toolkit. The most recent version of the toolkit can always be found here.
Public Forum A forum where users and developers of the Lemur Toolkit discuss the software. Feel free to share tips on using Lemur and to ask questions. The developers of the toolkit monitor this forum regularly.
  Feel free to email us with questions, comments and feedback. For general questions, please use the forum so that others may also benefit from the answers.
Licensing Terms for using the Lemur Toolkit.

The Lemur Project
Last modified: December 22, 2008, 04:20:29 am