Lemur Language Modeling Toolkit (4.12)

Older versions:
1.0, 1.1, 1.9, 2.0, 2.1, 2.2, 3.0, 3.1,
4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 4.10 4.11

Introduction to the Lemur Toolkit and Language Modeling

Overview An overview of what Lemur is, what it has, and how people use it.
Beginner's Guide to Indexing A Q/A beginner's guide to indexing, what Lemur can index, and what kind of indexes Lemur has.
Language Modeling Background A summary on recent research on language modeling and information retrieval.

Getting Started with the Lemur Toolkit and Indri Search Engine

Installing and Running Lemur Step-by-step instructions on how to compile, configure and test the toolkit.
Basic Components A description of the organization and design of the sytem, including major classes and how they interact within the indexing, retrieval, and language modeling framework.
Indri More about Indri, Lemur's latest search engine that is also available on its own when all you need is a search engine. Indri has an index capable of indexing very large collections and a structured query language that supports fields and passages.
Working with offset annotations A guide to working with offset annotations in the Lemur Toolkit
Installing and Running Lemur CGI and GUI Instructions on how to compile and use the CGI web interface and stand-alone retrieval GUI. Also some tips about which indexes to use and how to deal with query processing.

Lemur Reference Material

Lemur Modules and Applications General descriptions for the major modules in Lemur plus documentation for how to use all the Lemur applications. (Parsing and Preprocessing, Building/Adding to an index, General Retrieval and Evaluation, User Interfaces, Distributed IR and Query-based Sampling, Structured Query Language, Summarization, Clustering)
Basic Lemur Indexing and Retrieval API Documentation for the Application Program Interface, which allows users to build customized retrieval applications using language modeling or other methods.
API Programming Examples A number of examples to illustrate how to add new methods to the Lemur toolkit, adhering to the API, and how to write programs that use the Lemur API.
Source Code/API Documentation Automatically generated documentation for all of the source code.

Lemur Support

Download Archive Download past versions of the Lemur Toolkit. The most recent version of the toolkit can always be found here.
Public Forum This forum is for the users and developers of the Lemur toolkit to discuss the software. Feel free to share tips on using Lemur as well as to ask questions. The developers of the toolkit monitor this forum on a regular basis.
Licensing Terms for using the Lemur Toolkit.