David A. Hull

CLARIT Experiments in Batch Filtering: (2003)

David A. Evans, James Shanahan, Norbert Roma, Jeffrey Bennett, Victor Sheftel, Emilia Stoica, ...

The Clairvoyance team participated in the Filtering Track, submitting two runs in the Batch Filtering category. While we have been exploring the question of both topic modeling and...

The TREC-9 Filtering Track Final Report (2001)

Stephen Robertson, David A. Hull

The TREC-9 filtering track measures the ability of systems to build persistent user profiles which successfully separate relevant and non-relevant documents. It consists of three major subtasks:...

Method Combination For Document Filtering (2001)

David A. Hull, Jan O. Pedersen, Hinrich Schutze

There is strong empirical and theoretic evidence that combination of retrieval methods can improve performance. In this paper, we systematically compare combination strategies in the context of...

Information Retrieval Using Statistical Classification (2000)

David A. Hull, Jerome Friedman

In the classical information retrieval (IR) problem, the system must find all documents in a collection that are related to a topic defined by a user's query. A common approach to the IR problem is...

The TREC-8 Filtering Track Final Report (2000)

David A. Hull, Stephen Robertson

The TREC-8 filtering track measures the ability of systems to build persistent user profiles which successfully separate relevant and non-relevant documents. It consists of three major subtasks:...

Xerox TREC-8 Question Answering Track Report (2000)

David A. Hull

This report describes the Xerox work on the TREC-8 Question Answering Track. We linked together a few basic NLP components (a question parser, a sentence boundary identifier, and a proper noun...

Term Alignment in Use: Machine-Aided Human Translation (1999)

Eric Gaussier, David A. Hull, Salah Ait-Mokhtar

Keywords: Machine-Aided Human Translation, Translation Memory, Word Alignment, Terminology Extraction. Parallel texts are a resource with many interesting applications. In this chapter,...

The TREC-7 Filtering Track: Description and Analysis (1999)

David A. Hull

This article describes the experiments conducted in the TREC-7 filtering track, which consisted of three subtasks: adaptive filtering, batch filtering, and routing. The focus this year is on adaptive...

The TREC-6 Filtering Track: Description and Analysis (1998)

David A. Hull

This article details the experiments conducted in the TREC-6 filtering track. The filtering track is an extension of the routing track which adds time sequencing of the document stream and set-based...

Xerox TREC-6 Site Report: Cross Language Text Retrieval (1998)

Eric Gaussier, Gregory Grefenstette, David A. Hull, B. Maximilian Schulze

Xerox participated in the Cross Language Information Retrieval (CLIR) track of TREC-6. This track examines the problem of retrieving documents written in one language using queries written in another...

Xerox TREC-5 Site Report: Routing, Filtering, NLP, and Spanish Tracks (1997)

David A. Hull, Gregory Grefenstette, B. Maximilian Schulze, Eric Gaussier, Hinrich Schutze

This report is divided into three sections. The first section describes our work on routing and filtering (Hull, Schutze, and Pedersen), the second section covers the NLP track (Grefenstette,...

Using Structured Queries for Disambiguation in Cross-Language Information Retrieval (1997)

David A. Hull

Bilingual transfer dictionaries are an important resource for query translation in cross-language text retrieval. However, term translation is not an isomorphic process, so dictionary-based systems...

A Comparison of Classifiers and Document Representations for the Routing Problem (1996)

Hinrich Schutze, David A. Hull, Jan O. Pedersen

In this paper, we compare learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem. We consider three classification...

Experiments in Multilingual Information Retrieval (1996)

David A. Hull, Gregory Grefenstette

The multilingual information retrieval system of the future will need to be able to retrieve documents across language boundaries. This extension of the classical IR problem is particularly...

Stemming Algorithms - A Case Study for Detailed Evaluation (1996)

David A. Hull

The majority of information retrieval experiments are evaluated by measures such as average precision and average recall. Fundamental decisions about the superiority of one retrieval technique over...

A Detailed Analysis of English Stemming Algorithms (1996)

David A. Hull, Gregory Grefenstette

We present a study comparing the performance of traditional stemming algorithms based on suffix removal to linguistic methods performing morphological analysis. The results indicate that most...

A Comparison of Classifiers and Document Representations for the Routing Problem (1995)

Hinrich Schutze, David A. Hull, Jan O. Pedersen

In this paper, we compare learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem. We consider three classification...

Information Retrieval Using Statistical Classification (1994)

David A. Hull, Stanford University, Dept. of Statistics

Submitted to the Department of Statistics.