CLARIT Experiments in Batch Filtering: (2003)
David A. Evans, James Shanahan, Norbert Roma, Jeffrey Bennett, Victor Sheftel, Emilia Stoica, ...
Introduction The Clairvoyance team participated in the Filtering Track, submitting two runs in the Batch Filtering category. While we have been exploring the question of both topic modeling and...
The TREC--9 Filtering Track Final Report (2001)
Stephen Robertson, David A. Hull
The TREC--9 filtering track measures the ability of systems to build persistent user profiles which successfully separate relevant and non-relevant documents. It consists of three major subtasks:...
Method Combination For Document Filtering (2001)
David A. Hull, Jan O. Pedersen, Hinrich Schutze
There is strong empirical and theoretic evidence that combination of retrieval methods can improve performance. In this paper, we systematically compare combination strategies in the context of...
Information Retrieval Using Statistical Classification (2000)
David A. Hull, Jerome Friedman
In the classical information retrieval (IR) problem, the system must find all documents in a collection that are related to a topic defined by a user's query. A common approach to the IR problem is...
The TREC-8 Filtering Track Final Report (2000)
David A. Hull, Stephen Robertson
The TREC-8 filtering track measures the ability of systems to build persistent user profiles which successfully separate relevant and non-relevant documents. It consists of three major subtasks:...
The TREC-8 Filtering Track Final Report (2000)
David A. Hull, Stephen Robertson
The TREC-8 #ltering track measures the ability of systems to build persistent user pro#les which successfully separate relevant and non-relevant documents. It consists of three major subtasks:...
Xerox TREC-8 Question Answering Track Report (2000)
This report describes the Xerox work on the TREC-8 Question Answering Track. We linked together a few basic NLP components (a question parser, a sentence boundary identifier, and a proper noun...
Term Alignment in Use: Machine-Aided Human Translation (1999)
Eric Gaussier, David A. Hull, Salah Ait-mokhtar
Keywords: Machine-Aided Human Translation, Translation Memory, Word Alignment, Terminology Extraction 1 Introduction Parallel texts are a resource with many interesting applications. In this chapter,...
Term Alignment in Use: Machine-Aided Human Translation (1999)
Eric Gaussier, David A. Hull, Salah Ait-mokhtar
Keywords: Machine-Aided Human Translation, Translation Memory, Word Alignment, Terminology
The TREC-7 Filtering Track: Description and Analysis (1999)
This article describes the experiments conducted in the TREC-7 filtering track, which consisted of three subtasks: adaptive filtering, batch filtering, and routing. The focus this year is on adaptive...
The TREC-6 Filtering Track: Description and Analysis (1998)
This article details the experiments conducted in the TREC-6 filtering track. The filtering track is an extension of the routing track which adds time sequencing of the document stream and set-based...
Xerox TREC-6 Site Report: Cross Language Text Retrieval (1998)
Eric Gaussier, Gregory Grefenstette, David A. Hull, B. Maximilian Schulze
Xerox participated in the Cross Language Information Retrieval (CLIR) track of TREC-6. This track examines the problem of retrieving documents written in one language using queries written in another...
The TREC-6 Filtering Track: Description and Analysis (1998)
This article details the experiments conducted in the TREC-6 filtering track. The filtering track is an extension of the routing track which adds time sequencing of the document stream and set-based...
Xerox TREC-6 Site Report: Cross Language Text Retrieval (1998)
Eric Gaussier, Gregory Grefenstette, David A. Hull, B. Maximilian Schulze
Xerox participated in the Cross Language Information Retrieval (CLIR) track of TREC-6. This track examines the problem of retrieving documents written in one language using queries written in another...
Xerox TREC-5 Site Report: Routing, Filtering, NLP, and Spanish Tracks (1997)
David A. Hull, Gregory Grefenstette, B. Maximilian Schulze, Eric Gaussier, Hinrich Schutze
this report is divided into three sections. The first section describes our work on routing and filtering (Hull, Schutze, and Pedersen), the second section covers the NLP track (Grefenstette,...
Using Structured Queries for Disambiguation in Cross-Language Information Retrieval (1997)
Bilingual transfer dictionaries are an important resource for query translation in cross-language text retrieval. However, term translation is not an isomorphic process, so dictionary-based systems...
Using Structured Queries for Disambiguation in Cross-Language Information Retrieval (1997)
Bilingual transfer dictionaries are an important resource for query translation in cross-language text retrieval. However, term translation is not an isomorphic process, so dictionary-based systems...
A Comparison of Classifiers and Document Representations for the Routing Problem (1996)
Hinrich Sch Utze, David A. Hull, Jan O. Pedersen
In this paper, we compare learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem. We consider three classification...
A Comparison of Classifiers and Document Representations for the Routing Problem (1996)
Hinrich Sch Utze, David A. Hull, Jan O. Pedersen
In this paper, we compare learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem. We consider three classification...
Experiments in Multilingual Information Retrieval (1996)
David A. Hull, Gregory Grefenstette
The multilingual information retrieval system of the future will need to be able to retrieve documents across language boundaries. This extension of the classical IR problem is particularly...
Stemming Algorithms - A Case Study for Detailed Evaluation (1996)
The majority of information retrieval experiments are evaluated by measures such as average precision and average recall. Fundamental decisions about the superiority of one retrieval technique over...
A Detailed Analysis of English Stemming Algorithms (1996)
David A. Hull, Gregory Grefenstette
We present a study comparing the performance of traditional stemming algorithms based on suffix removal to linguistic methods performing morphological analysis. The results indicate that most...
A Comparison of Classifiers and Document Representations for the Routing Problem (1995)
Hinrich Sch Utze, David A. Hull, Jan O. Pedersen
In this paper, we compare learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem. We consider three classification...
Information retrieval using statistical classification /--by David A. Hull. (1994)
Hull, David A., Stanford University.--Dept. Of Statistics.
Submitted to the Department of Statistics.
Typescript.