« The Functions and Implementation Method of "Thematic Mapping Search" in "Net-de-hyakka" | Main | A Method of Geographical Name Extraction from Japanese Text for Thematic Geographical Search »

Methods of Extracting Year References for Chronological-table-generating Text Searching

Kanada, Y., International Symposium on Digital Library 1999, pp. 135-142, 1999

[ 日本語のページ ]
[ Paper PDF file ] [ Paper PostScript file ]

Abstract: A method of extracting year references for a textual information retrieval method called the thematic chronological-table search method is explained in this paper. This search method generates an index by extracting and collecting year references from a text collection. The resulting index and a full-text index are used for searching statements that contain year references and search words. The results are displayed in the form of a chronological table with hyperlinks to the original text. Seven forms of year or century references are extracted and normalized using string matching patterns. The extraction error rate is reduced by using both local and nonlocal contexts. If the lower two digits of a Gregorian year, which matches a form, occurs, it is normalized by supplementing the upper digits using the non-local context. This method has been applied to a Japanese encyclopedia. An evaluation shows the precision of extraction to be higher than 99% in most cases.

Introduction to this research theme: Axis-Specified Search (Thematic Search)

Keywords: Text search, Axis-specified search, Thematic chronological search, Year-axis search, Time-axis search, Encyclopedia search, Chronological information extraction, Information organization, Search result organization, Organizing search, Search result structurization, Structurizing search

Post a comment

About

This page contains a single entry from the blog posted on September 28, 1999 12:00 AM.

Many more can be found on the main index page or by looking through the archives.

(C) Copyright 2007 by Yasusi Kanada
Powered by
Movable Type 3.36