
Journal of Documentation

ISSN: 0022-0418

Online from: 1945

Subject Area: Library and Information Studies


A STEMMING ALGORITHM FOR LATIN TEXT DATABASES


Document Information:
Title: A STEMMING ALGORITHM FOR LATIN TEXT DATABASES
Author(s): ROBYN SCHINKE (Humanities Research Institute and Departments of History, University of Sheffield, Western Bank, Sheffield S10 2TN), MARK GREENGRASS (Humanities Research Institute and Departments of History, University of Sheffield, Western Bank, Sheffield S10 2TN), ALEXANDER M. ROBERTSON (Information Studies, University of Sheffield, Western Bank, Sheffield S10 2TN), PETER WILLETT (Information Studies, University of Sheffield, Western Bank, Sheffield S10 2TN)
Citation: ROBYN SCHINKE, MARK GREENGRASS, ALEXANDER M. ROBERTSON, PETER WILLETT (1996), "A STEMMING ALGORITHM FOR LATIN TEXT DATABASES", Journal of Documentation, Vol. 52 Iss. 2, pp. 172-187
Article type: General review
DOI: 10.1108/eb026966
Publisher: MCB UP Ltd
Abstract: This paper describes the design of a stemming algorithm for searching databases of Latin text. The algorithm uses a simple longest-match approach with some recoding but differs from most stemmers in its use of two separate suffix dictionaries (one for nouns and adjectives and one for verbs) for processing query and database words. These dictionaries and the associated stemming rules are arranged in such a way that the stemmer does not need to know the grammatical category of the word that is being stemmed. It is very easy to overstem in Latin: the stemmer developed here tends, rather, towards understemming, leaving sufficient grammatical information attached to the stems resulting from its use to enable users to pursue very specific searches for single grammatical forms of individual words.
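As a rough illustration of the approach the abstract outlines, the sketch below applies longest-match suffix removal against two separate suffix lists and returns both candidate stems, so the caller never has to state the word's grammatical category. It is a minimal sketch under stated assumptions, not the published algorithm: the suffix lists are tiny illustrative samples rather than the paper's dictionaries, the names `NOUN_SUFFIXES`, `VERB_SUFFIXES`, `strip_longest` and `stem` are hypothetical, the minimum stem length of two letters is assumed, and the recoding step mentioned in the abstract is not modelled.

```python
# Illustrative samples only; these are NOT the suffix dictionaries from the paper.
NOUN_SUFFIXES = ["ibus", "orum", "arum", "ae", "am", "as", "em", "es",
                 "is", "os", "um", "us", "a", "e", "i", "o", "u"]
VERB_SUFFIXES = ["mini", "ntur", "mus", "mur", "tis", "tur", "ris",
                 "nt", "m", "s", "t", "o", "r"]


def strip_longest(word, suffixes, min_stem=2):
    """Remove the longest listed suffix that leaves at least min_stem letters.

    min_stem is an assumed guard against stripping a word down to nothing.
    """
    for suffix in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[:-len(suffix)]
    return word  # no suffix matched: leave the word unchanged


def stem(word):
    """Return (noun_stem, verb_stem) without knowing the word's category."""
    w = word.lower()
    return strip_longest(w, NOUN_SUFFIXES), strip_longest(w, VERB_SUFFIXES)


print(stem("puellarum"))  # ('puell', 'puellaru')
print(stem("amant"))      # ('amant', 'ama')
```

Producing one stem per suffix list is only one way to avoid needing the grammatical category; the paper's own arrangement of dictionaries and rules may differ. Matching the longest suffix first and leaving the word untouched when nothing matches leans towards understemming, in line with the behaviour the abstract describes.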



