Index
All Packages
All Categories
By Author

ap (3)
cp (3)
dp (3)
exe (3)
gui (0)
gui/gtk (0)
gui/tk (4)
io (1)
lib (11)
math (0)
net (9)
nlp (18)
op (4)
os (2)
program (3)
sp (2)
tool (9)
wp (2)
xml (2)

Porter Stemmer

type:package
id:mogul:/lager/porter-stemmer
section:mogul:/lager
blurb:A Porter stemmer written in C and linked into Oz
author:Torbjörn Lager
category:nlp
documentation:index.html
download:lager-porter-stemmer__1.2.5__source__0.pkg
lager-porter-stemmer__1.3.0__source__0.pkg
provides:[nlp] x-ozlib://lager/porter-stemmer/EnglishStemmer.so{native}
[nlp] x-ozlib://lager/porter-stemmer/stem.exe

This native functor creates a module that exports a function which performs stemming by means of the Porter stemming algorithm. Quoting Martin Porter himself:

The Porter stemming algorithm (or 'Porter stemmer') is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalisation process that is usually done when setting up Information Retrieval systems.

The algorithm was originally described in Porter, M.F., 1980, An algorithm for suffix stripping, Program, 14(3) :130-137. It has since been reprinted in Sparck Jones, Karen, and Peter Willet, 1997, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, ISBN 1-55860-454-4.

This implementation simply links a C-implementation (written by Martin Porter) into Oz.