Sentence Splitter

type:package
id:mogul:/lager/sentence-splitter
section:mogul:/lager
blurb:A simple sentence splitter
author:Torbjörn Lager
category:nlp
documentation:index.html
download:lager-sentence-splitter__1.2.5__source__0.pkg
         lager-sentence-splitter__1.3.0__source__0.pkg
provides:[nlp] x-ozlib://lager/sentence-splitter/SentenceSplitter.ozf
         [nlp] x-ozlib://lager/sentence-splitter/split.exe

This is a simple sentence splitter for English. Given a string, assumed to be English text, it returns a list of strings, where each element is an English sentence. By default, it treats occurrences of '.', '?' and '!' as sentence delimiters, but does its best to determine when an occurrence of '.' does not have this role (e.g. in abbreviations, URLs, numbers, etc.). Although the splitter is designed to work for English, it should be straightforward to adapt to other (similar) languages.
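
The behaviour described above can be illustrated with a minimal sketch. It is written in Python, not Oz, and is not the package's implementation: the function name `split_sentences`, the `ABBREVIATIONS` set and the rule "a delimiter only counts when followed by whitespace or the end of the text" are all assumptions chosen for illustration.

```python
import re

# Hypothetical abbreviation list, for illustration only; a real splitter
# would need a much larger, language-specific list.
ABBREVIATIONS = {"e.g.", "i.e.", "etc.", "mr.", "mrs.", "dr.", "prof.", "vs."}


def split_sentences(text):
    """Split text into sentences on runs of '.', '?' and '!'."""
    sentences = []
    start = 0
    for match in re.finditer(r"[.?!]+", text):
        end = match.end()
        # A delimiter inside a token (as in "3.50" or "example.com") is
        # followed by a non-space character, so it is not a boundary.
        if end < len(text) and not text[end].isspace():
            continue
        # The last whitespace-separated token before the boundary,
        # lowercased and stripped of leading brackets or quotes.
        token = text[start:end].split()[-1].lower().lstrip("(\"'")
        if token in ABBREVIATIONS:
            continue
        sentence = text[start:end].strip()
        if sentence:
            sentences.append(sentence)
        start = end
    # Keep any trailing text that has no closing delimiter.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


if __name__ == "__main__":
    sample = ("Dr. Smith paid 3.50 dollars. "
              "See http://example.com/a.html for details! Is that OK?")
    for sentence in split_sentences(sample):
        print(sentence)
```

Adapting a sketch like this to another language would mostly mean replacing the abbreviation list. One known limitation of this approach is that an abbreviation that really does end a sentence (for instance a final "etc.") is not treated as a boundary.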