This section presents a collection of open source dictionary databases and software that you can download and use on your computer, mostly without an Internet connection.
A Dictionary Server Protocol, RFC 2229 <http://info.internet.isi.edu/in-notes/rfc/files/rfc2229.txt>
The Dictionary Server Protocol (DICT) is a TCP transaction-based query/response protocol that allows a client to access dictionary definitions from a set of natural language dictionary databases.
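
The query/response exchange described above can be sketched in Python. This is a minimal illustration, not a complete client: it assumes the default DICT port 2628 and the DEFINE command with its 150/151/250 status codes from RFC 2229. The function names (build_define, parse_definitions, define) are hypothetical and chosen only for this example, and the host name must be supplied by the caller.

```python
import socket

DICT_PORT = 2628  # default DICT port given in RFC 2229


def build_define(word, database="*"):
    """Build a DEFINE command line; "*" asks the server to search all databases."""
    # Quote the word so that multi-word terms travel as a single parameter.
    escaped = word.replace("\\", "\\\\").replace('"', '\\"')
    return f'DEFINE {database} "{escaped}"\r\n'.encode("utf-8")


def parse_definitions(lines):
    """Collect definition texts from response lines (line endings already removed)."""
    definitions = []
    current = None  # list of text lines while inside a definition, else None
    for line in lines:
        if current is None:
            code = line[:3]
            if code == "151":
                # 151 introduces one definition; its text follows.
                current = []
            elif code[:1] in ("2", "4", "5"):
                # Final status (for example 250 ok, or 552 no match): stop reading.
                break
        elif line == ".":
            # A lone period terminates the text of one definition.
            definitions.append("\n".join(current))
            current = None
        else:
            # A leading period in the text is sent doubled; undo that here.
            current.append(line[1:] if line.startswith("..") else line)
    return definitions


def define(word, host, database="*", port=DICT_PORT, timeout=10.0):
    """Look up `word` on a DICT server and return a list of definition texts."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        reader = sock.makefile("r", encoding="utf-8")
        banner = reader.readline()
        if not banner.startswith("220"):
            raise ConnectionError(f"unexpected banner: {banner!r}")
        sock.sendall(build_define(word, database))
        result = parse_definitions(line.rstrip("\r\n") for line in reader)
        sock.sendall(b"QUIT\r\n")
        return result
```

A call such as define("penguin", host="dict.org") would return the definition texts, assuming a DICT server is reachable at that host. The parsing step is kept separate from the socket code so that it can be checked against a canned response without any network access.
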
The SPECIALIST lexicon is a large syntactic lexicon of biomedical and general English, developed to provide the lexical information needed by the SPECIALIST Natural Language Processing (NLP) System, which includes SemRep, MetaMap, and the Lexical Tools. It is intended to be a general English lexicon that includes many biomedical terms; coverage includes both commonly occurring English words and biomedical vocabulary. The lexicon entry for each lexical item (word or term) records the syntactic, morphological, and orthographic information needed by the SPECIALIST NLP System. The Lexicon package, 2006 release, is available for download.
The SPECIALIST spelling resources include two programs: GSpell, a spelling suggestion tool, and BagOWordsPlus, a phrase retrieval tool. GSpell uses several word similarity algorithms to suggest correct spellings for misspelled words. Unlike other spelling suggestion programs, GSpell treats a space as it would any other letter, so it can correct errors in word compounding. GSpell can also be used in word similarity tasks that do not involve misspelling. BagOWordsPlus uses the word similarity algorithms of GSpell to perform word-similarity-based, phrase-level information retrieval.
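
The idea of treating a space like any other letter can be illustrated with a short Python sketch. This is not GSpell's actual implementation, whose similarity algorithms are not specified here; plain Levenshtein edit distance is swapped in as a stand-in, and the names edit_distance and suggest, as well as the max_distance cutoff, are assumptions made for this example. Because the space is just another character, a compounding error such as "invitro" is one insertion away from "in vitro".

```python
def edit_distance(a, b):
    """Levenshtein distance; every character, including a space, costs the same."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def suggest(query, candidates, max_distance=2):
    """Return candidates within max_distance of query, closest first."""
    scored = [(edit_distance(query.lower(), c.lower()), c) for c in candidates]
    return [c for d, c in sorted(scored) if d <= max_distance]


vocabulary = ["in vitro", "in vivo", "nitro"]
print(suggest("invitro", vocabulary))  # prints ['in vitro', 'nitro']
```

A tool that split its input on spaces before comparing would never consider "in vitro" as a candidate for the single token "invitro"; comparing whole strings character by character avoids that.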