20080314

Corpora: Word sets for use in patents

Various sources can be used to obtain a "core" corpus containing all English words, including the unabridged Oxford English Dictionary. One suitable source is Roget's International Thesaurus, which contains 256,000 words organized according to a systematic arrangement created by the meticulous Peter Mark Roget, a physician.
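As a rough illustration of how a category-organized source could be turned into a flat "core" word set, here is a minimal Python sketch. The `ENTRIES` list and the `build_corpus` function are hypothetical names invented for this example, and the category/word pairs are toy placeholders, not actual thesaurus data.

```python
from collections import defaultdict

# Hypothetical toy entries: (category, word) pairs standing in for a
# thesaurus-style arrangement. Illustrative only, not real thesaurus data.
ENTRIES = [
    ("motion", "move"),
    ("motion", "travel"),
    ("quantity", "amount"),
    ("quantity", "number"),
    ("motion", "Move"),  # duplicate differing only in case
]


def build_corpus(entries):
    """Group words by category and return (by_category, core_word_set)."""
    by_category = defaultdict(set)
    for category, word in entries:
        # Normalize case so "Move" and "move" count as one word.
        by_category[category].add(word.lower())
    # The core corpus is the union of the words in every category.
    core = set().union(*by_category.values())
    return dict(by_category), core


by_category, core = build_corpus(ENTRIES)
print(sorted(core))
```

Keeping the per-category sets alongside the flat set preserves the systematic arrangement while still allowing simple membership tests against the whole corpus.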
