s | Noun |
---|---|
a | Adjective |
v | Verb |
e | Proper name |
w | Word class without suffixes |
t | Word class without suffixes (e.g. high frequency terms) |
y | Synonym |
q (=SEQ) | Sequence (algorithmically identified phrase) |
m (=MUL) | Phrase |
k (=COM) | Compound |
+ | Part of a compound |
x+ | Unknown part of a hyphenated compound |
? | Unknown word |
MU? | Part of a phrase (unknown word) |
HELP | e.g. Special characters |
ABRV | Possible abbreviation with a full stop in the middle |
PUNC | Punctuation etc. |
OTHR | Other character |
URLS | URL |
NUMS | Number |
--- meeting: attendees: - text_reader: { files: STDIN } - tokenizer: { } - word_searcher: { source: sys-dic, mode: first } - decomposer: { source: sys-dic } - multi_worder: { source: sys-mul } - sequencer: { stopper: 'PUNC,OTHR' } - synonymer: { skip: '?,t', source: sys-syn } - debug_filter: { preamble: false } - text_writer: { ext: STDOUT, sep: "\n" }