What is Jacson? Getting started How it works? Plugin Classes Sources Generic Sources Filters Evaluators Reports Selections Stemmers & Handlers Technologies used Software used Contact How to Contribute? Developer Information Other information
|
The two filters
SelectionExtractFilter
ant
SelectionFilterFilter
are actually meta filters. Their behaviour can be heavily
customized by another layer of plugin classes, Selections.
Selections localize certain pieces within a chunk. These
pieces are taken by above filters and made available to
make up a fresh chunk sequence with all possibilities
for chunk processing provided by Jacson
The following selections are currently provided by Jacson
in the package
de.spieleck.app.jacson.select :
ParameterSelect -
Selects the content of the selection into a JacsonStateParameter
to be later reused by other Selections, Filters, reports or
whatever.
RegExpGroupSelect -
Picks up a range of characters determined by the
groupings in a regular expression.
SubstringSelect -
Picks up a range of characters determined by start
and end (like Java String.substring()).
TokenSelect -
Picks (Ant like) Tokens from a chunk, that is pieces
of string which are prefixed by a customizable sequence
and postfixed by a different sequence. Both sequences
default to "@" to mimic Ants
token filter mechanism.
WordSelect -
The word select breaks a chunk into words separated
by a certain set of delimiters. The delimiters default
to whitespace and other grammatical word separations but
can be customized.
|