Class RegexTokenizer

  • All Implemented Interfaces:
    Tokenizer<CharSequence>

    class RegexTokenizer
    extends Object
    implements Tokenizer<CharSequence>
    A simple word tokenizer that utilizes regex to find words. It applies a regex (\w)+ over the input text to extract words from a given character sequence.
    Since:
    1.0