Class IcuFoldingTokenFilterFactory

All Implemented Interfaces:
NormalizingTokenFilterFactory, TokenFilterFactory, IndexComponent

public class IcuFoldingTokenFilterFactory extends AbstractTokenFilterFactory implements NormalizingTokenFilterFactory
Uses the ICUFoldingFilter. Applies foldings from UTR#30 Character Foldings.

Can be filtered to handle certain characters in a specified way (see http://icu-project.org/apiref/icu4j/com/ibm/icu/text/UnicodeSet.html) E.g national chars that should be retained (filter : "[^åäöÅÄÖ]").

The unicodeSetFilter attribute can be used to provide the UniCodeSet for filtering.

  • Constructor Details

  • Method Details

    • create

      public org.apache.lucene.analysis.TokenStream create(org.apache.lucene.analysis.TokenStream tokenStream)
      Specified by:
      create in interface TokenFilterFactory