Class IcuFoldingTokenFilterFactory

  • All Implemented Interfaces:
    NormalizingTokenFilterFactory, TokenFilterFactory, IndexComponent

    public class IcuFoldingTokenFilterFactory
    extends AbstractTokenFilterFactory
    implements NormalizingTokenFilterFactory
    Uses the ICUFoldingFilter. Applies foldings from UTR#30 Character Foldings.

    Can be filtered to handle certain characters in a specified way (see http://icu-project.org/apiref/icu4j/com/ibm/icu/text/UnicodeSet.html) E.g national chars that should be retained (filter : "[^åäöÅÄÖ]").

    The unicodeSetFilter attribute can be used to provide the UniCodeSet for filtering.

    • Constructor Detail

      • IcuFoldingTokenFilterFactory

        public IcuFoldingTokenFilterFactory​(IndexSettings indexSettings,
                                            Environment environment,
                                            java.lang.String name,
                                            Settings settings)
    • Method Detail

      • create

        public org.apache.lucene.analysis.TokenStream create​(org.apache.lucene.analysis.TokenStream tokenStream)
        Specified by:
        create in interface TokenFilterFactory