Skip navigation links
A C D E F G H I L M N O P Q R S T U 

A

addLexeme(Lexeme) - Method in class org.wltea.analyzer.Context
向分词结果集添加词元
Arabic_Num_Mid - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 

C

CharacterHelper - Class in org.wltea.analyzer.help
字符集识别辅助工具类
CharacterHelper() - Constructor for class org.wltea.analyzer.help.CharacterHelper
 
Chn_Num - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
Chn_Num_Mid - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
CJKSegmenter - Class in org.wltea.analyzer.seg
中文(CJK)词元处理子分词器,涵盖一下范围 1.中文词语 2.姓名 3.地名 4.未知词(单字切分) 5.日文/韩文(单字切分)
CJKSegmenter() - Constructor for class org.wltea.analyzer.seg.CJKSegmenter
 
compareTo(Lexeme) - Method in class org.wltea.analyzer.Lexeme
 
Configuration - Class in org.wltea.analyzer.cfg
IK Analyzer v3.2 简单的配置管理类,单子模式
Context - Class in org.wltea.analyzer
分词器上下文状态
coord(int, int) - Method in class org.wltea.analyzer.lucene.IKSimilarity
 
CustomDictionary - Class in org.wltea.analyzer.dic
 
CustomDictionary() - Constructor for class org.wltea.analyzer.dic.CustomDictionary
 

D

Dictionary - Class in org.wltea.analyzer.dic
IK Analyzer v3.2 词典管理类,单子模式
DictSegment - Class in org.wltea.analyzer.dic
IK Analyzer v3.2 字典子片断 字典匹配核心类 该类采用 数组 结合 HashMap,实现词典存储,词语匹配 当下属的页节点小等于3时,采用数组存储 当下属的页节点大于3时,采用HashMap存储
DictSegment(Character) - Constructor for class org.wltea.analyzer.dic.DictSegment
 

E

end() - Method in class org.wltea.analyzer.lucene.IKTokenizer
 
equals(Object) - Method in class org.wltea.analyzer.Lexeme
 
excludeOverlap() - Method in class org.wltea.analyzer.Context
排除结果集中完全交叠(彼此包含)的词元 进行最大切分的时候,过滤长度较小的交叠词元

F

fillSegment(char[]) - Method in class org.wltea.analyzer.dic.DictSegment
加载填充词典片段
fillSegment(char[], int, int) - Method in class org.wltea.analyzer.dic.DictSegment
加载填充词典片段
firstLexeme() - Method in class org.wltea.analyzer.Context
取出分词结果集中的首个词元

G

getAvailable() - Method in class org.wltea.analyzer.Context
 
getBegin() - Method in class org.wltea.analyzer.dic.Hit
 
getBegin() - Method in class org.wltea.analyzer.Lexeme
 
getBeginPosition() - Method in class org.wltea.analyzer.Lexeme
获取词元在文本中的起始位置
getBuffOffset() - Method in class org.wltea.analyzer.Context
 
getCursor() - Method in class org.wltea.analyzer.Context
 
getEnd() - Method in class org.wltea.analyzer.dic.Hit
 
getEndPosition() - Method in class org.wltea.analyzer.Lexeme
获取词元在文本中的结束位置
getExtDictionarys() - Static method in class org.wltea.analyzer.cfg.Configuration
获取扩展字典配置路径
getExtStopWordDictionarys() - Static method in class org.wltea.analyzer.cfg.Configuration
获取扩展停止词典配置路径
getInstance() - Static method in class org.wltea.analyzer.dic.Dictionary
词典初始化 由于IK Analyzer的词典采用Dictionary类的静态方法进行词典初始化 只有当Dictionary类被实际调用时,才会开始载入词典, 这将延长首次分词操作的时间 该方法提供了一个在应用加载阶段就初始化字典的手段 用来缩短首次分词时的时延
getLastAnalyzed() - Method in class org.wltea.analyzer.Context
 
getLength() - Method in class org.wltea.analyzer.Lexeme
获取词元的字符长度
getLexemeText() - Method in class org.wltea.analyzer.Lexeme
获取词元的文本内容
getLexemeType() - Method in class org.wltea.analyzer.Lexeme
获取词元类型
getMainDictInputStream() - Method in class org.wltea.analyzer.dic.CustomDictionary
 
getMatchedDictSegment() - Method in class org.wltea.analyzer.dic.Hit
 
getNodeChar() - Method in class org.wltea.analyzer.dic.DictSegment
 
getOffset() - Method in class org.wltea.analyzer.Lexeme
 
getPrepDictInputStream() - Method in class org.wltea.analyzer.dic.CustomDictionary
 
getQuantifierDictInputStream() - Method in class org.wltea.analyzer.dic.CustomDictionary
 
getResultSize() - Method in class org.wltea.analyzer.Context
获取分词结果集大小
getStopWordDictInputStream() - Method in class org.wltea.analyzer.dic.CustomDictionary
 
getSuffixDictInputStream() - Method in class org.wltea.analyzer.dic.CustomDictionary
 
getSurnameDictInputStream() - Method in class org.wltea.analyzer.dic.CustomDictionary
 

H

hashCode() - Method in class org.wltea.analyzer.Lexeme
 
hasNextNode() - Method in class org.wltea.analyzer.dic.DictSegment
 
Hit - Class in org.wltea.analyzer.dic
IK Analyzer v3.2 表示词典检索的命中结果
Hit() - Constructor for class org.wltea.analyzer.dic.Hit
 

I

IKAnalyzer - Class in org.wltea.analyzer.lucene
实现Lucene Analyzer 基于IKTokenizer的中文分词器
IKAnalyzer() - Constructor for class org.wltea.analyzer.lucene.IKAnalyzer
IK分词器Lucene Analyzer接口实现类 默认最细粒度切分算法
IKAnalyzer(boolean) - Constructor for class org.wltea.analyzer.lucene.IKAnalyzer
IK分词器Lucene Analyzer接口实现类
IKAnalyzerDemo - Class in org.wltea.analyzer.sample
 
IKAnalyzerDemo() - Constructor for class org.wltea.analyzer.sample.IKAnalyzerDemo
 
IKQueryParser - Class in org.wltea.analyzer.lucene
IK查询分析器 实现了对分词歧义结果的非冲突排列组合 有效的优化对歧义关键词的搜索命中 针对IK Analyzer V3的优化实现
IKQueryParser() - Constructor for class org.wltea.analyzer.lucene.IKQueryParser
 
IKSegmentation - Class in org.wltea.analyzer
IK Analyzer v3.2 IK主分词器 注:IKSegmentation是一个lucene无关的通用分词器
IKSegmentation(Reader) - Constructor for class org.wltea.analyzer.IKSegmentation
IK主分词器构造函数 默认最细粒度切分
IKSegmentation(Reader, boolean) - Constructor for class org.wltea.analyzer.IKSegmentation
IK主分词器构造函数
IKSimilarity - Class in org.wltea.analyzer.lucene
IK Analyzer v3.2 相似度评估器 重载了DefaultSimilarity的coord方法 提高词元命中个数在相似度比较中的权重影响,即,当有多个词元得到匹配时,文档的相似度将提高
IKSimilarity() - Constructor for class org.wltea.analyzer.lucene.IKSimilarity
 
IKTokenizer - Class in org.wltea.analyzer.lucene
IK Analyzer v3.2 Lucene3.0 Tokenizer适配器类 它封装了IKSegmentation实现
IKTokenizer(Reader, boolean) - Constructor for class org.wltea.analyzer.lucene.IKTokenizer
Lucene Tokenizer适配器类构造函数
incrementToken() - Method in class org.wltea.analyzer.lucene.IKTokenizer
 
isArabicNumber(char) - Static method in class org.wltea.analyzer.help.CharacterHelper
 
isBufferLocked() - Method in class org.wltea.analyzer.Context
只要buffLocker中存在ISegmenter对象 则buffer被锁定
isCJKCharacter(char) - Static method in class org.wltea.analyzer.help.CharacterHelper
 
ISegmenter - Interface in org.wltea.analyzer.seg
子分词器接口
isEnglishLetter(char) - Static method in class org.wltea.analyzer.help.CharacterHelper
 
isLoadedCustomDictionary() - Method in class org.wltea.analyzer.dic.Dictionary
 
isMatch() - Method in class org.wltea.analyzer.dic.Hit
判断是否完全匹配
isMaxWordLength() - Method in class org.wltea.analyzer.Context
 
isMaxWordLength() - Method in class org.wltea.analyzer.lucene.IKAnalyzer
 
isOverlap(Lexeme) - Method in class org.wltea.analyzer.Lexeme
判断词元是否彼此包含
isPrefix() - Method in class org.wltea.analyzer.dic.Hit
判断是否是词的前缀
isSpaceLetter(char) - Static method in class org.wltea.analyzer.help.CharacterHelper
 
isStopWord(char[], int, int) - Static method in class org.wltea.analyzer.dic.Dictionary
判断是否是停止词
isUnmatch() - Method in class org.wltea.analyzer.dic.Hit
判断是否是不匹配

L

lastLexeme() - Method in class org.wltea.analyzer.Context
取出分词结果集中的最后一个词元
LetterSegmenter - Class in org.wltea.analyzer.seg
负责处理字母的子分词器,涵盖一下范围 1.英文单词、英文加阿拉伯数字、专有名词(公司名) 2.IP地址、Email、URL
LetterSegmenter() - Constructor for class org.wltea.analyzer.seg.LetterSegmenter
 
Lexeme - Class in org.wltea.analyzer
IK Analyzer v3.2 语义单元(词元) *
Lexeme(int, int, int, int) - Constructor for class org.wltea.analyzer.Lexeme
 
loadCustomDictionary(CustomDictionary) - Method in class org.wltea.analyzer.dic.Dictionary
 
loadExtendStopWords(Collection<String>) - Static method in class org.wltea.analyzer.dic.Dictionary
加载扩展的停止词条
loadExtendWords(Collection<String>) - Static method in class org.wltea.analyzer.dic.Dictionary
加载扩展的词条
loadSegmenter() - Static method in class org.wltea.analyzer.cfg.Configuration
初始化子分词器实现 (目前暂时不考虑配置扩展)
lockBuffer(ISegmenter) - Method in class org.wltea.analyzer.Context
 

M

main(String[]) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
 
main(String[]) - Static method in class org.wltea.analyzer.sample.IKAnalyzerDemo
 
match(char[]) - Method in class org.wltea.analyzer.dic.DictSegment
匹配词段
match(char[], int, int) - Method in class org.wltea.analyzer.dic.DictSegment
匹配词段
match(char[], int, int, Hit) - Method in class org.wltea.analyzer.dic.DictSegment
匹配词段
matchInMainDict(char[]) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配主词典
matchInMainDict(char[], int, int) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配主词典
matchInPrepDict(char[], int, int) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配介词、副词词典
matchInQuantifierDict(char[], int, int) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配量词词典
matchInSuffixDict(char[], int, int) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配在后缀词典
matchInSurnameDict(char[], int, int) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配姓氏词典
matchWithHit(char[], int, Hit) - Static method in class org.wltea.analyzer.dic.Dictionary
检索匹配主词典, 从已匹配的Hit中直接取出DictSegment,继续向下匹配

N

NaN - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
NC_ANM - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
NC_ARABIC - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
NC_CHINESE - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
NC_CNM - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
NC_NE - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
NC_NP - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
next() - Method in class org.wltea.analyzer.IKSegmentation
获取下一个语义单元
nextLexeme(char[], Context) - Method in class org.wltea.analyzer.seg.CJKSegmenter
 
nextLexeme(char[], Context) - Method in interface org.wltea.analyzer.seg.ISegmenter
从分析器读取下一个可能分解的词元对象
nextLexeme(char[], Context) - Method in class org.wltea.analyzer.seg.LetterSegmenter
 
nextLexeme(char[], Context) - Method in class org.wltea.analyzer.seg.QuantifierSegmenter
 
Num_End - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 
Num_Pre - Static variable in class org.wltea.analyzer.seg.QuantifierSegmenter
 

O

org.wltea.analyzer - package org.wltea.analyzer
 
org.wltea.analyzer.cfg - package org.wltea.analyzer.cfg
 
org.wltea.analyzer.dic - package org.wltea.analyzer.dic
 
org.wltea.analyzer.help - package org.wltea.analyzer.help
 
org.wltea.analyzer.lucene - package org.wltea.analyzer.lucene
 
org.wltea.analyzer.sample - package org.wltea.analyzer.sample
 
org.wltea.analyzer.seg - package org.wltea.analyzer.seg
 

P

parse(String, String) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
单连续字窜(不带空格符)单Field查询分析
parse(String) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
解析IK简易查询表达式
parseMultiField(String[], String) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
多Field,单条件查询分析
parseMultiField(String[], String, BooleanClause.Occur[]) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
多Field,单条件,多Occur查询分析
parseMultiField(String[], String[]) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
多Field多条件查询分析
parseMultiField(String[], String[], BooleanClause.Occur[]) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
多Field,多条件,多Occur查询分析
PATH_DIC_MAIN - Static variable in class org.wltea.analyzer.dic.Dictionary
 
PATH_DIC_PREP - Static variable in class org.wltea.analyzer.dic.Dictionary
 
PATH_DIC_QUANTIFIER - Static variable in class org.wltea.analyzer.dic.Dictionary
 
PATH_DIC_STOP - Static variable in class org.wltea.analyzer.dic.Dictionary
 
PATH_DIC_SUFFIX - Static variable in class org.wltea.analyzer.dic.Dictionary
 
PATH_DIC_SURNAME - Static variable in class org.wltea.analyzer.dic.Dictionary
 

Q

QuantifierSegmenter - Class in org.wltea.analyzer.seg
数量词子分词器,涵盖一下范围 1.阿拉伯数字,阿拉伯数字+中文量词 2.中文数字+中文量词 3.时间,日期 4.罗马数字 5.数学符号 % .
QuantifierSegmenter() - Constructor for class org.wltea.analyzer.seg.QuantifierSegmenter
 

R

regularize(char) - Static method in class org.wltea.analyzer.help.CharacterHelper
进行字符规格化(全角转半角,大写转小写处理)
reset(Reader) - Method in class org.wltea.analyzer.IKSegmentation
重置分词器到初始状态
reset(Reader) - Method in class org.wltea.analyzer.lucene.IKTokenizer
 
reset() - Method in class org.wltea.analyzer.seg.CJKSegmenter
 
reset() - Method in interface org.wltea.analyzer.seg.ISegmenter
重置子分析器状态
reset() - Method in class org.wltea.analyzer.seg.LetterSegmenter
 
reset() - Method in class org.wltea.analyzer.seg.QuantifierSegmenter
 
resetContext() - Method in class org.wltea.analyzer.Context
重置上下文

S

setAvailable(int) - Method in class org.wltea.analyzer.Context
 
setBegin(int) - Method in class org.wltea.analyzer.dic.Hit
 
setBegin(int) - Method in class org.wltea.analyzer.Lexeme
 
setBuffOffset(int) - Method in class org.wltea.analyzer.Context
 
setCursor(int) - Method in class org.wltea.analyzer.Context
 
setEnd(int) - Method in class org.wltea.analyzer.dic.Hit
 
setLastAnalyzed(int) - Method in class org.wltea.analyzer.Context
 
setLength(int) - Method in class org.wltea.analyzer.Lexeme
 
setLexemeText(String) - Method in class org.wltea.analyzer.Lexeme
 
setLexemeType(int) - Method in class org.wltea.analyzer.Lexeme
 
setMainDictInputStream(InputStream) - Method in class org.wltea.analyzer.dic.CustomDictionary
 
setMatch() - Method in class org.wltea.analyzer.dic.Hit
 
setMatchedDictSegment(DictSegment) - Method in class org.wltea.analyzer.dic.Hit
 
setMaxWordLength(boolean) - Method in class org.wltea.analyzer.Context
 
setMaxWordLength(boolean) - Method in class org.wltea.analyzer.lucene.IKAnalyzer
 
setMaxWordLength(boolean) - Static method in class org.wltea.analyzer.lucene.IKQueryParser
设置分词策略 isMaxWordLength = true 采用最大词长分词
setOffset(int) - Method in class org.wltea.analyzer.Lexeme
 
setPrefix() - Method in class org.wltea.analyzer.dic.Hit
 
setPrepDictInputStream(InputStream) - Method in class org.wltea.analyzer.dic.CustomDictionary
 
setQuantifierDictInputStream(InputStream) - Method in class org.wltea.analyzer.dic.CustomDictionary
 
setStopWordDictInputStream(InputStream) - Method in class org.wltea.analyzer.dic.CustomDictionary
 
setSuffixDictInputStream(InputStream) - Method in class org.wltea.analyzer.dic.CustomDictionary
 
setSurnameDictInputStream(InputStream) - Method in class org.wltea.analyzer.dic.CustomDictionary
 
setUnmatch() - Method in class org.wltea.analyzer.dic.Hit
 
Sign_Connector - Static variable in class org.wltea.analyzer.seg.LetterSegmenter
 

T

tokenStream(String, Reader) - Method in class org.wltea.analyzer.lucene.IKAnalyzer
 
toString() - Method in class org.wltea.analyzer.Lexeme
 
TYPE_CJK_NORMAL - Static variable in class org.wltea.analyzer.Lexeme
 
TYPE_CJK_SF - Static variable in class org.wltea.analyzer.Lexeme
 
TYPE_CJK_SN - Static variable in class org.wltea.analyzer.Lexeme
 
TYPE_CJK_UNKNOWN - Static variable in class org.wltea.analyzer.Lexeme
 
TYPE_LETTER - Static variable in class org.wltea.analyzer.Lexeme
 
TYPE_NUM - Static variable in class org.wltea.analyzer.Lexeme
 
TYPE_NUMCOUNT - Static variable in class org.wltea.analyzer.Lexeme
 

U

unlockBuffer(ISegmenter) - Method in class org.wltea.analyzer.Context
 
A C D E F G H I L M N O P Q R S T U 
Skip navigation links

Copyright © 2022. All rights reserved.