Index

A B C D E F G H I K L M N O P Q R S T U V W Y 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form

A

addPastTokensSize(int) - Method in class chat.octet.model.beans.Status
 
AIX - Static variable in class chat.octet.model.utils.Platform
 
allowRequantize - Variable in class chat.octet.model.beans.LlamaModelQuantizeParams
allow quantizing non-f32/f16 tensors
ALPACA - Enum constant in enum class chat.octet.model.enums.ModelType
 
ANDROID - Static variable in class chat.octet.model.utils.Platform
 
appendNextToken(Token) - Method in class chat.octet.model.beans.Status
 
appendTokens(int[]) - Method in class chat.octet.model.beans.Status
 
AQUILA - Enum constant in enum class chat.octet.model.enums.ModelType
 
ARCH - Static variable in class chat.octet.model.utils.Platform
 
ASSISTANT - Enum constant in enum class chat.octet.model.beans.ChatMessage.ChatRole
Assistant role

B

BAICHUAN - Enum constant in enum class chat.octet.model.enums.ModelType
 
batch - Variable in class chat.octet.model.beans.LlamaContextParams
prompt processing batch size.
batchDecode(int, int[], int, int) - Static method in class chat.octet.model.LlamaService
Batch decoding.

C

chat(GenerateParameter, String) - Method in class chat.octet.model.Model
Start a conversation and chat in streaming format.
chat(GenerateParameter, String, String) - Method in class chat.octet.model.Model
Start a conversation and chat in streaming format.
chat(String) - Method in class chat.octet.model.Model
Start a conversation and chat in streaming format.
chat(String, String) - Method in class chat.octet.model.Model
Start a conversation and chat in streaming format.
chat.octet.model - package chat.octet.model
 
chat.octet.model.beans - package chat.octet.model.beans
 
chat.octet.model.components.criteria - package chat.octet.model.components.criteria
 
chat.octet.model.components.criteria.impl - package chat.octet.model.components.criteria.impl
 
chat.octet.model.components.processor - package chat.octet.model.components.processor
 
chat.octet.model.components.processor.impl - package chat.octet.model.components.processor.impl
 
chat.octet.model.enums - package chat.octet.model.enums
 
chat.octet.model.exceptions - package chat.octet.model.exceptions
 
chat.octet.model.parameters - package chat.octet.model.parameters
 
chat.octet.model.utils - package chat.octet.model.utils
 
chatCompletions(GenerateParameter, String) - Method in class chat.octet.model.Model
Start a conversation and chat.
chatCompletions(GenerateParameter, String, String) - Method in class chat.octet.model.Model
Start a conversation and chat.
chatCompletions(String) - Method in class chat.octet.model.Model
Start a conversation and chat.
ChatMessage - Class in chat.octet.model.beans
Chat message entity
ChatMessage() - Constructor for class chat.octet.model.beans.ChatMessage
 
ChatMessage(ChatMessage.ChatRole, String) - Constructor for class chat.octet.model.beans.ChatMessage
 
ChatMessage.ChatRole - Enum Class in chat.octet.model.beans
Chat role definition
clearCache(int) - Static method in class chat.octet.model.LlamaService
Clear cache in K-V sequences.
clearCache(int, int, int) - Static method in class chat.octet.model.LlamaService
Clear cache in K-V sequences.
close() - Method in class chat.octet.model.Model
Close the model and release resources.
COMMON - Enum constant in enum class chat.octet.model.enums.ModelType
 
CompletionResult - Class in chat.octet.model.beans
Completion result entity
CompletionResult() - Constructor for class chat.octet.model.beans.CompletionResult
 
completions(GenerateParameter, String) - Method in class chat.octet.model.Model
Generate complete text.
completions(String) - Method in class chat.octet.model.Model
Generate complete text.
copyToStatus(Status) - Method in class chat.octet.model.beans.Status
 
createNewContextWithModel(LlamaContextParams) - Static method in class chat.octet.model.LlamaService
Create new context with model.
criteria(int[], float[], Object...) - Method in class chat.octet.model.components.criteria.impl.MaxTimeCriteria
 
criteria(int[], float[], Object...) - Method in interface chat.octet.model.components.criteria.StoppingCriteria
Stopping criteria
criteria(int[], float[], Object...) - Method in class chat.octet.model.components.criteria.StoppingCriteriaList
 
ctx - Variable in class chat.octet.model.beans.LlamaContextParams
text context size.
CustomBiasLogitsProcessor - Class in chat.octet.model.components.processor.impl
 
CustomBiasLogitsProcessor(Map<Integer, String>, int) - Constructor for class chat.octet.model.components.processor.impl.CustomBiasLogitsProcessor
 

D

dataTypeK - Variable in class chat.octet.model.beans.LlamaContextParams
data type for K cache.
dataTypeV - Variable in class chat.octet.model.beans.LlamaContextParams
data type for V cache.
DecodeException - Exception Class in chat.octet.model.exceptions
Batch decode exception
DecodeException(String) - Constructor for exception class chat.octet.model.exceptions.DecodeException
 
DecodeException(String, Throwable) - Constructor for exception class chat.octet.model.exceptions.DecodeException
 
decodeToken(int...) - Static method in class chat.octet.model.TokenDecoder
 
DEFAULT_ALPACA_SYSTEM - Static variable in class chat.octet.model.utils.PromptBuilder
Default prompt of Alpaca model.
DEFAULT_COMMON_SYSTEM - Static variable in class chat.octet.model.utils.PromptBuilder
Default common prompt.
DISABLED - Enum constant in enum class chat.octet.model.parameters.GenerateParameter.MirostatMode
 

E

embedding - Variable in class chat.octet.model.beans.LlamaContextParams
embedding mode only.
equals(Object) - Method in class chat.octet.model.beans.Token
 

F

FALCON - Enum constant in enum class chat.octet.model.enums.ModelType
 
FINISHED - Enum constant in enum class chat.octet.model.enums.FinishReason
Completed generation.
FinishReason - Enum Class in chat.octet.model.enums
Token generation status
format(ModelType, ChatMessage...) - Static method in class chat.octet.model.utils.PromptBuilder
Format chat messages as prompt text.
format(ModelType, String) - Static method in class chat.octet.model.utils.PromptBuilder
Format user question as prompt text.
format(ModelType, String, String) - Static method in class chat.octet.model.utils.PromptBuilder
Format user question as prompt text.
FREEBSD - Static variable in class chat.octet.model.utils.Platform
 

G

generate(GenerateParameter, String) - Method in class chat.octet.model.Model
Generate text in stream format.
generate(String) - Method in class chat.octet.model.Model
Generate text in stream format.
GenerateParameter - Class in chat.octet.model.parameters
Generate parameter
GenerateParameter() - Constructor for class chat.octet.model.parameters.GenerateParameter
 
GenerateParameter.MirostatMode - Enum Class in chat.octet.model.parameters
Mirostat sampling mode definition
GenerationException - Exception Class in chat.octet.model.exceptions
Generation exception
GenerationException(String) - Constructor for exception class chat.octet.model.exceptions.GenerationException
 
GenerationException(String, Throwable) - Constructor for exception class chat.octet.model.exceptions.GenerationException
 
Generator - Class in chat.octet.model
Model inference generator. Supports streaming text output and generating complete text.
Generator(GenerateParameter, String) - Constructor for class chat.octet.model.Generator
Create inference generator.
Generator(GenerateParameter, String, Status) - Constructor for class chat.octet.model.Generator
Create inference generator.
getContextSize() - Static method in class chat.octet.model.LlamaService
Get model context size.
getEmbedding() - Static method in class chat.octet.model.LlamaService
Get embedding
getFinishReason() - Method in class chat.octet.model.beans.Status
 
getLlamaContextDefaultParams() - Static method in class chat.octet.model.LlamaService
Get llama context default params.
getLlamaModelDefaultParams() - Static method in class chat.octet.model.LlamaService
Get llama model default params.
getLlamaModelQuantizeDefaultParams() - Static method in class chat.octet.model.LlamaService
Get llama model quantize default params.
getLlamaTokenType(int) - Static method in class chat.octet.model.LlamaService
Get token type definition.
getLogits(int) - Static method in class chat.octet.model.LlamaService
Get logits by index; the default index is 0.
getLogitsIndex() - Method in class chat.octet.model.beans.Status
 
getOSType() - Static method in class chat.octet.model.utils.Platform
 
getSamplingMetrics(boolean) - Static method in class chat.octet.model.LlamaService
Get sampling metrics
getSystemInfo() - Static method in class chat.octet.model.LlamaService
Get system parameter information.
getTokenBOS() - Static method in class chat.octet.model.LlamaService
Get special BOS token.
getTokenEOS() - Static method in class chat.octet.model.LlamaService
Get special EOS token.
getTokenType(int) - Static method in class chat.octet.model.LlamaService
Get token type code.
getUtf8ByteLength(byte) - Static method in class chat.octet.model.TokenDecoder
 
getVocabSize() - Static method in class chat.octet.model.LlamaService
Get model vocab size.
GNU - Static variable in class chat.octet.model.utils.Platform
 
gpuLayers - Variable in class chat.octet.model.beans.LlamaModelParams
number of layers to store in VRAM.

H

hashCode() - Method in class chat.octet.model.beans.Token
 

I

initNative() - Static method in class chat.octet.model.LlamaService
Initialize the JNI context.
isFinished() - Method in enum class chat.octet.model.enums.FinishReason
Return true if token generation has completed, otherwise false.
isLinux() - Static method in class chat.octet.model.utils.Platform
 
isMac() - Static method in class chat.octet.model.utils.Platform
 
isMlockSupported() - Static method in class chat.octet.model.LlamaService
Check whether MLOCK is supported.
isMmapSupported() - Static method in class chat.octet.model.LlamaService
Check whether MMAP is supported.
isMultiByte(int) - Static method in class chat.octet.model.TokenDecoder
 
isWindows() - Static method in class chat.octet.model.utils.Platform
 
iterator() - Method in class chat.octet.model.Generator
Return inference iterator.

K

KFREEBSD - Static variable in class chat.octet.model.utils.Platform
 

L

LENGTH - Enum constant in enum class chat.octet.model.enums.FinishReason
Generation has exceeded the maximum token limit and has been truncated.
LIB_RESOURCE_PATH - Static variable in class chat.octet.model.utils.Platform
 
LINUX - Static variable in class chat.octet.model.utils.Platform
 
LLAMA_FTYPE_ALL_F32 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_GUESSED - Enum constant in enum class chat.octet.model.enums.ModelFileType
not specified in the model file
LLAMA_FTYPE_MOSTLY_F16 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q2_K - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q3_K_L - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q3_K_M - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q3_K_S - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q4_0 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q4_1 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 - Enum constant in enum class chat.octet.model.enums.ModelFileType
tok_embeddings.weight and output.weight are F16
LLAMA_FTYPE_MOSTLY_Q4_K_M - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q4_K_S - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q5_0 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q5_1 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q5_K_M - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q5_K_S - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q6_K - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_FTYPE_MOSTLY_Q8_0 - Enum constant in enum class chat.octet.model.enums.ModelFileType
 
LLAMA_ROPE_SCALING_LINEAR - Enum constant in enum class chat.octet.model.enums.LlamaRoPEScalingType
Scaling linear type.
LLAMA_ROPE_SCALING_NONE - Enum constant in enum class chat.octet.model.enums.LlamaRoPEScalingType
Scaling none type.
LLAMA_ROPE_SCALING_UNSPECIFIED - Enum constant in enum class chat.octet.model.enums.LlamaRoPEScalingType
unspecified type.
LLAMA_ROPE_SCALING_YARN - Enum constant in enum class chat.octet.model.enums.LlamaRoPEScalingType
Scaling YaRN type.
LLAMA_TOKEN_TYPE_BYTE - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
Byte type.
LLAMA_TOKEN_TYPE_CONTROL - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
Control type.
LLAMA_TOKEN_TYPE_NORMAL - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
Normal type.
LLAMA_TOKEN_TYPE_UNDEFINED - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
Undefined type.
LLAMA_TOKEN_TYPE_UNKNOWN - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
Unknown type.
LLAMA_TOKEN_TYPE_UNUSED - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
Unused type.
LLAMA_TOKEN_TYPE_USER_DEFINED - Enum constant in enum class chat.octet.model.enums.LlamaTokenType
User defined type.
LLAMA2 - Enum constant in enum class chat.octet.model.enums.ModelType
 
llamaBackendFree() - Static method in class chat.octet.model.LlamaService
Call once at the end of the program.
llamaBackendInit(boolean) - Static method in class chat.octet.model.LlamaService
Initialize the llama + ggml backend. If numa is true, use NUMA optimizations. Call once at the start of the program.
LlamaContextParams - Class in chat.octet.model.beans
Llama context params entity
LlamaContextParams() - Constructor for class chat.octet.model.beans.LlamaContextParams
 
LlamaModelParams - Class in chat.octet.model.beans
Llama model params entity
LlamaModelParams() - Constructor for class chat.octet.model.beans.LlamaModelParams
 
llamaModelQuantize(String, String, LlamaModelQuantizeParams) - Static method in class chat.octet.model.LlamaService
Quantize the model.
llamaModelQuantize(String, String, ModelFileType) - Static method in class chat.octet.model.LlamaService
Quantize the model.
LlamaModelQuantizeParams - Class in chat.octet.model.beans
Llama model quantize params entity
LlamaModelQuantizeParams() - Constructor for class chat.octet.model.beans.LlamaModelQuantizeParams
 
LlamaRoPEScalingType - Enum Class in chat.octet.model.enums
Llama RoPE scaling type definition
LlamaService - Class in chat.octet.model
Llama.cpp API
LlamaService() - Constructor for class chat.octet.model.LlamaService
 
LlamaTokenType - Enum Class in chat.octet.model.enums
Llama token type definition
loadLibraryResource() - Static method in class chat.octet.model.utils.Platform
 
loadLlamaGrammar(String) - Static method in class chat.octet.model.LlamaService
Load llama grammar by rules.
loadLlamaModelFromFile(String, LlamaModelParams) - Static method in class chat.octet.model.LlamaService
Load Llama model from file.
loadLoraModelFromFile(String, float, String, int) - Static method in class chat.octet.model.LlamaService
Apply a LoRA adapter to a loaded model. path_base_model is the path to a higher quality model to use as a base for the layers modified by the adapter.
logitsAll - Variable in class chat.octet.model.beans.LlamaContextParams
the llama_eval() call computes all logits, not just the last one.
LogitsProcessor - Interface in chat.octet.model.components.processor
Customize a processor to adjust the probability distribution of words and control the generation of model inference results.
LogitsProcessorList - Class in chat.octet.model.components.processor
Logits processor list
LogitsProcessorList() - Constructor for class chat.octet.model.components.processor.LogitsProcessorList
 
LogitsProcessorList(Collection<? extends LogitsProcessor>) - Constructor for class chat.octet.model.components.processor.LogitsProcessorList
 

M

MAC - Static variable in class chat.octet.model.utils.Platform
 
mainGpu - Variable in class chat.octet.model.beans.LlamaModelParams
the GPU that is used for scratch and small tensors.
MaxTimeCriteria - Class in chat.octet.model.components.criteria.impl
 
MaxTimeCriteria(long) - Constructor for class chat.octet.model.components.criteria.impl.MaxTimeCriteria
 
MaxTimeCriteria(long, long) - Constructor for class chat.octet.model.components.criteria.impl.MaxTimeCriteria
 
metrics() - Method in class chat.octet.model.Model
Print generation metrics.
Metrics - Class in chat.octet.model.beans
Generation metrics
Metrics() - Constructor for class chat.octet.model.beans.Metrics
 
METRICS_TEMPLATE - Static variable in class chat.octet.model.beans.Metrics
 
mlock - Variable in class chat.octet.model.beans.LlamaModelParams
force system to keep model in RAM.
mmap - Variable in class chat.octet.model.beans.LlamaModelParams
use mmap if possible.
Model - Class in chat.octet.model
Llama model, which provides functions for text generation and chat conversations.
Model(ModelParameter) - Constructor for class chat.octet.model.Model
 
Model(String) - Constructor for class chat.octet.model.Model
 
ModelException - Exception Class in chat.octet.model.exceptions
Model exception
ModelException(String) - Constructor for exception class chat.octet.model.exceptions.ModelException
 
ModelException(String, Throwable) - Constructor for exception class chat.octet.model.exceptions.ModelException
 
modelFileType - Variable in class chat.octet.model.beans.LlamaModelQuantizeParams
quantize to this llama_ftype
ModelFileType - Enum Class in chat.octet.model.enums
Model file type definition
ModelParameter - Class in chat.octet.model.parameters
Llama model parameters
ModelParameter() - Constructor for class chat.octet.model.parameters.ModelParameter
 
ModelType - Enum Class in chat.octet.model.enums
Model type
mulMatQ - Variable in class chat.octet.model.beans.LlamaContextParams
if true, use experimental mul_mat_q kernels.

N

NETBSD - Static variable in class chat.octet.model.utils.Platform
 
NoBadWordsLogitsProcessor - Class in chat.octet.model.components.processor.impl
 
NoBadWordsLogitsProcessor(int[]) - Constructor for class chat.octet.model.components.processor.impl.NoBadWordsLogitsProcessor
 
NONE - Enum constant in enum class chat.octet.model.enums.FinishReason
Default type.

O

OASST_LLAMA - Enum constant in enum class chat.octet.model.enums.ModelType
 
offloadKqv - Variable in class chat.octet.model.beans.LlamaContextParams
whether to offload the KQV ops (including the KV cache) to GPU.
onlyCopy - Variable in class chat.octet.model.beans.LlamaModelQuantizeParams
only copy tensors - ftype, allow_requantize and quantize_output_tensor are ignored
OPEN_BUDDY - Enum constant in enum class chat.octet.model.enums.ModelType
 
OPENBSD - Static variable in class chat.octet.model.utils.Platform
 
output() - Method in class chat.octet.model.Generator
Output the generated text as a stream.

P

Platform - Class in chat.octet.model.utils
 
processor(int[], float[], Object...) - Method in class chat.octet.model.components.processor.impl.CustomBiasLogitsProcessor
 
processor(int[], float[], Object...) - Method in class chat.octet.model.components.processor.impl.NoBadWordsLogitsProcessor
 
processor(int[], float[], Object...) - Method in interface chat.octet.model.components.processor.LogitsProcessor
Logits processor
processor(int[], float[], Object...) - Method in class chat.octet.model.components.processor.LogitsProcessorList
 
PromptBuilder - Class in chat.octet.model.utils
Prompt builder
pure - Variable in class chat.octet.model.beans.LlamaModelQuantizeParams
disable k-quant mixtures and quantize all tensors to the same type

Q

quantizeOutputTensor - Variable in class chat.octet.model.beans.LlamaModelQuantizeParams
quantize output.weight
QWEN - Enum constant in enum class chat.octet.model.enums.ModelType
 

R

REDPAJAMA_INCITE - Enum constant in enum class chat.octet.model.enums.ModelType
 
release() - Static method in class chat.octet.model.LlamaService
Close model and release all resources.
removeAllChatStatus() - Method in class chat.octet.model.Model
Delete all user session states.
removeChatStatus(String) - Method in class chat.octet.model.Model
Delete the session state of the specified user.
reset() - Method in class chat.octet.model.beans.Status
 
result() - Method in class chat.octet.model.Generator
Return the generated complete text.
ropeFreqBase - Variable in class chat.octet.model.beans.LlamaContextParams
RoPE base frequency.
ropeFreqScale - Variable in class chat.octet.model.beans.LlamaContextParams
RoPE frequency scaling factor.
ropeScalingType - Variable in class chat.octet.model.beans.LlamaContextParams
RoPE scaling type, from `enum llama_rope_scaling_type`.

S

sampling(float[], int[], int, float, float, float, boolean, int, float, float, float, int, float, float, float, float, int, int) - Static method in class chat.octet.model.LlamaService
Inference sampling the next token.
seed - Variable in class chat.octet.model.beans.LlamaContextParams
RNG seed, -1 for random.
SNOOZY - Enum constant in enum class chat.octet.model.enums.ModelType
 
SOLARIS - Static variable in class chat.octet.model.utils.Platform
 
splitMode - Variable in class chat.octet.model.beans.LlamaModelParams
how to split the model across multiple GPUs.
Status - Class in chat.octet.model.beans
 
Status() - Constructor for class chat.octet.model.beans.Status
 
Status(Status) - Constructor for class chat.octet.model.beans.Status
 
STOP - Enum constant in enum class chat.octet.model.enums.FinishReason
Generation stopped by StoppingCriteria.
StoppingCriteria - Interface in chat.octet.model.components.criteria
Customize a controller to implement stop rule control for model inference.
StoppingCriteriaList - Class in chat.octet.model.components.criteria
Stopping criteria list
StoppingCriteriaList() - Constructor for class chat.octet.model.components.criteria.StoppingCriteriaList
 
StoppingCriteriaList(Collection<? extends StoppingCriteria>) - Constructor for class chat.octet.model.components.criteria.StoppingCriteriaList
 
subInputIds(int) - Method in class chat.octet.model.beans.Status
 
subInputIds(int, int) - Method in class chat.octet.model.beans.Status
 
SYSTEM - Enum constant in enum class chat.octet.model.beans.ChatMessage.ChatRole
System prompt

T

tensorSplit - Variable in class chat.octet.model.beans.LlamaModelParams
how to split layers across multiple GPUs (size: LLAMA_MAX_DEVICES).
thread - Variable in class chat.octet.model.beans.LlamaModelQuantizeParams
number of threads to use for quantizing; if <= 0, std::thread::hardware_concurrency() is used
threads - Variable in class chat.octet.model.beans.LlamaContextParams
number of threads used for generation.
threadsBatch - Variable in class chat.octet.model.beans.LlamaContextParams
number of threads used for prompt and batch processing.
toAssistant(String) - Static method in class chat.octet.model.beans.ChatMessage
 
Token - Class in chat.octet.model.beans
Token
Token(int, LlamaTokenType, String) - Constructor for class chat.octet.model.beans.Token
 
TokenDecoder - Class in chat.octet.model
Token decoder
tokenize(byte[], int, int[], int, boolean, boolean) - Static method in class chat.octet.model.LlamaService
Convert the provided text into tokens.
tokenize(String, boolean, boolean) - Static method in class chat.octet.model.LlamaService
Convert the provided text into tokens.
tokenToPiece(int, byte[], int) - Static method in class chat.octet.model.LlamaService
Convert the token id to text piece.
toString() - Method in class chat.octet.model.beans.Metrics
 
toString() - Method in enum class chat.octet.model.enums.LlamaRoPEScalingType
 
toString() - Method in enum class chat.octet.model.enums.LlamaTokenType
 
toString() - Method in enum class chat.octet.model.enums.ModelFileType
 
toString() - Method in class chat.octet.model.Model
 
toSystem(String) - Static method in class chat.octet.model.beans.ChatMessage
 
toUser(String) - Static method in class chat.octet.model.beans.ChatMessage
 
TRUNCATED - Enum constant in enum class chat.octet.model.enums.FinishReason
Generation has exceeded the maximum context limit and has been truncated.

U

UNKNOWN - Enum constant in enum class chat.octet.model.enums.FinishReason
Unknown type, no available token state.
UNSPECIFIED - Static variable in class chat.octet.model.utils.Platform
 
updateFinishReason(FinishReason) - Method in class chat.octet.model.beans.Token
 
USER - Enum constant in enum class chat.octet.model.beans.ChatMessage.ChatRole
User role

V

V1 - Enum constant in enum class chat.octet.model.parameters.GenerateParameter.MirostatMode
 
V2 - Enum constant in enum class chat.octet.model.parameters.GenerateParameter.MirostatMode
 
valueOf(String) - Static method in enum class chat.octet.model.beans.ChatMessage.ChatRole
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class chat.octet.model.enums.FinishReason
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class chat.octet.model.enums.LlamaRoPEScalingType
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class chat.octet.model.enums.LlamaTokenType
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class chat.octet.model.enums.ModelFileType
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class chat.octet.model.enums.ModelType
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class chat.octet.model.parameters.GenerateParameter.MirostatMode
Returns the enum constant of this class with the specified name.
valueOfType(int) - Static method in enum class chat.octet.model.enums.LlamaRoPEScalingType
 
valueOfType(int) - Static method in enum class chat.octet.model.enums.LlamaTokenType
 
valueOfType(int) - Static method in enum class chat.octet.model.enums.ModelFileType
 
values() - Static method in enum class chat.octet.model.beans.ChatMessage.ChatRole
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class chat.octet.model.enums.FinishReason
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class chat.octet.model.enums.LlamaRoPEScalingType
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class chat.octet.model.enums.LlamaTokenType
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class chat.octet.model.enums.ModelFileType
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class chat.octet.model.enums.ModelType
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class chat.octet.model.parameters.GenerateParameter.MirostatMode
Returns an array containing the constants of this enum class, in the order they are declared.
VICUNA - Enum constant in enum class chat.octet.model.enums.ModelType
 
vocabOnly - Variable in class chat.octet.model.beans.LlamaModelParams
only load the vocabulary, no weights.

W

WINDOWS - Static variable in class chat.octet.model.utils.Platform
 
WINDOWSCE - Static variable in class chat.octet.model.utils.Platform
 

Y

yarnAttnFactor - Variable in class chat.octet.model.beans.LlamaContextParams
YaRN magnitude scaling factor.
yarnBetaFast - Variable in class chat.octet.model.beans.LlamaContextParams
YaRN low correction dim.
yarnBetaSlow - Variable in class chat.octet.model.beans.LlamaContextParams
YaRN high correction dim.
yarnExtFactor - Variable in class chat.octet.model.beans.LlamaContextParams
YaRN extrapolation mix factor, NaN = from model.
yarnOrigCtx - Variable in class chat.octet.model.beans.LlamaContextParams
YaRN original context size.