Package chat.octet.model.beans
Class LlamaContextParams
java.lang.Object
chat.octet.model.beans.LlamaContextParams
Llama context params entity
- Author:
- William
-
Field Summary
FieldsModifier and TypeFieldDescriptionintprompt processing batch size.inttext context size.booleanembedding mode only.booleanuse fp16 for KV cache.booleanthe llama_eval() call computes all logits, not just the last one.booleanif true, use experimental mul_mat_q kernels.floatRoPE base frequency.floatRoPE frequency scaling factor.intRoPE scaling type, from `enum llama_rope_scaling_type`.intRNG seed, -1 for random.intnumber of threads used for generation.intnumber of threads used for prompt and batch processing.floatYaRN magnitude scaling factor.floatYaRN low correction dim.floatYaRN high correction dim.floatYaRN extrapolation mix factor, NaN = from model.intYaRN original context size. -
Constructor Summary
Constructors -
Method Summary
-
Field Details
-
seed
public int seedRNG seed, -1 for random. -
ctx
public int ctxtext context size. -
batch
public int batchprompt processing batch size. -
threads
public int threadsnumber of threads used for generation. -
threadsBatch
public int threadsBatchnumber of threads used for prompt and batch processing. -
ropeScalingType
public int ropeScalingTypeRoPE scaling type, from `enum llama_rope_scaling_type`.- See Also:
-
yarnExtFactor
public float yarnExtFactorYaRN extrapolation mix factor, NaN = from model. -
yarnAttnFactor
public float yarnAttnFactorYaRN magnitude scaling factor. -
yarnBetaFast
public float yarnBetaFastYaRN low correction dim. -
yarnBetaSlow
public float yarnBetaSlowYaRN high correction dim. -
yarnOrigCtx
public int yarnOrigCtxYaRN original context size. -
ropeFreqBase
public float ropeFreqBaseRoPE base frequency. -
ropeFreqScale
public float ropeFreqScaleRoPE frequency scaling factor. -
mulMatQ
public boolean mulMatQif true, use experimental mul_mat_q kernels. -
f16KV
public boolean f16KVuse fp16 for KV cache. -
logitsAll
public boolean logitsAllthe llama_eval() call computes all logits, not just the last one. -
embedding
public boolean embeddingembedding mode only.
-
-
Constructor Details
-
LlamaContextParams
public LlamaContextParams()
-