Configuration property fixed at build time - All other configuration properties are overridable at runtime
Type |
Default |
|
|---|---|---|
Path to the directory containing the documents to be ingested. Environment variable: |
string |
required |
Matcher used for filtering which files from the directory should be ingested. This uses the Environment variable: |
string |
|
Whether to recursively ingest documents from subdirectories. Environment variable: |
boolean |
|
Maximum segment size when splitting documents, in tokens. Environment variable: |
int |
|
Maximum overlap (in tokens) when splitting documents. Environment variable: |
int |
|
Maximum number of results to return when querying the retrieval augmentor. Environment variable: |
int |
|
The strategy to decide whether document ingestion into the store should happen at startup or not. The default is ON. Changing to OFF generally only makes sense if running against a persistent embedding store that was already populated. Environment variable: |
|
|
Whether or not to reuse embeddings. Defaults to Environment variable: |
boolean |
|
The file path to load/save embeddings, assuming Defaults to Environment variable: |
string |
|