Configuration property fixed at build time - All other configuration properties are overridable at runtime

Configuration property

Type

Default

Path to the directory containing the documents to be ingested.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_PATH

string

required

Matcher used to filter which files from the directory are ingested. It uses the java.nio.file.FileSystem path matcher syntax. For example, glob:**.txt recursively matches all files with the .txt extension. The default is glob:**, which recursively matches all files.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_PATH_MATCHER

string

glob:**
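The glob syntax mentioned above can be tried directly with the JDK. The following is a minimal sketch that only exercises java.nio.file.PathMatcher; the class name GlobMatcherDemo and the sample paths are hypothetical, and how the extension itself applies the matcher to each file (for example, against absolute or relative paths) is not shown here.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;

// Hypothetical helper for experimenting with path matcher patterns.
// It is not part of the extension; it only calls the JDK's PathMatcher.
final class GlobMatcherDemo {

    // Returns true if the path matches the given "syntax:pattern" string,
    // for example "glob:**.txt".
    static boolean matches(String syntaxAndPattern, String path) {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher(syntaxAndPattern);
        return matcher.matches(Paths.get(path));
    }

    public static void main(String[] args) {
        // "**" crosses directory boundaries, so nested files match.
        System.out.println(matches("glob:**.txt", "docs/guides/intro.txt"));
        // A different extension does not match.
        System.out.println(matches("glob:**.txt", "docs/readme.md"));
        // A single "*" does not cross directory boundaries.
        System.out.println(matches("glob:*.txt", "docs/guides/intro.txt"));
        // The default pattern accepts every path.
        System.out.println(matches("glob:**", "docs/readme.md"));
    }
}
```

Testing a pattern this way before setting it as the path matcher can help confirm that it selects the intended files.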

Whether to recursively ingest documents from subdirectories.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_RECURSIVE

boolean

true

Maximum segment size when splitting documents, in tokens.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_MAX_SEGMENT_SIZE

int

300

Maximum overlap between consecutive segments when splitting documents, in tokens.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_MAX_OVERLAP_SIZE

int

30
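To illustrate how a maximum segment size and a maximum overlap interact, here is a simplified sliding-window sketch. It is an assumption-laden illustration, not the splitter used by the extension: tokens are plain strings, every overlap is exactly the configured maximum, and the class name OverlapSplitSketch is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Simplified illustration of splitting with overlap (NOT the extension's splitter).
final class OverlapSplitSketch {

    // Splits tokens into segments of at most maxSegmentSize tokens, where each
    // segment starts with the last maxOverlap tokens of the previous one.
    static List<List<String>> split(List<String> tokens, int maxSegmentSize, int maxOverlap) {
        if (maxSegmentSize <= 0 || maxOverlap < 0 || maxOverlap >= maxSegmentSize) {
            throw new IllegalArgumentException("require 0 <= maxOverlap < maxSegmentSize");
        }
        List<List<String>> segments = new ArrayList<>();
        int step = maxSegmentSize - maxOverlap; // how far the window advances each time
        for (int start = 0; start < tokens.size(); start += step) {
            int end = Math.min(start + maxSegmentSize, tokens.size());
            segments.add(new ArrayList<>(tokens.subList(start, end)));
            if (end == tokens.size()) {
                break; // the last token has been covered
            }
        }
        return segments;
    }

    public static void main(String[] args) {
        List<String> tokens = new ArrayList<>();
        for (int i = 0; i < 600; i++) {
            tokens.add("t" + i);
        }
        // With the defaults listed above (300 and 30), the window advances 270 tokens at a time.
        List<List<String>> segments = split(tokens, 300, 30);
        System.out.println(segments.size());
    }
}
```

A larger overlap keeps more shared context between neighbouring segments at the cost of storing more, partially redundant, segments.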

Maximum number of results to return when querying the retrieval augmentor.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_MAX_RESULTS

int

5

Strategy that decides whether documents are ingested into the embedding store at startup. The default is ON. Changing it to OFF generally only makes sense when running against a persistent embedding store that has already been populated.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_INGESTION_STRATEGY

on, off

on

Whether to reuse embeddings saved to a file instead of computing them again. Defaults to false.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_REUSE_EMBEDDINGS_ENABLED

boolean

false

Path of the file used to load and save embeddings when quarkus.langchain4j.easy-rag.reuse-embeddings.enabled is true.

Defaults to easy-rag-embeddings.json in the current directory.

Environment variable: QUARKUS_LANGCHAIN4J_EASY_RAG_REUSE_EMBEDDINGS_FILE

string

easy-rag-embeddings.json
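The environment variable names listed above follow the usual MicroProfile Config convention: every character of the property name that is not alphanumeric or an underscore is replaced with an underscore, and the result is upper-cased. The sketch below applies that rule to the one property name spelled out in this section; the class name EnvVarNameSketch is hypothetical, and the code is an illustration of the convention rather than the configuration system's own implementation.

```java
// Hypothetical helper that derives an environment variable name from a
// property name using the replace-and-uppercase convention.
final class EnvVarNameSketch {

    static String toEnvVarName(String propertyName) {
        StringBuilder sb = new StringBuilder(propertyName.length());
        for (char c : propertyName.toCharArray()) {
            boolean keep = (c >= 'a' && c <= 'z')
                    || (c >= 'A' && c <= 'Z')
                    || (c >= '0' && c <= '9')
                    || c == '_';
            // Dots and dashes both become underscores; letters are upper-cased.
            sb.append(keep ? Character.toUpperCase(c) : '_');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(toEnvVarName("quarkus.langchain4j.easy-rag.reuse-embeddings.enabled"));
        // prints QUARKUS_LANGCHAIN4J_EASY_RAG_REUSE_EMBEDDINGS_ENABLED
    }
}
```

Because dots and dashes both map to underscores, the conversion is one-way: the property name cannot be recovered unambiguously from the environment variable name alone.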