An overview of rules-based ingestion in Morphik
morphik.toml file using the registered models approach:
The batch_size configuration in morphik.toml determines how content is split up before it is passed to the LLM.
Larger batch sizes may improve throughput but require more memory, and a very large batch size can produce unreliable results. For complex rules or larger documents, you may need to tune this setting to match your hardware capabilities and the latency requirements of your application.
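
To make the trade-off concrete, here is a minimal Python sketch of how a batch size bounds the amount of content sent in each LLM call. It is an illustration only, not Morphik's implementation: the helper name split_into_batches is hypothetical, and it assumes the batch size is measured in characters, which the text above does not confirm.

```python
from typing import List


def split_into_batches(text: str, batch_size: int) -> List[str]:
    """Split text into consecutive pieces of at most batch_size characters.

    Hypothetical helper for illustration; the unit (characters) is an assumption.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    return [text[i:i + batch_size] for i in range(0, len(text), batch_size)]


# A 10,000-character document under two different batch sizes.
document = "a" * 10_000
for size in (1024, 4096):
    batches = split_into_batches(document, size)
    # Each batch would correspond to one call to the LLM.
    print(f"batch_size={size}: {len(batches)} batches, largest={max(map(len, batches))}")
```

A smaller batch size means more calls, each carrying less content. A larger one means fewer calls, each with more content for the model to process at once, which is where the memory and reliability concerns come from.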