Discussions

What will the latency be like if using a self-hosted LLM

3 months ago

I have my own fine-tuned LLM, just wondering if there is any difference in latency if I use the embedded LLM and my own LLM.