What will the latency be like when using a self-hosted LLM?

I have my own fine-tuned LLM and am wondering whether there is any difference in latency between using the embedded LLM and using my own self-hosted LLM.