http(s)://my-awesome-llm-hosted-here/some/random/path */chat/completions*
With the header Authorization
and value Bearer INSERT_SECRET_API_KEY
and Body
text/eventstream
and above is used for all chunks. You must ensure that the text body is correctly formatted. Additionally, you must indicate that your response is done by sending a DONE frame. All responses are followed by 2 newline delimiters. Using FastAPI for example, you would have an async generator looking like this:
python app.py
and passing the hosting URL to Simli. (more examples in other languages coming soon)