Create a completion for the provided prompt and parameters. Compatible with OpenAI’s chat completions API. Supports 40+ models including GPT-4, Claude, Gemini, and CaseMark legal AI models. Includes streaming support, token counting, and usage tracking.
API key starting with sk_case_
List of messages comprising the conversation
Model to use for completion. Defaults to casemark/casemark-core-3 if not specified
"casemark/casemark-core-3"
Maximum number of tokens to generate
1000
Sampling temperature between 0 and 2
0.7
Whether to stream back partial progress
false
CaseMark-only: when true, allows reasoning fields in responses. Defaults to false (reasoning is suppressed).
false
Nucleus sampling parameter
Frequency penalty parameter
Presence penalty parameter