Extract boost list from documents
Extracts a categorized word boost list from vault documents or raw text using LLM entity extraction. The resulting list can be passed as word_boost to the transcription endpoint for improved accuracy.
POST
Authorizations
API key starting with sk_case_
Body
application/json
Source documents or text for entity extraction
Vault ID containing the source documents (use with object_ids)
Object IDs of documents to extract entities from (PDFs, text files)
Raw text input for entity extraction (alternative to vault documents)
Optional filter for entity categories to extract
Available options:
person, organization, legal_term, medical, citation, email Previous
Generate boost list from transcriptGenerates a categorized word boost list from a completed transcription job. Extracts entities from the pass-1 transcript for use as `word_boost` in a second transcription pass.
Next

