Results & Response — Schemas

SegmentResult Schema

The structure of each time-aligned segment.


Field reference

Field     Type          Description
start     float         Segment start time in seconds.
end       float         Segment end time in seconds.
text      string        Transcript text for this segment.
speaker   string|null   Speaker label. Present when enable_diarization=true.
emotion   string|null   Detected emotion. Present when enable_emotion_diarization=true. NOT in nlp_analysis.
words     array|null    Per-word timestamps (word, start, end, score). Present when word_timestamps=true.
Important: emotion is a per-segment field, read from each entry of segments[]. It is NOT in nlp_analysis.
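
As an illustration of the field reference, here is a minimal Python sketch of the segment shape and of reading emotion from segments[] rather than from nlp_analysis. It assumes the response is already parsed into a dict with a top-level "segments" list. The type names, the helper emotions_by_segment, and the sample values (such as the "SPEAKER_00" and "neutral" labels) are hypothetical and not part of the documented API.

```python
from typing import List, Optional, TypedDict


class Word(TypedDict):
    # Per-word entry, following the (word, start, end, score) tuple in the field reference.
    word: str
    start: float
    end: float
    score: float


class SegmentResult(TypedDict):
    start: float                  # segment start time in seconds
    end: float                    # segment end time in seconds
    text: str                     # transcript text for this segment
    speaker: Optional[str]        # set when enable_diarization=true
    emotion: Optional[str]        # set when enable_emotion_diarization=true
    words: Optional[List[Word]]   # set when word_timestamps=true


def emotions_by_segment(response: dict) -> list:
    """Collect (start, end, emotion) for every segment that carries an emotion label."""
    found = []
    for seg in response.get("segments", []):
        # .get() covers both a null value and an absent key.
        emotion = seg.get("emotion")
        if emotion is not None:
            found.append((seg["start"], seg["end"], emotion))
    return found


# Hypothetical sample payload, for illustration only.
sample_response = {
    "segments": [
        {"start": 0.0, "end": 2.4, "text": "Hello there.",
         "speaker": "SPEAKER_00", "emotion": "neutral", "words": None},
        {"start": 2.4, "end": 5.1, "text": "Great to see you!",
         "speaker": None, "emotion": None, "words": None},
    ],
    "nlp_analysis": {},  # emotion is not looked up here
}

print(emotions_by_segment(sample_response))  # [(0.0, 2.4, 'neutral')]
```

Treating speaker, emotion, and words as optional keeps the same code path working whether or not the corresponding request options were enabled.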