Clip, segment, frame
Video annotation
Scope video annotation before clips are labeled.
Prepare video datasets with labels, timing rules, segment definitions, and review samples before annotation begins.
Object, event, speech, scene
For speech or text context review
For annotation work, scope starts with label definitions, examples, review samples, and output schema. Language review is added when meaning, dialect, script, or audio context affects the label decision.
What this page helps you scope
- Clip classification, scene tagging, event labels, and object review.
- Speech-adjacent tagging where language context matters.
- Video datasets that need frame, segment, or clip-level labels.
- Pilot review before a larger annotation run.
What you receive
- Annotated video dataset.
- Timing and segment notes.
- Output file in agreed structure.
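As an illustration of what an agreed structure might look like, the sketch below defines one annotation record and writes records as JSON Lines. The field names (`clip_id`, `label`, `level`, `start_s`, `end_s`) and the JSON Lines format are assumptions made for this example, not a fixed delivery format; the actual schema is settled during scoping.

```python
import json
from dataclasses import dataclass, asdict

# Hypothetical timing levels; the real set is agreed during scoping.
LEVELS = {"frame", "segment", "clip"}


@dataclass
class Annotation:
    clip_id: str    # identifier of the source clip (assumed field)
    label: str      # label taken from the agreed label definitions
    level: str      # "frame", "segment", or "clip"
    start_s: float  # start time in seconds from the beginning of the clip
    end_s: float    # end time in seconds; equals start_s for a single frame

    def validate(self) -> None:
        """Raise ValueError if the record breaks the assumed timing rules."""
        if self.level not in LEVELS:
            raise ValueError(f"unknown level: {self.level}")
        if self.start_s < 0 or self.end_s < self.start_s:
            raise ValueError("timing must satisfy 0 <= start_s <= end_s")
        if self.level == "frame" and self.end_s != self.start_s:
            raise ValueError("a frame-level label must have start_s == end_s")


def to_jsonl(annotations):
    """Validate each record and return one JSON object per line."""
    lines = []
    for annotation in annotations:
        annotation.validate()
        lines.append(json.dumps(asdict(annotation)))
    return "\n".join(lines)


records = [
    Annotation("clip_001", "door_open", "segment", 1.5, 3.0),
    Annotation("clip_001", "indoor", "clip", 0.0, 12.0),
]
print(to_jsonl(records))
```

Validating before export means a record with an unnamed timing level or reversed timestamps is caught before the file is handed over.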
Questions teams ask first
Should labels be frame-level or clip-level?
That depends on the model the labels will train or the review they will support. Name the timing level (frame, segment, or clip) before annotation begins.
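To make the difference between timing levels concrete, the sketch below collapses per-frame labels into timed segments. It assumes a constant frame rate and one label per frame; the function name `frames_to_segments` and the `fps` parameter are hypothetical and chosen only for this example.

```python
from itertools import groupby


def frames_to_segments(frame_labels, fps):
    """Merge runs of identical consecutive frame labels.

    Returns a list of (label, start_s, end_s) tuples, where times are
    seconds from the start of the clip and end_s is exclusive.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    segments = []
    index = 0  # index of the first frame in the current run
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)
        segments.append((label, index / fps, (index + length) / fps))
        index += length
    return segments


labels = ["idle", "idle", "run", "run", "run", "idle"]
print(frames_to_segments(labels, fps=2))
# → [('idle', 0.0, 1.0), ('run', 1.0, 2.5), ('idle', 2.5, 3.0)]
```

Frame-level labels can be rolled up into segments this way, but segment or clip labels cannot be split back into frames without re-annotating, which is one reason to fix the timing level early.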
Can multilingual speech affect video labels?
Yes. When speech or on-screen text changes the label, language review should be part of the scope.