Creating an AI task.
This method allows you to create an AI task for VOD video processing:

How to use:
task_id ... /ai/tasks/{task_id} method
For more detailed information, see the description of each method separately.
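The usage flow (create a task, then look it up by its task_id) can be sketched as a small polling loop. This is a minimal sketch, not the official client: `wait_for_task`, its parameters, and the "PENDING" status are hypothetical names chosen for illustration. Only the "SUCCESS" status appears in this text, so any failure statuses would need to be added to `terminal_statuses` by the caller. The HTTP call is injected as a function so the loop can be tried without network access.

```python
import time
from typing import Callable, Iterable


def wait_for_task(
    fetch_task: Callable[[], dict],
    terminal_statuses: Iterable[str] = ("SUCCESS",),
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_task() until it reports a terminal status or attempts run out."""
    terminal = set(terminal_statuses)
    for attempt in range(max_attempts):
        task = fetch_task()
        if task.get("status") in terminal:
            return task
        # Do not sleep after the final attempt; fail immediately instead.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"task not finished after {max_attempts} attempts")


# Stand-in for a real HTTP request: returns canned responses in order.
# "PENDING" is a hypothetical in-progress status used only for this demo.
_canned = iter([
    {"status": "PENDING"},
    {"status": "PENDING"},
    {"status": "SUCCESS", "result": {}},
])
calls = []


def fake_fetch() -> dict:
    response = next(_canned)
    calls.append(response["status"])
    return response


final = wait_for_task(fake_fetch, interval_seconds=0.0)
print(final["status"], "after", len(calls), "requests")
```

In real use, `fetch_task` would wrap an authenticated request to the task lookup method and return the decoded JSON body.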
AI Automatic Speech Recognition (ASR)
Automatic Speech Recognition (ASR) transcribes the spoken words in a video into text for subtitles, which can then be translated into multiple languages for broader accessibility.
Categories:
transcription – creates subtitles/captions from the audio in the original language.
translation – translates subtitles/captions from the original language into 99+ other languages.
AI subtitle transcription and translation tools are highly efficient, processing large volumes of audio-visual content quickly and providing accurate transcriptions and translations with minimal human intervention. Additionally, AI-driven solutions can significantly reduce costs and turnaround times compared to traditional methods, making them an invaluable resource for content creators and broadcasters aiming to reach global audiences.
Example response with positive result:
{
"status": "SUCCESS",
"result": {
"subtitles": [
{
"start_time": "00:00:00.031",
"end_time": "00:00:03.831",
"text": "Come on team, ..."
}, ...
],
"vttContent": "WEBVTT\n\n1\n00:00:00.031 --> 00:00:03.831\nCome on team, ...",
"concatenated_text": "Come on team, ...",
"languages": [ "eng" ],
"speech_detected": true
}, ...
}
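As an illustration of consuming the response above, the following sketch converts the `subtitles` list into SRT text. The helper names `to_srt_time` and `subtitles_to_srt` are hypothetical, and the sample data only mirrors the field names shown in the example response. It assumes timestamps always have the `HH:MM:SS.mmm` form seen there.

```python
def to_srt_time(timestamp: str) -> str:
    """Convert 'HH:MM:SS.mmm' (as in the example response) to SRT's 'HH:MM:SS,mmm'."""
    return timestamp.replace(".", ",")


def subtitles_to_srt(result: dict) -> str:
    """Render the 'subtitles' list of a task result as SRT text."""
    blocks = []
    for index, cue in enumerate(result.get("subtitles", []), start=1):
        start = to_srt_time(cue["start_time"])
        end = to_srt_time(cue["end_time"])
        blocks.append(f"{index}\n{start} --> {end}\n{cue['text']}")
    # SRT cues are separated by a blank line; end the file with a newline.
    return "\n\n".join(blocks) + ("\n" if blocks else "")


# Sample data shaped like the example response (values are illustrative).
sample = {
    "status": "SUCCESS",
    "result": {
        "subtitles": [
            {
                "start_time": "00:00:00.031",
                "end_time": "00:00:03.831",
                "text": "Come on team",
            },
        ],
        "languages": ["eng"],
        "speech_detected": True,
    },
}

if sample["status"] == "SUCCESS" and sample["result"]["speech_detected"]:
    print(subtitles_to_srt(sample["result"]))
```

If WebVTT output is enough, the `vttContent` field can be written to a file directly and no conversion is needed.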
AI Content Moderation (CM)
The AI Content Moderation API offers a powerful solution for analyzing video content to detect various categories of inappropriate material. Leveraging state-of-the-art AI models, this API ensures real-time analysis and flagging of sensitive or restricted content types, making it an essential tool for platforms requiring stringent content moderation.
Categories:
nsfw: Quick algorithm that detects pornographic material and classifies content as “not-safe-for-work” or normal.
hard_nudity: Detailed video analysis that detects explicit nudity involving genitalia.
soft_nudity: Detailed video analysis that detects both explicit and partial nudity, including male and female faces and other uncovered body parts.
sport: Recognizes various sporting activities.
The AI Content Moderation API is an invaluable tool for managing and controlling the type of content being shared or streamed on your platform. By implementing this API, you can ensure compliance with community guidelines and legal requirements, as well as provide a safer environment for your users.
Important notes:
Example response with positive result:
{
"status": "SUCCESS",
"result": {
"nsfw_detected": true,
"detection_results": [ "nsfw" ],
"frames": [
{
"label": "nsfw",
"confidence": 1.0,
"frame_number": 24
},...
]
}
}
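One way to act on a moderation response like the one above is to group the flagged frames by label and ignore low-confidence hits. This is a sketch under assumptions: the function name `frames_by_label` and the default threshold of 0.5 are hypothetical choices, not values documented here, and the sample frames are invented for the demo.

```python
from collections import defaultdict


def frames_by_label(response: dict, min_confidence: float = 0.5) -> dict:
    """Group flagged frame numbers by label, keeping frames at or above min_confidence."""
    if response.get("status") != "SUCCESS":
        return {}
    grouped = defaultdict(list)
    for frame in response.get("result", {}).get("frames", []):
        if frame["confidence"] >= min_confidence:
            grouped[frame["label"]].append(frame["frame_number"])
    return dict(grouped)


# Sample data shaped like the example response (second frame is invented).
sample = {
    "status": "SUCCESS",
    "result": {
        "nsfw_detected": True,
        "detection_results": ["nsfw"],
        "frames": [
            {"label": "nsfw", "confidence": 1.0, "frame_number": 24},
            {"label": "nsfw", "confidence": 0.3, "frame_number": 48},
        ],
    },
}

print(frames_by_label(sample))
```

Raising or lowering `min_confidence` trades missed detections against false positives; the right value depends on the platform's moderation policy.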
Additional information
Billing is based on the duration of the analyzed video, or on the duration up to the stop tag (where applicable) if the stop condition was triggered during the analysis.
Content moderation is built on AI models supported by additional services. All of them run on our own infrastructure, so your files and data are never transferred to external services. After processing, the original files are also deleted from the AI's local storage.
For more details about the solution, its architecture, and its benefits, see the knowledge base and blog.
API key for authentication. Make sure to include the word apikey, followed by a single space and then your token.
Example: apikey 1234$abcdef
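The credential format described above can be sketched as a small helper. The header name `Authorization` is an assumption (this text only specifies the `apikey <token>` value format), and `build_auth_headers` is a hypothetical name.

```python
def build_auth_headers(token: str) -> dict:
    """Return request headers carrying the API key as 'apikey <token>'."""
    if not token or token != token.strip():
        raise ValueError("token must be non-empty and have no surrounding whitespace")
    # Assumed header name; the value is the word 'apikey', one space, then the token.
    return {"Authorization": f"apikey {token}"}


print(build_auth_headers("1234$abcdef"))
```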
Name of the task to be performed
URL to the MP4 file to analyse. File must be publicly accessible via HTTP/HTTPS.
Language in original audio (transcription only). This value is used to determine the language from which to transcribe.
If this is not set, the system runs automatic language identification and the subtitles are produced in the detected language. The identification is itself based on AI analysis and is fairly accurate; if it picks the wrong language, set the language explicitly.
When this is not set, alternating languages within the video (language code-switching) are also recognized.
The language is specified as a 3-letter ISO 639-2 code (bibliographic code).
We can process languages:
Explicitly specifies the language to translate into.
If this is not set, the original language will be used from attribute "audio_language".
Please note that:
Meta parameter, designed to store your own identifier. Can be used by you to tag requests from different end-users. It is not used in any way in video processing.
Maximum length: 256
Meta parameter, designed to store your own extra information about a video entity: video source, video id, etc. It is not used in any way in video processing.
For example, if an AI task was created automatically when you uploaded a video with the AI auto-processing option (transcription, translation, nudity detection, etc.), then the ID of the associated video for which the task was performed will be explicitly indicated here.
Maximum length: 4096
Model for analysis (content-moderation only). Determines what exactly needs to be found in the video.
Possible values: sport, nsfw, hard_nudity, soft_nudity
Response returns the ID of the created AI task. Using this AI task ID, you can check the status and get the video processing result. See the GET /ai/results method.
ID of the created AI task, from which you can get the execution result
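The request parameters described above could be assembled and checked on the client side before sending. This is a sketch under stated assumptions: apart from `audio_language`, the field names (`task_name`, `url`, `category`, `client_user_id`) are hypothetical because this text does not name them, and the validation rules simply restate the descriptions (HTTP/HTTPS URL, 3-letter language code, one of the four moderation categories).

```python
import re
from typing import Optional

# Categories listed for content moderation in this document.
CM_CATEGORIES = {"sport", "nsfw", "hard_nudity", "soft_nudity"}


def build_task_payload(
    task_name: str,
    url: str,
    audio_language: Optional[str] = None,
    category: Optional[str] = None,
    client_user_id: Optional[str] = None,
) -> dict:
    """Build a request body for task creation (field names are assumptions)."""
    if not url.startswith(("http://", "https://")):
        raise ValueError("url must be reachable over HTTP or HTTPS")
    payload = {"task_name": task_name, "url": url}
    if audio_language is not None:
        # Only the shape of the code is checked, not whether it is a real language.
        if not re.fullmatch(r"[a-z]{3}", audio_language):
            raise ValueError("audio_language must be a 3-letter code")
        payload["audio_language"] = audio_language
    if category is not None:
        if category not in CM_CATEGORIES:
            raise ValueError(f"category must be one of {sorted(CM_CATEGORIES)}")
        payload["category"] = category
    if client_user_id is not None:
        payload["client_user_id"] = client_user_id
    return payload


asr_payload = build_task_payload(
    "transcription", "https://example.com/video.mp4", audio_language="eng"
)
cm_payload = build_task_payload(
    "content-moderation", "https://example.com/video.mp4", category="nsfw"
)
print(asr_payload)
print(cm_payload)
```

The resulting dictionary would be serialized as JSON and sent with the authentication header; the response then carries the task ID used to fetch the result.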