How to add Dubbing on Kapwing

How to add Dubbing on Kapwing

If you have a global audience, it might be hard to make a new video over and over in different languages. The workaround? Dubbing!

Dubbing is the adding of a different language to a video that has already been shot. A good example is when you watch a foreign film originally spoken in Mandarin, but you watch it in English.

How do I add dubbing on Kapwing?
How do I dub with a custom voice?
What languages does Kapwing support for dubbing?

How to Dub on Kapwing

You can do dub videos yourself on Kapwing. Follow these instructions to dub your video automatically:

  1. Upload your video(s) to the Kapwing Studio
  2. Click "Subtitles" on the left side of the studio
  3. Click "Dub Audio" to start the process of transcription, translation, and synthetic voice dubbing
  4. Select the assets you want to have dubbed (if you uploaded multiple videos, they will all be "pre-selected" if there is audio detected)
  5. Select your preference(s) for the dubbing, including the original language of the video, the target language you want to dub into, the voice, and the number of speakers in the video
  6. Click "Dub Video"
  7. Make any other edits you want before exporting the project

Kapwing will take a few minutes to dub your video, depending on the video length.

Alternatively, you can dub your video step by step to ensure the highest quality possible.

  1. Upload: Start by uploading the video and auto-generating subtitles in the original language with no translation.
  2. Transcribe: Review the captions to ensure that the transcription is correct. Make any edits to the text and timings of the subtitles if they're off using the subtitles tab.
  3. Translate: Use the "Smart Tools" action menu to find "Translate Subtitles." Translate the subtitles into the target language. Review the translation to ensure that the new dialogue makes sense. Kapwing is fully collaborative, so you can send the URL to a global partner for quality control if needed.
  4. Dub: Click "Smart Tools" > "Dub Audio" to create the synthetic voiceover in the translated language. You'll chose the voice and speaker settings before processing the dub.
  5. [Optional] Lip Sync: Kapwing's AI-powered lip sync feature will change the lips of the speaker to match the new synthetic text. Find this feature under the "Smart Tools" menu after you generate the dub.
  6. [Optional] Translate Text: You can also translate the text within the contents of the video or image. This is helpful if your video has text embedded in it, like in a presentation, conference recording, lecture, training film, or Zoom meeting. Select the video and click "Translate Text" in the right side-bar. Kapwing will identify all text layers in the video, translate them, and add matching text overlays with the translation. You can review all added layers and move, edit, or delete them.
  7. Review: Kapwing is a fully-featured video editor, so users can change the speed of the audio, make cuts, add subtitles, and more. If you make changes to the dubbed voiceover, you can re-generate the synethic voice to match the new script.
  8. Export: Click "Export Project" to download and share the processed MP4 video.

Can I dub using a translated SRT file?

Yes! You can upload an SRT file to Kapwing to use as the basis for the dubbed voice. When you've uploaded the video and clicked "Dub Audio," use the "Upload SRT/VTT" button in the target langauge to import an existing captions file. Kapwing will use this file as the basis for the dubbed video.

How to Dub on Kapwing with a Custom Voice?

Kapwing* offers the ability to save a clone of your voice or upload a voice of your choosing allowing you to create a text to speech layer using your own voice model.

To add a voice clone, you must be a Business customer. Business plan customers can save up to 2 voice clones in their Brand Kit. Once you've upgraded to the Business Plan, click the "Add new Voice" button in the Text to Speech dropdown menu. You'll be prompted to upload an example of the speaker whose voice you want to clone**.

When dubbing, you can select the voice you have uploaded in Brand Kit under the "Voice" dropdown.

To delete a voice clone, go to your Brand Kit and scroll down to the saved voice clones. Hover over a voice model icon and click the delete icon that appears in the upper corner.

*We've enabled Voice Cloning in partnership with Eleven Labs.
**Customers must have the rights to clone a speaker's voice, as noted in Kapwing's terms of service.

Is Video Dubbing Free? How Much Does Video Dubbing on Kapwing Cost?

Video Dubbing on Kapwing is free to try. To try it, create an account on Kapwing and upload a short video. Free users can upload a video less than 8 minutes long to dub a video. Choose a realistic synthetic voice to use as the dubbed audio.

To use Lip Sync, users can upgrade to Kapwing Pro. A Pro subscription includes up to 300 minutes of video dubbing each month, and it also removes the watermark from the exported video. See our pricing page for more info.

To dub a video in the same voice as the original speaker (voice cloning), you'll need to upgrade to Kapwing's Business or Enterprise plan. Both Business and Enterprise plans are billed per-seat, meaning each editor will need a license to access the platform.  

How does Voice Dubbing on Kapwing work?

Originally a video and audio editing platform, Kapwing integrates multiple AI technologies to power our video dubbing product.

  • Clean audio and background sounds: Kapwing extracts the spoken words of the video from other sounds, like music, laughter, and effects. This enhances the transcription and makes the dubbed audio sound more natural.
  • Transcription: The dialogue of the video is extracted using speech-to-text technology and the team's glossary.
  • Translation: Captions are translated using machine vendors.
  • Synthetic voice generation: A new voiceover is created in the dubbed langauge. Kapwing leverages premium text-to-speech providers to make the voice sound highly realistic. For Business and Enterprise customers, the voice is cloned from the original speaker to make it as realistic as possible. Kapwing detects where the speaker changes so that it creates different voices for each speaker, improving the quality of the dubbed audio.
  • Timing: The new dubbed audio is combined with the original video and background audio track in the timeline. Kapwing uses generative AI to adjust the timing of the translated audio, making it match the original video as closely as possible.
  • Lip sync: Users can turn on Lip Sync to generate a new video layer where the lips of the speaker match the new dubbed audio.
  • Translate Text: Kapwing uses advanced technology to scan the video, identify embedded text layers, translate them, and overlay matching text layers to ensure that embedded text is also translated to the target language.

The result is a dubbed audio track that sounds closer to the original video than any other platform. Our base editing tools are designed for training and communications teams to collaborate on video content, so it's customizable at every step.

What voices do you offer?

Currently, Kapwing offers voices from Google AI and Eleven Labs. Please inquire if you have a specific voice that you'll like Kapwing to add or a vendor you're interested in.

What companies use Kapwing?

Our dubbing product launched in 2024 and is used by communications teams at multinational companies like Chevrolet, SHEIN, OEC, and Hollister plus dozens of Universities, a few churches, and multiple government agencies.

What dubbing features does Kapwing have?

Kapwing has roots in AI video editing, so the integrated AI technology and customization on tweaking the audio and video layer sets us apart from other dubbing products. Here's a list of features that we support

  • ✅ Automatic speech recognition (ASR) and captioning
  • ✅ Machine translation, with support from multiple translation vendors
  • ✅ Background sound preservation
  • ✅ Voice cloning
  • ✅ Lip sync
  • ✅ Support for uploaded SRT files and Slavic, Arabic, and RTL languages
  • ✅ Realistic synthetic voices is 30 languages
  • ✅ Import from YouTube and Google Drive
  • ✅ Speaker labels and dubbing for videos with multiple speakers
  • ✅ Translate text embedded in the video
  • ✅ Advanced timing and speed adjustments technology to match the original timing of the video
  • ✅ Real-time collaboration
  • ✅ Custom Spelling glossary for commonly-used words
  • ✅ Regeneration of text to speech layers
  • ✅ Video uploads up to 6GB

Here are some features that Kapwing's Dubbing platform does not support:

  • ❎ Emotive controls or adjustments
  • ❎ Bulk import or export. Each video must be uploaded and exported individually, although it is possible to make a copy of the video for each target language.
  • ❎ LMS integrations: At this time, we do not support integrations with LMS
  • ❎ Custom pronunciation guide

What languages does Kapwing Dubbing support?

Kapwing uses same 30 different languages for dubbing as it does for text to speech. See the full list of supported languages below.

Supported Language List

English (US)
English (UK)
English (AUS)
Arabic (Multi-Region)
Chinese (Mandarin)
Filipino (Tagolog)
Portuguese (Brazil)
Portuguese (Portugal)
Spanish (Spain)
Spanish (Mexico)

* we do not support voice cloning in this language

Additional Resources:

Looking for more help?

Check our Release Notes for tutorials on how to use the latest Kapwing features!