How to Remove Subtitles from a Video

When we can’t find the original video without subtitles but want to do some secondary creations, such as making commentary videos, editing, and translating, we have to process the subtitled videos by removing or covering the subtitles.

Here is an example of processing video frames from the video clip of Soldier Sortie. Click on the picture to watch the video.

Video frame with subtitles:

Video frame with subtitles removed with AI:

Video frame with subtitles blurred:

We can see that the effect of AI algorithms on background restoration is fairly good. However, although the effect of a single image is OK, you can still see obvious traces of changes and fluctuations in the generated video. After all, normal videos have 25 frames per second, and the results of AI processing vary from frame to frame. At present, there is no particularly good solution. If we simply use blurring, there will be no obvious fluctuations.

Update: The new ImageTrans version has added the STTN network model, it can remove text based on multiple frames, which can have a better result.

Here’s how to remove subtitles from a video using ImageTrans and Silhouette.

Open the video with Silhouette and export the video frames as images. Save in PNG format to maintain picture quality.
Use Silhouette to generate subtitles based on sound (another approach is to detect hard subtitles). This allows us to know which images contain captions, reducing the amount of data that needs to be processed.
Open the exported images using ImageTrans. Open the video subtitle remover, import subtitles in SRT format, select the subtitle position, and generate text-removed images. If there are missing subtitled images (generally at the beginning and end of a speech line), you can use OCR and other technologies to detect the text and adjust the range of the images with text.

The text removing method can be set in the project settings.
Use Silhouette to generate the final video based on the text-removed images.