Jump to content

How to get my subtitles - Relevant text only?


hebrew videos

Recommended Posts

Thank-you first of all for an awesome product.

I tried to save an srt file, which worked but then I got the following:

 

001

00:00:00,008 --> 00:00:03,003

<b><font color='#FFFFFF'>ישמח חתני </font></b>

<b><font color='#FFFFFF'>Yismach chatani </font></b>

<b><font color='#FFFFFF'>Let my beloved rejoice </font></b>

 

It shows the line of hebrew, then the transliteration, then the English.

 

But I want to copy and paste these to youtube, without all the breaks (<<>>) And font color='#FFFFFF>

 

Is there a way I can get it so that it would just read:

 

ישמח חתנ

Yismach chatani

Let my beloved rejoice

 

without having to manually delete all of those extra superflous letters?

 

I've posted quite a few songs with subtitles on Youtube, and now I want to add the lyrics (subtiltles) in the description, so I'm trying to save them from videopad as srt file, but I get all those extra letters in the way.

Link to comment
Share on other sites

I suppose I can replace those, <b><font color='#FFFFFF'> and </font></b> with blanks in word. But it would be nice not to have the timing seconds and minutes...and the numbering of subtitles, 1, 2, 3..

 

Is there any easier way to download/save just the subtitles alone?

 

Thanks!

Link to comment
Share on other sites

Hi,

 

VP only exports known formats namely srt and ssa.

 

To remove the mark up language tags you can use a text editor with regular expression capability (e.g. Notepad++). Here's an example for how to do this.

 

As a follow up to this, you can remove the subtitle number, timestamp and all formatting tags with the following regex string:

Find: (^\d+\r\n|^\d\d:\d\d:\d\d,\d\d\d --> \d\d:\d\d:\d\d,\d\d\d\r\n|<[^>]+>)

Replace With: (i.e. nothing)

 

This will leave you with just the text of the subtitles.

Link to comment
Share on other sites

Hi

 

I couldn't get c-majors link to work, but as he says you can use Notepad++. Although you probably already know how to do this here are the basic steps for the standard Notepad.....

 

Open the .srt file in Notepad using "Open with.."

In Notepad the file will appear something like this..

 

d930fac0c000f54411e642f7da50168c.png

 

Select a repeated portion of the text eg <font colour='#000000'>

 

 

 

Right click and Copy the selected text

Click Edit

Click Replace

Past the text into the search box leaving the Replace with box blank

Click the Replace all button

 

This will remove all instances of the selected text from the document and re-format it.

Repeat for all the other elements that are repeated e.g. </font>

Time markings and text block numbers will probably have to be deleted manually as the Notepad text is different for each number portion. As they come up in separate lines, the routine text select and Backspace may be the best way to remove these.

 

You should eventually end up with just your text.

 

 

 

Nat

Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...