Find a YouTube video based on closed captions keyword search?

32 points by sigmaprimus 2 years ago · 17 comments · 1 min read

I am trying to find a Jay Leno Headlines item of my cousin's name and picture appearing in the Regina Leader Post as the Paperboy of the week/month. I have watched many videos but there are hundreds of hours and many duplicates.

I was hoping that the closed captions for videos were indexed by google but I am not sure if they are and if there is a certain syntax to use for searching captions.

I tried googling this but have only found searching within a specific video rather than across the library.

To be more specific, his name is Rick Lee and the Headline appeared on the show sometime around Y2K.

Any help and/or guidance would be greatly appreciated!

Thanks

OrderlyTiamat 2 years ago

this is not a complete solution, but you seem to know what videos you're going to search through rather than wanting to search through everything. If that's the case, I recommend downloading the subtitles for every video via yt-dlp [1] as shown in this stackoverflow question [2] (it doesn't download the video itself this way to save bandwidth):

    yt-dlp --all-subs --skip-download https://www.youtube.com/watch?v=Ye8mB6VsUHw

(The answer uses youtube-dl, but I prefer yt-dlp; it worked with the same options when I tried it.)

When you've downloaded all subtitles, you can simply grep through them. I hope this helps!

[1] https://github.com/yt-dlp/yt-dlp

[2] https://superuser.com/a/927532/1071647
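The grep step above could also be done in a few lines of Python. This is only a minimal sketch: it assumes the subtitles were saved as .vtt files, and the `subs` directory name and the `search_subtitles` helper are hypothetical names chosen for illustration.

```python
from pathlib import Path


def search_subtitles(directory, keyword):
    """Return (file name, line number, line) for every case-insensitive match."""
    needle = keyword.lower()
    hits = []
    # Walk the directory tree and look at every .vtt subtitle file.
    for path in sorted(Path(directory).rglob("*.vtt")):
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line.lower():
                hits.append((path.name, number, line.strip()))
    return hits


if __name__ == "__main__":
    # "subs" is an assumed folder holding the downloaded subtitle files.
    for name, number, line in search_subtitles("subs", "rick lee"):
        print(f"{name}:{number}: {line}")
```

A phrase that is split across two caption lines would be missed by a line-by-line search like this, so searching for a single distinctive word may work better.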

dekatron 2 years ago

Not a solution, but somewhat related: https://youglish.com/ lets you search YouTube videos for keywords, but the purpose is to find examples of how to pronounce words from real usage. It also works for a few other languages aside from English.

  • roelschroeven 2 years ago

    Might be a solution, I think. I'd give it a shot: use it to search for the person's name. I think there's a pretty good chance of it working.

  • benboozled 2 years ago

    This restores a little bit of my faith in humanity. Such a cool tool! Thanks for sharing.

sigmaprimusOP 2 years ago

Thank you to everyone who took the time to read my question and especially those that replied. I have read over the replies and there are many good ideas/solutions to my query.

I am going to start at the top and work my way through them over the next little while and will definitely provide an update of what hopefully worked but it may be a few weeks.

I just wanted to post this comment to let you all know I appreciate the great responses and the HN community. As benboozled commented about his faith being restored in humanity, I am thankful for HN and all of you generous and helpful contributors.

M

pimlottc 2 years ago

There are literally thousands of episodes of The Tonight Show[0], many of which are probably not on YouTube. I would reach out to the Jay Leno fan community and see if anyone has it in a private archive. Never underestimate the organization skills of a dedicated fan.

0: https://en.m.wikipedia.org/wiki/List_of_The_Tonight_Show_wit...

a2128 2 years ago

Someone made a website to search through channels or playlists. Not sure if it still works but it might help: https://ytks.app/

stonks 2 years ago

You can try to use https://filmot.com/ to search videos by closed captions.

I tried to search but could not find the video.

activiation 2 years ago

There are browser extensions to search captions on a single video, but I don't know of any better options, which is kind of ridiculous considering Google owns YouTube.

Here is one: https://addons.mozilla.org/en-US/firefox/addon/youtube-capti...

fuzztester 2 years ago

A related interesting problem would be: how to search by text keywords in the audio stream of the video.

I guess part of one solution approach could be to convert the speech to text. Could then grep it. But how would we correlate that back to the time positions in the video where those keywords occurred?
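One possible answer to the time-position question, sketched under assumptions: if the text (captions or speech-to-text output) is stored as WebVTT, each cue already carries its start time, so a match can be mapped back to an offset in the video. The helper names `find_keyword_times` and `watch_url` are hypothetical, and the video ID in the usage comment is the one from the yt-dlp example earlier in the thread.

```python
import re

# Matches a WebVTT cue timing line such as "00:01:02.500 --> 00:01:05.000".
# The hours field is optional in WebVTT, so "01:02.500 --> ..." also matches.
CUE_TIMING = re.compile(r"^(?:(\d+):)?(\d{2}):(\d{2})\.(\d{3})\s+-->")


def find_keyword_times(vtt_text, keyword):
    """Return (start_seconds, cue_line) for each cue line containing keyword."""
    needle = keyword.lower()
    start = None  # start time of the cue currently being read
    hits = []
    for line in vtt_text.splitlines():
        match = CUE_TIMING.match(line.strip())
        if match:
            hours, minutes, seconds, millis = match.groups()
            start = (int(hours or 0) * 3600 + int(minutes) * 60
                     + int(seconds) + int(millis) / 1000)
        elif start is not None and needle in line.lower():
            hits.append((start, line.strip()))
    return hits


def watch_url(video_id, start_seconds):
    """Build a link that opens the video at the given offset."""
    return f"https://www.youtube.com/watch?v={video_id}&t={int(start_seconds)}s"


# Example usage (hypothetical file name):
#   text = open("episode.en.vtt", encoding="utf-8").read()
#   for start, line in find_keyword_times(text, "rick lee"):
#       print(watch_url("Ye8mB6VsUHw", start), line)
```

Auto-generated captions often embed inline timing tags inside the cue text, so a keyword that straddles such a tag, or one split across two cues, would not be found by this simple line match.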

noman-land 2 years ago

You can try using yt-fts[0]. I've had great luck with it.

[0] https://github.com/NotJoeMartinez/yt-fts

calderknight 2 years ago

You can open the video transcript and ctrl+f to find the text you are looking for.

This at least would save you having to watch the videos.

lathiat 2 years ago

https://incaptions.com/

  • mvidal01 2 years ago

    Doesn't work very well, I'm afraid. I looked at the transcripts of some YouTube videos and then searched for a few words, and it didn't find any videos with the word. The word was "powell", FWIW.

Leftium 2 years ago

These might help:

- https://hw.leftium.com/#/item/34826944

- https://hw.leftium.com/#/item/36009774

rasz 2 years ago

Ask ChatGPT, I'm sure they slurped all YT transcripts for learning.
