Viewing a single comment thread. View all comments

blaher123 t1_j48jl4q wrote

does anyone have any experience using Youtube videos for text to speech/speech to text data?

I can get the subtitle data for videos, although they don't make it easy. While the subtitles themselves are accurate I also need accurate timestamps and the timestamps from Youtube (which seem to be designed for close captioning rather than accuracy) seem to be just inaccurate enough to make them not useful. Am I just doing things wrong and there is a way you guys use to get an accurate timed Youtube transcripts?

1