Submitted by AutoModerator t3_100mjlp in MachineLearning
blaher123 t1_j48jl4q wrote
does anyone have any experience using Youtube videos for text to speech/speech to text data?
I can get the subtitle data for videos, although they don't make it easy. While the subtitles themselves are accurate I also need accurate timestamps and the timestamps from Youtube (which seem to be designed for close captioning rather than accuracy) seem to be just inaccurate enough to make them not useful. Am I just doing things wrong and there is a way you guys use to get an accurate timed Youtube transcripts?
ephemeral_happiness_ t1_j49ehf4 wrote
Doesnt OpenAi have whispsr which can give you the text?
Viewing a single comment thread. View all comments