Submitted by jplhughes t3_11prxd9 in MachineLearning
[removed]
Submitted by jplhughes t3_11prxd9 in MachineLearning
[removed]
Typical, basing your research on open source projects and then make a commercial product on top of other people's work. Great achievement.
Research pub or Gtfo
It’s fine, open source SOTA will make them irrelevant sooner rather than later
Is it open source?
No
Seems like there's a very generous free tier, then super cheap after that.
Why is this tagged [R]. This is a commercial project at best. Where's the paper, where's the code? Can we use it today on our PC like whisper? This really isn't 'research'.
Release the model. It wants to be free.
>25% improvement over Whisper
>Not open source
>doubt.jpeg
Yes, that's probably a cherry picking marketing only.
Wav2vec2 is still sota as long as this isn’t open source it’s kinda useless lmao
On which metric are you basing this on? I'm not deep in ASR but in the Whisper paper it is compared to word2vec 2.0 and whisper is better in most categories.
Excellent demo on your page, I just used it on a YT video featuring a non-native English speaker. There was only a slight error in punctuation due to an ambiguously long pause in the speech.
Is this a purely commercial product or will there be an open source release?
Pretty sure commercial product only. Speechmatics has never opensourced any of their models.
Seems super cheap to me tbf - no problem with paying for stuff like this.
what is the difference between $1.25/hr for Standard, $1.90/hr for Enhanced
$0.65?
Lmaoo
my guess is model size
Can confirm it is better than whisper, doesn't randomly go off the rails either but I don't wanna have to pay 😅
So is this post kind of a hidden advertisement or what?
Any ways to get the encoded speech features?
Does it support Ukrainian and Russian?
are there wer for other languages? Like in the github page for whisper? I want to compare the performance in other languages
[removed]
I tested it using Japanese and it seems like it misses punctuation for the most part. But, overall, seems to be doing a good job getting the words.
Removed after LOTS of reports. See rules #3 and #8 in the sidebar.
This is incredible
Bulky_Highlight_3352 t1_jc0bp3s wrote
"Hey, we made this commercial tool that is better than open source!"