This uses SenseVoice under the hood, which claims to have better accuracy than W...

jmward01 · 2024-10-13T14:45:49 1728830749

This uses SenseVoice small under the hood. They claim their large model is better than Whisper large v3, not the small version. This small version is definitely worse than Whisper large v3 but still usable and the extra annotation it does is interesting.

khimaros · 2024-10-13T20:31:42 1728851502

this claims to have speaker diarization which is a potentially killer feature missing from most whisper implementations.

pferdone · 2024-10-13T11:04:38 1728817478

I mean they make a bold statement up top just to paddle back a little bit further down with: "[…] In terms of Chinese and Cantonese recognition, the SenseVoice-Small model has advantages."

It feels dishonest to me.

[0] https://github.com/FunAudioLLM/SenseVoice?tab=readme-ov-file...