Hacker News new | past | comments | ask | show | jobs | submit login

Horses for courses, but the fading gimmick would not work for me at all.

However, this is conceptually interesting. It might be fun to speak the first draft of my next piece and transcribe the result with Whisper.




I experimenting with sth similar actually.

One small additional requirement: although I studied linguistics and took a year-long course in English phonology, speech-to-text still struggles with my accent.

The approach I'm playing with atm is inspired by some advice from Simon Willis, here on HN:

record audio → transcribe using whisper → clean up and format using a GPT prompt

So far the results have been pretty good: the original meaning is preserved but the text is much easier to read (and the missing/"misheard" words are often corrected).

What I'm experimenting at the moment:

- picking the right model size, tweaking the prompts

- better UX (e.g. immediate visual feedback)


I think the fading, if I paused to think about anything, would make me lose track of what I'd already written.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: