Aha, ok, I understand. Looking forward to a version supporting on-device TTS then.
Keep up the good work!
The idea and the direction it is heading is definitely awesome.
Here in UK, we have a service called Gousto [0] which delivers meal ingredients and instructions to homes. I would, naturally, like to be able to import those recipes into your app and have it assist me while cooking. Check it out!
The problem: 1. Cooks struggle with messy hands while trying to follow recipes. 2. Existing voice assistants can't navigate complex recipes or set context-aware timers. 3. Phones and printed recipes become unusable with greasy fingers.
Technical solution: 1. Voice-controlled recipe navigation with natural language understanding. 2. Dynamic timer setting based on recipe steps. 3. Cross-device sync using WebSocket connections. 4.Attempted with several speech-to-text techs and a custom NLU layer (Python/Next.js).
Key challenges: 1. Handling ambiguous voice commands in a noisy kitchen environment 2. Synchronizing updates when having multiple timers 3. Normalizing recipes from various formats (PDFs, websites, handwritten scans)
Current stack: 1. Supabase for real-time database 2. Custom voice pipeline (very low latency, check out this live demo video: https://youtu.be/XKxoHOIxyjA)
I'd appreciate feedback on: 1. Preferred recipe import formats 2. Commonly used voice command patterns while cooking 3. Additional timer management / task-performing features you'd find useful.