Yes, Google gemini flash and other models are reasonably fast but on-prem multimodal models will make these dramatically better. We are prepping for that future by being local-first including a desktop app.
Image analysis would be better. I just wanted to test it quickly with different models (Gemini, GPT 4o, etc.) using an AI testing framework I am building
Congratulations on launch! We’ve faced this problem with our autonomous web browsing agent https://www.donobu.com and ended up implementing a css overlay to wait for user input in certain cases.
Slack would be so much better. Excited to try humanlayer out.
very cool - come ping us in discord happy to help out - we did do a demo w/ dendrite/stagehand a few weeks back where the AI can pull you into a browserbase session OR just ping you in plaintext to get things like MFA codes etc
Google Voice still supports texting from their web page, but the feature to text directly from Gmail was discontinued a few years ago. It was a convenient feature, but now you have to use the Google Voice app or web page for texting.
Yes, the connectivity should still work, but it might be limited depending on the model and the network you're trying to connect to. Some older Kindle models may have issues with newer Wi-Fi standards. It's worth giving it a try, though! You might also want to check if there are any firmware updates available for your device.
I had a similar experience now. Never had to do reCaptcha on a payment form with my phone before. And then the payment was rejected. I am a normal US credit card payer. I ended up using PayPal.
I am Vasusen, an engineering manager at Coursera. We are transforming lives through learning. Our platform has reached over 30 million learners, 150+ university partners, and 2,700+ courses worldwide. We are rapidly expanding into offering high-quality degrees and just had our first batch graduate this year — https://goo.gl/tW4nUX.
reply