Good questions! You're right the 200 test runs limit is too low. I want to increase that dramatically, although at the moment each test costs $.05 to run so I need to get the costs down to enable that.
The next update will fix that – and also answer your second question. What I've realized is we don't actually need to use the AI on every test run. We just need to use the AI on the initial run, then subsequent tests can just replay the playwright steps the AI chose.
And if a test fails, before throwing a failure we can re-run it with the AI assistance to see if the test truly failed from a user perspective.
I'm actively working on that this next couple weeks, and hopefully this addresses your concerns around unpredictability :)
Fun story. This reminds me of the summer my friends and I were all still around our small town — we used the Yo! app the same way, as a bat signal to meet up and get into nonsense. Someone would start Yo!ing and then once there was a critical mass of return Yo!s we'd switch to text/phone and link up.
> They show up. I've worked on privacy legislation at the state and local level. Barely anybody calls or writes in support.
This is by design. A lot of people talk about RTO in regards to business real estate but there's also the aspect of keeping people so busy and exhausted that they don't show up when it matters.
I see where you're coming from, but I like the way the story is formatted.
There's plenty to doom and gloom about with the state of tech right now, but this is a good reminder that sometimes even Big Internet tools, with a little ingenuity, can sometimes be repurposed to serve the users and not some corporation's bottom line.
It's kind of a throwback to the olden days where you might stand up an IRC server or something similar just for your friend group. I like seeing people returning to the small internet where it serves as a substrate for real people doing real things.
Those are two mechanisms by which variation occurs. This is not something Darwin got wrong, he just didn't have all the data. Genes were unknown in Darwin's time.
The machine just needs to be coded to run stuff (as shown in this very post). My coworkers can’t be coded to follow procedures and still submit PRs failing basic checks, sadly.
I’m working on canine.sh - an open source alternative to Heroku / render / fly.io / etc.
Desperately wanted something like this at my last company because it didn’t feel like the cost of PaaS should be 3-4x higher than underlying cloud costs.
The first Android phone I owned also had this feature, and it was super helpful.
It's a shame that our phones are becoming more and more voracious surveillance devices without the common courtesy of doing things that are helpful for the user.
The Discord of 2020 and before absolutely supported text channels, voice channels, multiple of the above, customizable notifications, and everything required here. Like OP and many others I found myself on a number of Discord servers during the pandemic for voice/video chat, gaming together, etc. and that app was the straightforward solution both for organizing and the actual event for groups of any size from 4-20.
Your "ideas" are just regurgitations of things you read off the internet; you have no coherent theory of "creativity" beyond some ineffable reference to the sanctity of the human soul.
I'm not sure the log message v structured message comparison makes sense - most log and trace events are batched and compressed before being transported, and so the size difference isn't really an issue. On the receiving side most services have some kind of column store as their main datastore and so there's no issue there either. The benefits of structured logging are worth it.
Regarding pricing - there is an option for every kind of usage out there, it's mostly a solved problem that comes with choosing two of large scale, low cost, low latency. For instance we serve large scale and low cost.
True in real life, too. Whenever someone random comes up to me and tries to engage in a conversation, my guard goes up because 9 times out of 10, they are trying to sell me something or scam me. I can’t imagine what it’s like to be a woman, where your “safety” alarms are also sounding.
Spontaneous, innocent chit chat is dead, both online and offline because everyone’s hustling now.
There's a difference between discovery (asking an MCP server what capabilities it has) and use (actually using a tool on the MCP server).
I think the comment you're replying to is talking about discovery rather than use; that is, offering a million tools to the model, not calling a million tools simultaneously.
Not being forced, but the peer pressure is getting pretty strong.
Claude Code i will admit i find occasionally useful, but the flood of overly verbose and lackingly meaningful "AI Summaries" I'm being forced to waste time reading is really grating on me. Copilot PR summaries turning a 20-line PR into a fifty-line unhelpful essay is driving me insane.
Yeah there are a couple that claim to be like this [0] one, and there are FDA standards to follow for that claim. I wouldn't use one on a cutting board or anything that gets scraped or cut on and you need to let it cure waaaay longer than normal but yeah there are options out there.
The underlying biochemical problem would show up long before hunter gatherers, the equivalent of a 2-3 drop in IQ across billions of years is massive pressure to find a solution.