More

praccu · 2025-11-23T04:19:50 1763871590

This idea seems to be coming up in multiple places.

Here's one from Deepmind:

praccu · 2025-10-30T02:27:58 1761791278

Many years ago (13?), I was around when Amazon moved SABLE from RAM to SSDs. A whole rack came from a single batch, and something like 128 disks went out at once.

I was an intern but everyone seemed very stressed.

praccu · on Aug 1, 2024

https://www.nature.com/articles/s41586-024-07744-y https://research.google/blog/fast-accurate-climate-modeling-...

praccu · on June 26, 2024

You're very confused.

https://www.census.gov/newsroom/stories/poverty-awareness-mo...

missedthecue · on June 26, 2024

I'm not confused. It is not possible to have no dependents and be below the poverty line working 40 hours

praccu · on April 27, 2024

E Ink Todo list on my wall that pulls from Todoist, which I can update from my watch.

https://blog.praccu.com/

praccu · on April 27, 2024

I should update the post with details.

I ended up using DFRobot firebeetle because it respects sleep mode. I'm at 15 months on one charge with a 10Ah battery.

praccu · on Aug 18, 2022

Massachusetts?

praccu · on March 17, 2022

Not for sale, I'm hijacking your thread, but I wear the heck out of my Google+ slippers.

praccu · on Feb 15, 2022

Shameless self promotion: I wrote one of the more cited papers in the field [0], back in 2016.

A key challenge: very few labs have enough data.

Something I view as a key insight: a lot of labs are doing absurdly labor intensive exploratory synthesis without clear hypotheses guiding their work. One of our more useful tasks turned out to be interactively helping scientists refine their experiments before running them.

Another was helping scientists develop hypotheses for _why_ reactions were occuring, because they hadn't been able to build principled models that predicted which properties were predictive of reaction formation.

Going all the way to synthesis is nice, but there's a lot of lower hanging fruit involved in making scientists more effective.

[0] https://www.nature.com/articles/nature17439

entee · on Feb 15, 2022

This is true. Getting datasets with the necessary quality and scale for molecular ML is hard and uncommon. Experimental design is also a huge value add, especially given the enormous search space (estimates suggest there are more possible drug-like structures than there are stars in the universe). The challenge is figuring out how to do computational work in a tight marriage with the lab work to support and rapidly explore the hypotheses generated by the computational predictions. Getting compute and lab to mesh productively is hard. Teams and projects have to be designed to do so from the start to derive maximum benefit.

Also shameless plug: I started a company to do just that, anchored to generating custom million-to-billion point datasets and using ML to interpret and design new experiments at scale.

probably_wrong · on Feb 15, 2022

> A key challenge: very few labs have enough data.

It is also getting harder, not easier, to get.

I am working right now on a retro synthesis project. Our external data provider is raising prices while removing functionality, and no one bats an eye. At the same time our own data is considered a business secret and therefore impossible to share.

As someone who does NLP research where the code, data and papers are typically free, this drives me insane.

cinntaile · on Feb 15, 2022

Are you using NLP to guide what molecules are probably worthwhile to try and synthesize?

probably_wrong · on Feb 16, 2022

A bit. But my main project was to use NLP to identify failed reactions in old lab notebooks to use as negative training data.

czbond · on Feb 15, 2022

Question: How are labs doing the exploratory work without a clear hypothesis? Are they essentially doing some version of brute force?

hashimotonomora · on Feb 15, 2022

Experienced chemists can look at molecule diagrams and have an intuition as to its activity and similarity to other known molecules. It’s like most of science and math: most discoveries begin with intuition and are demonstrated rigorously afterwards. I believe Poincare said something to this end.

amelius · on Feb 15, 2022

Ok, so these experienced chemists can be replaced by AI now?

aemoron · on Feb 15, 2022

In the same way radiologist can be replaced by AI. So, no.

amelius · on Feb 15, 2022

Radiologists have a high responsibility of detecting the right things.

Chemists can just try out things.

I don't think you can compare the two.

aemoron · on Feb 15, 2022

I was implying that you still need a human to make the final decision. AI can be a valuable aid in both fields. Doctors can't just let the AI do all the work in the same way synthetic chemists can't blindly trust the AI to spit out correct and feasible results. Research time is expensive and thus the effort needs to be evaluated, and usually the intuition of said chemists trump that of the AI.

amelius · on Feb 15, 2022

True. But perhaps you can eliminate 9 out of 10 chemists, and replace them by an AI that generates ideas. Then use the 1 chemist to validate those ideas.

fuzzfactor · on Feb 16, 2022

And that's why I want to build me a robot.

Not to generate ideas, there's always more ideas than resources in chemistry.

Mainly to do more automated routines than ever.

9 out of 10 chemists aren't that great at the bench anyway.

Everyone would probably benefit from getting them in front of a computer full-time to leverage their training in a way, and freeing up the bench space to those who can really make the most of it.

fuzzfactor · on Feb 16, 2022

Not the focus of the article, but analytical chemists need to do a lot of proper detecting themselves to be high-performing just like the radiologists do.

kortex · on Feb 15, 2022

The brain is incredibly good at pattern matching while not necessarily being able to articulate why they came to that decision. Organic chemistry has these types of relations in spades. Say for example crystallization. You can kinda brute force it; there's only a few dozen realistic solvents to try, but that's a single solvent system. Then there's binary and ternary solvent systems. Then there's heat/cooling profiles, antisolvent addition, all kinds of things. Hundreds or thousands of possible experiments.

You might just decide that a compound "needs" isopropanol/acetone, plus a bit of water, cause something vaguely similar you encountered years ago crystallized well. You often start with some educated guesses and refine based on what you see.

But there's often no clear hypothesis, no single physical law the system obeys.

kilroy123 · on Feb 15, 2022

I'm trying to get a startup off the ground that tackles this.

Would love to chat more with you about this.

malux85 · on Feb 15, 2022

Me too, also tech nomad. I'll email you

formerly_proven · on Feb 15, 2022

> Something I view as a key insight: a lot of labs are doing absurdly labor intensive exploratory synthesis without clear hypotheses guiding their work.

This lets you stumble over unknown unknowns. Taylor et al discovered high-speed steel by ignoring the common wisdom and doing a huge number of trials, arriving at a material and treatment protocol that improved over the then-state-of-the-art tool steels by an order of magnitude or more. The treatment mechanism was only understood 50-60 years later.

praccu · on Sept 10, 2021

At Amazon I set up an evaluation approach based on whether the system completed the desired task (in that context it was "did the search result using the speech recognition return the same set of items to buy as the transcript.)

https://scholar.google.com/citations?view_op=view_citation&h...

dylanbfox · on Sept 10, 2021

Interesting. It seems like in the "real world" WER is not really the metric that matters, it's more about "is this ASR system performing well to solve my use case" - which is better measured through task-specific metrics like the one you outlined your paper.

6gvONxR4sf7o · on Sept 10, 2021

A pure ASR analog of this is how many/how much continuous utterances it enables. When I use tools like the one lunixbochs builds (including his own) the challenge as a user is trading of doing little bits at a time (slow, but easier to go back and correct) vs saying a whole ‘sentence’ in one go (fast and natural but you’re probably going to have to go back and edit/try again).

Sentence/command error rate (rate of 100% correct sentences/commands that don’t need any editing or re-attempting) is a decent proxy for this. It’s no silver bullet, but it more directly measures how frustrated your users will be.

If you really wanted to take care of the issues in the article, you could interview a bunch of users and find what percent of the, would go back and edit each kind of mistake (if 70% would have to go back and change ‘liked’ to ‘like’ then it’s 70% as bad as substituting ‘pound’ for ‘around’ which presumably every user will go back and edit).

The infuriating thing as a user is when metrics don’t map to the extra work I have to do.

lunixbochs · on Sept 10, 2021

> vs saying a whole ‘sentence’ in one go (fast and natural but you’re probably going to have to go back and edit/try again)

"probably going to have to go back and edit" is generally not the case with my Conformer model, which allows fast paced usage like this with practice: https://twitter.com/lunixbochs/status/1378159234861264896

6gvONxR4sf7o · on Sept 10, 2021

Unfortunately that was the model I had in mind when I wrote that. I used it for maybe a month (I'm pretty sure), and my experience just wasn't as good as yours. It may be better than what preceded it, but it still drove me crazy. I came away with the conclusion that ASR as a technology just isn't there yet.

(and the conclusion that I need to prevent the return of RSI at all costs from now on. Don't get me wrong, I'm very thankful that talon does as well as it does. It was a job saver.)

lunixbochs · on Sept 10, 2021

Are you referring to the test you mentioned in this thread? https://news.ycombinator.com/item?id=26784732

If so, December predates Conformer, so you're talking about the sconv model, which is the model I was complaining about upthread - it was very polarizing with users, and despite the theoretical WER improvements, the errors were much more catastrophic than the model that preceded it.

In either case, I'm constantly making improvements - I'm in the middle of a retrain that fixes some of the biggest issues (such as misrecognizing some short commands as numbers), and I've done a lot of other work recently that has really polished up the experience with the existing model.

6gvONxR4sf7o · on Sept 10, 2021

I totally forgot about that conversation! Yeah I must be referring to sconv then. I was thinking of the new custom-trained model you were releasing to your paid beta patreon subscribers, and confused the two.

As a side rant, it turned out that simply stepping away from work for a few weeks around the holidays nearly fixed my RSI, which makes me so sad about the nature of my career whenever it crops back up.

Btw, any chance you've done any work on the `phones` or related tooling? I remember that (and editing in general) being a pain point.

lunixbochs · on Sept 10, 2021

Yeah for sure, breaks are really important.

sconv was especially disappointing because it looked so good on metrics during my training, but the cracks really started to show once it entered user testing. Conformer has been so much less stressful in comparison because most user complaints are about near misses (or completely ambiguous speech where the output is not _wrong_ per se if you listen to the audio) rather than catastrophic failure.

There's another interesting emergent behavior with my user base as I make improvements, which is that as I release improved models allowing users to speak faster without mistakes, some users will speak even faster until there are mistakes again.

Edit: Yep! There have been several improvements on editing, though that's more in the user script domain and my work has still been mostly on the backing tech. I'm planning on working on "first party" user scripts in the future where that stuff is more polished too.

6gvONxR4sf7o · on Sept 10, 2021

> as I release improved models allowing users to speak faster without mistakes, users will speak even faster until there are mistakes again.

LOL. Users will be users! That's a hilarious case study, thanks for sharing.

> Yep! There have been several improvements on editing, though that's more in the user script domain and my work has still been mostly on the backing tech. I'm planning on working on "first party" user scripts in the future where that stuff is more polished too.

That would be wonderful! If you haven't seen them, I'd suggest looking at Serenade (also ASR) and Nebo (handwriting OCR on ipad) as interesting references for editing UI. They seem to have tight integration between the recognition and editing steps, letting errors be painless to fix by exposing alternative recognitions at the click of a button or short command. It lets them make x% precision@n as convenient as x% accuracy.

lunixbochs · on Sept 10, 2021

I would say not quite as convenient, because they lean on that UI to also make you constantly confirm top-1 commands that would've worked fine. As you can see in my Conformer demo video I can hit top-1 so reliably I don't even need to wait to look at the command before I start saying the next one.

praccu · on April 22, 2018

Shameless plug: I published almost the same thing in a very nearby field 2 years ago:

https://www.nature.com/articles/nature17439

JackFr · on April 22, 2018

Not shameless at all.

Your paper was referenced after all. [17]

silverlake · on April 22, 2018

That’s a terrific idea. Would this be applicable to drug discovery?

et2o · on April 23, 2018

Widely used in drug discovery