
There is more to this quote than you might think.

Grammatically, in English the verb "swim" requires an "animate subject", i.e. a living being, like a human or an animal. So the question of whether a submarine can swim is about grammar. In Russian (IIRC), submarines can swim just fine, because the verb does not have this animacy requirement. Crucially, the question is not about whether or how a submarine propels itself.

Likewise, in English at least, the verb "think" requires an animate subject. The question of whether a machine can think is therefore about whether you consider it to be alive. Again, whether or how the machine generates its output is not material to the question.


I don't think the distinction is animate/inanimate.

Submarines sail because they are nautical vessels. Wind-up bathtub swimmers swim because they look like they are swimming.

Neither is an animate object.

In a browser, if you click a button and it takes a while to load, your phone is "thinking".


FreeType was written when fonts were local, trusted resources, and it was written in low-level C to be fast. The TrueType/OpenType format is also designed for fast access, e.g. with internal pointers, which makes validation a pain.

So although FreeType is carefully written with respect to correctness, it was not meant to deal with malicious input, and that robustness is hard to add afterwards.


TrueType also just has way too much complexity accumulated in it. The character-to-glyph mapping table alone has nine different encoding formats. I was only writing a TTF file instead of reading one, and the complexity was still impressive.
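To give a flavour of that, here is a minimal sketch (the "font.ttf" path and function name are placeholders, not anything from a real tool) that walks the sfnt table directory with Python's struct module and lists which encoding formats the cmap subtables use:

    import struct

    # Sketch: list the cmap subtable formats in a TrueType/OpenType file.
    # "font.ttf" is a placeholder path; all offsets are taken on trust here.
    def cmap_formats(path):
        data = open(path, "rb").read()
        # sfnt header: version (4 bytes), numTables, searchRange, entrySelector, rangeShift
        num_tables = struct.unpack_from(">H", data, 4)[0]
        cmap_off = None
        for i in range(num_tables):
            # table record: tag (4 bytes), checksum, offset, length
            tag, _, off, _ = struct.unpack_from(">4sIII", data, 12 + 16 * i)
            if tag == b"cmap":
                cmap_off = off
                break
        if cmap_off is None:
            return []
        # cmap header: version, numTables, then (platformID, encodingID, offset) records
        n = struct.unpack_from(">H", data, cmap_off + 2)[0]
        formats = []
        for i in range(n):
            _, _, sub_off = struct.unpack_from(">HHI", data, cmap_off + 4 + 8 * i)
            # each subtable begins with its format number (0, 2, 4, 6, 8, 10, 12, 13 or 14)
            formats.append(struct.unpack_from(">H", data, cmap_off + sub_off)[0])
        return formats

    print(cmap_formats("font.ttf"))

Note that this sketch trusts every offset it reads; a real parser has to validate each one against the file length, which is exactly where the trouble starts.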


If you think FreeType is bad, wait until you find out win32k.sys used to parse TrueType fonts directly in the kernel. https://msrc.microsoft.com/blog/2009/11/font-directory-entry... (That’s just one of a million vulnerabilities thanks to kernel-mode font parsing.)


That makes it sound like it’s a choice, which it isn’t really. The way to look at it is from a probabilistic perspective: with the fix, you maximise the probability of the data. Without the fix, you fairly arbitrarily raise some probabilities to a power greater than one, and some to a power less than one.


Yes, exactly: mathematically it was incorrect to begin with.


Isn’t the main problem the cost of heating? In the UK, at least, a kWh of electricity costs four times what a kWh of gas costs. Even if a heat pump is twice as efficient as gas heating, the cost of heating is still twice as high.
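A back-of-the-envelope sketch of that arithmetic (the prices and efficiencies below are made-up placeholders, not actual tariffs):

    # Toy numbers only: illustrative prices and efficiencies, not real tariffs.
    gas_price = 0.07                       # GBP per kWh of gas (assumed)
    elec_price = 4 * gas_price             # electricity at ~4x the gas price, as above

    boiler_efficiency = 0.9                # kWh of heat per kWh of gas (assumed)
    heat_pump_cop = 2 * boiler_efficiency  # "twice as efficient" as the boiler

    cost_gas = gas_price / boiler_efficiency   # cost per kWh of delivered heat
    cost_hp = elec_price / heat_pump_cop

    print(cost_gas, cost_hp)               # heat-pump heat comes out twice as expensive

The heat pump only wins once its efficiency advantage exceeds the electricity-to-gas price ratio.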


They could incentivize it by subsidizing a heat-pump usage rate. There’s technology to identify these remotely, I imagine.


I agree except for (6). A language model assigns probabilities to sequences. The model needs normalised distributions, e.g. using a softmax, so that’s the right way of thinking about it.
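A minimal sketch of what that normalisation looks like (the logits here are made-up numbers, not from any particular model):

    import numpy as np

    # Turn next-token scores (logits) into a normalised distribution.
    def softmax(logits):
        z = logits - np.max(logits)   # subtract the max for numerical stability
        e = np.exp(z)
        return e / e.sum()

    logits = np.array([2.0, 1.0, 0.1])  # made-up scores for three candidate tokens
    probs = softmax(logits)
    print(probs, probs.sum())           # sums to 1, so sequence probabilities multiply sensibly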


This is true in general, but not in the use case they presented. If they had explained why a normalized distribution is useful it would have made sense, but they just describe it as a pick-the-top-answer next-word predictor, which makes the softmax superfluous.
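Indeed, if you only ever take the top answer, the softmax makes no difference to the choice (a toy check, reusing the made-up logits from above):

    import numpy as np

    logits = np.array([2.0, 1.0, 0.1])
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    # softmax is monotone, so it preserves the argmax
    assert np.argmax(logits) == np.argmax(probs)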


In English everywhere except the US, it’s “horse riding”; “horseback riding” is US English.

Definitely humorous: https://m.youtube.com/watch?v=5wSw3IWRJa0


I know it originates in the US (where I'm not from, which is perhaps why it grates), but like so many things it does seem to be catching on.


Training a speaker-specific recogniser that improves on a generic recogniser requires a lot more data nowadays. First, generic systems are much better and trained on far more data than they used to be. Second, speaker adaptation worked better for the Gaussian mixture models of the late nineties (I don’t know about the eighties) than it does for neural networks.



This is how healthcare is set up in Switzerland and the Netherlands.

How payments are routed (through tax or otherwise) is just an implementation detail. In a system that involves market forces, the key thing is that if everybody is insured, including the healthy, the cost per person comes down.


The "model" in the title is a model of the world, i.e. a probabilistic model. The good thing about such a model is that it explicitly states your beliefs about the world. Once you've defined it, reasoning about it is, in theory, straightforward. (In practice a lot of papers get written about how to do approximate inference.) It's also straightforward to do unsupervised learning.

This is a different perspective from (most uses of) neural networks, which do not have this clear separation between the model and how to reason about it. It's funny that Chris Bishop in 1995 wrote the textbook "Neural Networks for Pattern Recognition" and now is effectively arguing against using neural networks.

You can use both by using neural networks as "factors" (the black squares in a factor graph) in probabilistic models.
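A toy illustration of that "state your beliefs, then reason mechanically" separation (a coin-flip model I'm making up here, not anything from the article):

    # Toy probabilistic model: a coin with unknown bias theta.
    # Beliefs are stated explicitly: theta ~ Beta(a, b), each flip ~ Bernoulli(theta).
    # Because the model is written down, inference is mechanical:
    # the posterior is Beta(a + heads, b + tails), with no training loop involved.
    a, b = 1.0, 1.0             # uniform prior belief over the bias
    flips = [1, 1, 0, 1, 0, 1]  # observed data (1 = heads)

    heads = sum(flips)
    tails = len(flips) - heads
    post_a, post_b = a + heads, b + tails

    posterior_mean = post_a / (post_a + post_b)
    print((post_a, post_b), posterior_mean)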


It's funny that Chris Bishop in 1995 wrote the textbook "Neural Networks for Pattern Recognition" and now is effectively arguing against using neural networks.

I haven't read "Neural Networks for Pattern Recognition", but his "Pattern Recognition and Machine Learning" [1] is the standard text for ML work, including Bayesian approaches.

I don't think one should view this as "arguing against" neural networks - it's more that Bayesian approaches give you something different.

[1] http://www.springer.com/gp/book/9780387310732


One of the most popular ways of using techniques like this is the "Variational Autoencoder". I've been working on using some alternate distributions with them as of late - it's very interesting, and quite powerful.


How does this work? You use the VAE to model variables and then somehow get the distribution from them?

Got a link? (I know the basics of VAEs, but I'm missing how to link them to this)


The VAE encoder models (an approximation to) the posterior p(z|x), and the decoder models the likelihood p(x|z).

I like these slides: https://home.zhaw.ch/~dueo/bbs/files/vae.pdf
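For concreteness, a minimal sketch of that setup in PyTorch (the architecture, dimensions, and class name are all made up for illustration; this is not from the slides):

    import torch
    import torch.nn as nn

    # The encoder parameterises a Gaussian q(z|x); the decoder parameterises p(x|z).
    class TinyVAE(nn.Module):
        def __init__(self, x_dim=784, z_dim=8, h_dim=128):
            super().__init__()
            self.enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
            self.mu = nn.Linear(h_dim, z_dim)        # mean of q(z|x)
            self.logvar = nn.Linear(h_dim, z_dim)    # log-variance of q(z|x)
            self.dec = nn.Sequential(nn.Linear(z_dim, h_dim), nn.ReLU(),
                                     nn.Linear(h_dim, x_dim), nn.Sigmoid())

        def forward(self, x):
            h = self.enc(x)
            mu, logvar = self.mu(h), self.logvar(h)
            # reparameterisation trick: sample z from q(z|x)
            z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
            x_recon = self.dec(z)                    # parameters of p(x|z)
            # negative ELBO = reconstruction loss + KL(q(z|x) || N(0, I))
            recon = nn.functional.binary_cross_entropy(x_recon, x, reduction="sum")
            kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
            return recon + kl

    x = torch.rand(16, 784)   # a fake batch standing in for real data
    loss = TinyVAE()(x)
    loss.backward()

(The "alternate distributions" mentioned upthread would presumably replace the Gaussian and Bernoulli choices here.)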

