Hacker Newsnew | past | comments | ask | show | jobs | submit | tomthe's commentslogin

The creator of UMAP and PyNNDescent tried to let Opus 4.5 translate PyNNDescent from Python to Rust

No, it is not as good as Veo, but better than Grok, I would say. Definitely better than what was available 2 years ago. And it is only a 7B research model!


If I (and only I) owned such a cable from Europe to the US, how much money could I make by buying cheap solar energy from the bright side and selling it to the dark side of the Atlantic?

First thought: 10 GW * $0.03/kWh 4 hours/day = $1.2Mio per day [0]

I am not sure about my assumptions...

[0]: https://www.wolframalpha.com/input?i=10+GW+*+%24+0.03%2FkWh+...


I suspect you might earn a lot more than $0.03/kWh on average.

The difference between typical market daytime and evening wholesale electricity prices is around $0.06/kWh in the UK right now: https://bmrs.elexon.co.uk/system-prices


Probably more like 8 hours: you can sell in both directions, to US before sunrise in US, to EU after sunset in EU.

How much you can charge probably also depends on storage, but it seems plausible (same magnitude as current transmission/distribution costs?) to my amateur understanding.


You are ignoring many variables, ranging from cable resistance losses and maintenance costs to signal re-synchronisation systems. Not to mention environmental factors such as seabed warming and subsequent changes in ocean currents, over time.


Yes, of course I do ignore a lot in that calculation. I just wanted to calculate the biggest possible usefulness of this cable. Especially the resistance losses could be quite disastrous.


Exploiting the pricing difference would probably diminish it?


To some degree, yes. I just looked it up and the EU produced ~2500GWh in 2023, which is around 280 GW on average.


I wonder if you could implement it with only static hosting?

We would need to split the index into a lot of smaller files that can be practically downloaded by browsers, maybe 20 MB each. The user types in a search query, the browser hashes the query and downloads the corresponding index file which contains only results for that hashed query. Then the browser sifts quickly through that file and gives you the result.

Hosting this would be cheap, but the main barriers remain..


I've done something similar with a static hosted site I'm working on. I opted to not reinvent the wheel, and just use WASM Sqlite in the browser. Sqlite already splits the database into fixed-size pages, so the driver using HTTP Range Requests can download only the required pages. Just have to make good indexes.

I can even use Sqlite's full-text search capabilities!


How would that scale to 10TB+ of plain text though? Presumably the indexes would be many gigabytes, especially with full text search.


The client only needs to get indexes for the specific search; if the index is just a list of TF-IDF term scores per document (which gets you a very reasonable start on search relevance) some extremely back-of-the-envelope math leads me to guess at an upper bound in the low tens of megabytes per (non-stopword) term, which seems doable for a client to download on demand.


I wonder if you could take this one step further and have opaque queries using homomorphic encryption on the index and then somehow extracting ranges around the document(s) you're interested in

Inspired by: "Show HN: Read Wikipedia privately using homomorphic encryption" https://news.ycombinator.com/item?id=31668814


Super interesting.


Very cool open release. Impressive that a 27b model can be as good as the much bigger state of the art models (according to their table of Chatbot Arena, tied with O1-preview and above Sonnet 3.7).

But the example image shows that this model still makes dumb errors or has a poor common sense although it read every information correctly.


It seems to have been very benchmark-tuned for LMArena. In my own experiments, it was roughly in line with other comparably sized models for factual knowledge (like Mistral Small 3), and worse than Mistral Small 3 and Phi-4 at STEM problems and logic. It's much worse than Llama 3.3 70b or Mistral Large 2411 in knowledge or intelligence in reality, even though LMArena ranks it as better than those.


Looking at every other benchmark, it's significantly behind typical big models from a year ago (Claude 3.0, Gemini 1.5, GPT 4.0). I think Google must have extensive LMArena-focused RLHF tuning for their models to juice their scores.


I was thinking the same thing about the receipt calculation: a warning that only tourists tip 18% in Switzerland would no doubt have been appreciated!


It is not sufficient to understand the content very well, you also have to understand the state of the mind of your pupils very well.


I think there are pretty strong traces of pro-china manipulation on Tiktok.

See https://www.nytimes.com/2024/04/24/briefing/tiktok-ban-bill-...

Content about something like Hong-Kong protests or the Tiananmen square are suppressed.


There isn't even a clear methodology explained (where/when/how where the accounts created, etc) on that NCRI "study".

Even the data is often self contradicting.

All we see is that there's much less political discourse on TikTok than ig.

If there's manipulation, the only thing that this study shows is that the manipulation goes towards avoiding politics in general.

Not only that, but the study itself may show the reverse to be true: that IG pushes some narratives more than others.


There is no evidence whatsoever presented in that paid for NCRI study.

In fact, the same result can be used to interpret as "strong traces of anti-China manipulation on American social medias".


Can you elaborate? I had no problem with DuckDB and a few TB of data. But it depends on what exactly you do with it, of course.


few hundred TBs here - no issues :)


Wait, you eat only beef? Or beef as the only source of meat?


Only beef, it's a variant of the carnivore diet that excludes common autoimmune triggers, like eggs & dairy.


  > Only beef, it's a variant of the carnivore diet that excludes common autoimmune triggers, like eggs & dairy.
what about chicken, or fish? are those considered auto-immune triggers?


There is a lot of low quality chicken, both fish and chicken have very little fat. And their fatty acid profile is not ideal.


Won't that give you gout?


No, there are people following a leaner version of the diet who have no such issues. I eat moderate protein though (80-100g).


anymore??


Sorry, I was thirsty.

There were no "big" rivers, ever. More like springs. We have lots of subterranean water, so out of the 18 rivers we have in the city, 16 have their sources here [0]. They were used to power mills in the 19-20th century during the industrialization. Many of the rivers that used to go through the city center flow underground.

I live close to the river Olechówka [1], which flows into a regulated reservoir that used to feed a mill - so the area is called Młynek, "Little Mill" :)

[0] https://podwodnalodz.blogspot.com/2013/09/o-wodzie-po-ktorej... [1] https://i.imgur.com/SIp8CxN.jpeg


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: