More

davedx · 2025-12-26T14:39:56 1766759996

I stumbled upon some great reddit posts this year with reading suggestions, and compiled my own "humanity is fucked" themed reading list, which included:

* Mercy of Gods by James S.A. Corey

* The Light Pirate by Lily Brooks-Dalton

* Oryx and Crake by Margaret Atwood

* Dawn by Octavia Butler

I then diverged from this list (I have more) to re-read (though it's not such a great divergence):

* If This Is a Man / The Truce by Primo Levi

Other books I enjoyed reading this year in no particular order:

* Tau Zero by Poul Anderson

* Machine Vendetta by Alastair Reynolds

* Elysium Fire by Alastair Reynolds

* Aurora Rising by Alastair Reynolds

* Shadow of the Silk Road by Colin Thubron (loved this)

* The Lord of the Rings (the god knows how many times re-read)

* The Centauri Device by M. John Harrison

* Future's Edge by Gareth Powell

* Blueshift by Joshua Dalzelle

* The Heart of a Continent by Francis Younghusband (I didn't quite manage to finish it, but it was a fascinating read nonetheless)

davedx · 2025-12-24T19:50:10 1766605810

There must be a vbulletin version too!

davedx · 2025-12-22T23:33:55 1766446435

I mean yeah, it's the same problem any new social media site needs to solve

davedx · 2025-12-18T21:06:53 1766092013

What is an encoder-decoder model, is it some kind of LLM, or a subcomponent of an LLM?

wood_spirit · 2025-12-18T21:23:31 1766093011

A decoder predicts the next word (token) to iteratively generate a whole sentence. An encoder masks a word in the middle of a sentence and tries to predict that middle.

The original transformer paper from google was encoder-decoder, but then encoder BERT was hot and then decoder GPT was hot; now encoder-decoder is hot again!

Decoders are good at generative tasks - chatbots etc.

Encoders are good at summaration.

Encoder decoders are better at summaration. It’s steps towards “understanding” (quotes needed).

nodja · 2025-12-18T21:28:13 1766093293

It's an alternate architecture of LLMs, they actually predate modern LLMs. An encoder-decoder model was actually the model used in the "Attention if all you need" paper that introduced the transformer and essentially gave birth to modern LLMs.

A encoder-decoder model splits input and output. This makes sense for translation tasks, summarization, etc. They're good when there's a clear separation of "understand the task" and "complete the task", but you can use it for anything really. A example would be send "Translate to english: Le chat est noir." to the encoder, the encoder processes everything in a single step, that is understand the task as a whole, then the output of the encoder is fed to the decoder and then the decoder runs one token at a time.

GPT ditches the encoder altogether and just runs the decoder with some slight changes, this makes it more parameter efficient but tends to hallucinate more due to past tokens containing information that might be wrong. You can see it as the encoder running on each token as they are read/generated.

Edit: On re-read I noticed it might not be clear what I mean by past tokens containing wrong information. I mean that for each token the model generates a hidden state, those states don't change, so for example an input of 100 tokens will have 100 hidden states, the states are generated at once on the encoder model, and one token at a time on the decoder models. Since the decoder doesn't have the full information yet, the hidden state will contain extra information that might not having anything to do with the task, or even confuse it.

For example if you give the model the task "Please translate this to chinese: Thanks for the cat, he's cute. I'm trying to send it to my friend in hong kong.". For a enc-dec model it would read the whole thing at once and understand that you mean cantonese. But a decoder only model would "read" it one token a time it could trip in several places, 1. assume chinese means mandarin chinese not cantonese, 2. assume that the text after "cute." it's something to also translate and not a clarification. This would have several token worth of extra information that would confuse the model. Models are trained with this in mind so they're used to tokens having lots of different meanings embeded in them, then having later tokens narrow down the meanings, but it might cause models to ignore certain tokens, or hallucinate.

subscribed · 2025-12-19T20:39:42 1766176782

Your last paragraph is amazing, and explains that so clearly, thank you!

canyon289 · 2025-12-18T21:19:46 1766092786

Hi, I'm not on the t5 Gemma team but work on gemma in general.

Encoder Decoder comes from the original transformers implementation way back in 2017. If you look at figure 1 you'll see what the first transformer ever looked like.

Since that time different implementations of transformers use either just the encoder portion, or the decoder portion, or both. Its a deep topic so hard to summarize here, but Gemini explains it really well! Hope this gets you started on some prompting to learn more

https://arxiv.org/pdf/1706.03762

wongarsu · 2025-12-18T21:17:56 1766092676

The announcement of the original T5Gemma goes in some more detail [1]. I'd describe it as two LLMs stacked on top of each other: the first understands the input, the second generates the output. "Encoder-decoder models often excel at summarization, translation, QA, and more due to their high inference efficiency, design flexibility, and richer encoder representation for understanding input"

1: https://developers.googleblog.com/en/t5gemma/

davedx · 2025-12-16T09:18:05 1765876685

Yeah that's just flat out wrong then: you can't use the solar array as a radiator.

jcattle · 2025-12-16T09:40:48 1765878048

Of course you can. You can use everything as a radiator. Unless you have something which is literally 0 Kelvin everything radiates.

See here for all the great ways of getting rid of thermal energy in space: https://www.nasa.gov/smallsat-institute/sst-soa/thermal-cont...

notahacker · 2025-12-16T10:27:43 1765880863

You can use everything as a radiator, but you can't use everything as a radiator sufficiently efficient to cool hot chips to safe operating temperature, particularly not if that thing is a thin panel intentionally oriented to capture the sun's rays to convert them to energy. Sure, you can absolutely build a radiator in the shade of the panels (it's the most logical place), but it's going to involve extra mass.

dsr_ · 2025-12-16T16:08:59 1765901339

You also want to orient those radiators at 90 degrees to the power panels, so that they don't send 50% of their radiation right back to the power panels.

oivey · 2025-12-16T15:18:07 1765898287

You can rivet people onto the outside of the ISS to radiate heat, too, but it may be detrimental to the overall system.

davedx · 2025-12-16T09:17:03 1765876623

I think the point is, yes, cooling is a significant engineering challenge in space; but having easy access to abundant energy (solar) and not needing to navigate difficult politically charged permitting processes makes it worthwhile. It's a big set of trade offs, and to only focus on "cooling being very hard in space" is kind of missing the point of why these companies want to do this.

Compute is severely power-constrained everywhere except China, and space based datacenters is a way to get around that.

TheOtherHobbes · 2025-12-16T13:15:52 1765890952

Of course you can build these things if you really want to.

But there is no universe in which it's possible to build them economically.

Not even close. The numbers are simply ridiculous.

And that's not even accounting for the fact that getting even one of these things into orbit is an absolutely huge R&D project that will take years - by which time technology and requirements will have moved on.

JoeAltmaier · 2025-12-16T13:19:37 1765891177

Lift costs dropping geometrically. Cost and weight of solar decreasing similarly. The trend makes space-based centers nearly inevitable.

Reminds me of "Those darn cars! Everybody knows that trains and horses are the way to travel."

Yizahi · 2025-12-16T14:24:39 1765895079

Lift costs are not quite dropping like that lately. Starship is not yet production ready (and you need to fully pack it with payloads, to achieve those numbers). What we saw is cutting off most of the artificial margins of the old launches and arriving to some economic equilibrium with sane margins. Regardless of the launch price the space based stuff will be much more expensive than planet based, the only question if it will be optimistically "only" x10 times more expensive, or pessimistically x100 times more expensive.

I don't get this "inevitable" conclusion. What is even a purpose of the space datacenter in the first place? What would justify paying an order of magnitude more than conventional competitors? Especially if the server in question in question is a dumb number cruncher like a stack of GPUs? I may understand putting some black NSA data up there or drug cartel accounting backup, but to multiply some LLM numbers you really have zero need of extraterritorial lawless DC. There is no business incentive for that.

lurquer · 2025-12-17T04:57:12 1765947432

> Reminds me of "Those darn cars! Everybody knows that trains and horses are the way to travel."… … said nobody ever.

JoeAltmaier · 2025-12-20T14:24:34 1766240674

You must be very young. This was well-known back in the day. Lots of articles (some even posted here some time back) of rant on cars, how they were ruining everything.

Btw The cute one-line slam doesn't really belong here. It's an empty comment, adds zero to the conversation, contributes nothing to the reader. It only makes a twelve year old feel a brief burst of endorphins. Please refrain.

panick21_ · 2025-12-17T22:22:56 1766010176

The idea that its faster and cheaper to launch solar panels then get local councils to approve them is insane. The fact is those Data Center operates simply don't want to do it and instead want politicians to tax people to build the power infrastructure for them.

davedx · 2025-12-15T07:27:43 1765783663

I'm working on https://techposts.eu - Hacker News for Europe.

Focused on all the interesting and exciting happenings in tech here, from AI to defence to deeptech, and posting the most interesting job openings too. Did you know Europe had two space launch startups? I didn't until I started this project!

Feedback very welcome :)

dreadnip · 2025-12-15T08:44:10 1765788250

Color scheme is a bit harsh for me. I understand you're going for EU colours, but maybe a softer background like #fcfcfc and a more muted blue would be easier on the eyes?

kwanbix · 2025-12-15T15:24:16 1765812256

I agree. And I will go a little bit further, why don't do it with a black background? So much white on most websites.

aswegs8 · 2025-12-15T16:13:36 1765815216

seconded

maciejzj · 2025-12-15T08:11:15 1765786275

Great idea, I'm keeping my fingers crossed for this initiative.

I believe that the main challenge would be to get more traction and build a community. Hope you find a way to encourage as many people as possible to join the website.

My very minor nitpick -- I would add some kind of background colour to the main post list, something like #FAFAFA looks fine to me.

davedx · 2025-12-15T08:42:41 1765788161

Thank you! Please consider signing up and occasionally posting something, it would help a lot.

I'll look at the background suggestion too, thanks!

potato-peeler · 2025-12-15T10:32:42 1765794762

What are the community guidelines? Is it okay to post personal projects similar to show hn?

davedx · 2025-12-15T10:46:39 1765795599

Yes, absolutely! The guidelines for now are basically "same as HN, but Euro-centric content please" :) I'll write these down somewhere explicitly soon.

dakoller · 2025-12-15T09:42:55 1765791775

Great idea, I'd appreciate an RSS feed

davedx · 2025-12-15T13:47:07 1765806427

Shouldn't be too hard to do, I'll look into it. Thank you!

benrutter · 2025-12-15T09:39:21 1765791561

Ooh I like this! I love Hacker news and Lobsters but they're both very US centric, seem great to have a European one.

UI is very nice and simple, one tiny bit of feedback is that a 'guidelines' page would be worthwhile, especially while it's new! I thought I'd post my own project on the site - sometimes that's a little bit of a no-no though, and I couldn't find any guidelines to steer me towards what types of things to share, etc.

Edit: Tiny extra feedback, is upvoting something immediately changes the rankings in the browser. It's pretty impressive speedwise, but especially if you're a couple pages in, you can bump something off of the page you're on which makes it a little weird to do something like 'upvote article and then check the comments'.

davedx · 2025-12-15T10:46:09 1765795569

Thanks for the feedback and posting, I appreciate it!

I'm definitely going through the comments I've had later and will take everything onboard. Guidelines is a great idea - for now it's basically "HN guidelines but Euro-centric content please" but I should definitely write that down.

hnben · 2025-12-16T13:41:11 1765892471

I like to browse HN via "/front" and "Go back day" and then look at the couple of top posts for each day. I don't see such a day-by-day view on TPE.

What is the "official" acronym? TPE? TP? TecPeu?

What is language policy? (e.g. it would be nice if people would post any language they want, and the system shows other users what language the link is, and then offers an alternative link to a translated version. I imagine this would be hard to implement in a way that is robust way, but maybe you when user submit a link, they can set the language themselves)

andrepd · 2025-12-15T10:31:00 1765794660

First post:

> Show TP: TreatyHopper - Pay less taxes

> Treaty shopping is a tax strategy where companies route profits through intermediary countries with favorable tax treaties to minimize overall tax liability.

Can't make this up x)

aswegs8 · 2025-12-15T16:14:15 1765815255

How would you like your double Irish with a Dutch sandwich, sir?

econ · 2025-12-15T13:54:58 1765806898

The comment page here on mobile

https://techposts.eu/post/68

It shifts out of the screen on the left cutting off the comments. (The problem is probably how you deal with the long url or not deal with it)

davedx · 2025-12-15T14:29:43 1765808983

Thanks, I'll fix that!

beka-tom · 2025-12-15T16:58:44 1765817924

I noticed that you’re using the favicon from Vercel, or that Vercel is using your icon. :))

thatsit · 2025-12-15T08:13:28 1765786408

Good work, but the headlines are still in „newspaper“ style and not in hacker news style

davedx · 2025-12-15T08:43:27 1765788207

You mean the automatic normalization HN does when you submit the title? Yeah, it's still quite basic compared to the real HN. I want to validate it properly before investing in lots of features :)

nfeutry · 2025-12-15T08:52:39 1765788759

Great initiative. I was confused by the comment section design. The style of the metadata is not distinct enough from the real comment. And it tooks me too long to understand that the responses to comments were not citations.

alexgotoi · 2025-12-15T12:33:12 1765801992

How do you get users to your site? I always felt these products are the hardest to build, but probably the most rewarding ones.

Nurbek-F · 2025-12-15T09:30:58 1765791058

If you could add API on top of it, and make it compatible with HN clients, it would be very nice!

davedx · 2025-12-15T10:47:49 1765795669

Interesting idea! I was kind of playing with the idea of doing something CrunchBase-like for the companies, jobs and funding rounds. But there's a lot of data out there publicaly too so I'm not sure if it's worth it. Will have a look at the HN clients too, thanks for the idea!

Venkatesh10 · 2025-12-15T13:29:17 1765805357

Love to see EU specific. How do you get the jobs posts?

davedx · 2025-12-15T13:46:50 1765806410

Thanks!

I get the job posts the hard way, from scouring about a dozen different sources, including my own shortlist of "interesting companies".

neon_me · 2025-12-15T12:22:38 1765801358

great! singuped! Just please - get rid of that all blue and underlined links. Its hell to read.

davedx · 2025-12-15T12:24:55 1765801495

Ha, thanks for the feedback! People have made a few points about the styling, it definitely needs a harder look. Maybe a silly question but which do you find worse, the blue color or the underlines?

econ · 2025-12-15T14:00:02 1765807202

Install some custom style css extensions and look at all the HN variations. I like the solarized one.

gsky · 2025-12-15T13:07:02 1765804022

Site needs better UI without a doubt.

Copy HN UI as its. no one cares.

Good luck

luplex · 2025-12-15T14:43:23 1765809803

very good start! I hate to be that guy, but I'd like if you had an imprint and privacy policy on the site ;)

davedx · 2025-12-15T14:52:06 1765810326

Good feedback! I'll definitely be filling that kind of content in as I go.

uscnehn · 2025-12-17T08:55:57 1765961757

Hi! Are you looking for a collaborator? I had a list of European companies divided by sector, that follow GDPR rules, with 1.2k stars on GitHub, currently deleted because I wanted to create a website, where people can search also for jobs and projects proposed by those companies, we can make a section of your projects related to it, let me know, please. I really love your idea!

davedx · 2025-12-12T11:46:42 1765540002

Seemed pretty realistic to me!

georgefrowny · 2025-12-12T13:16:06 1765545366

One exception: with the stamp cost rise, I think this might be the year even the staunchest card senders may be reconsidering!

I remember my mum sending out 20 or 30 cards all with first class stamps. I don't see many millennials and down doing it. "Not in this economy"!

davedx · 2025-12-12T10:50:19 1765536619

> currently there is no way to shoot drones in EU due to variety of bullshit legal reasons

Citation?

littlecranky67 · 2025-12-12T10:51:53 1765536713

Literally any news media in the EU in the last 4 weeks. Drone sightings everywhere, action equals none.

lnsru · 2025-12-12T10:52:00 1765536720

Statistics! Nobody took any action during hundreds of drone incidents in many countries.

blitzar · 2025-12-12T10:54:27 1765536867

bullshit citation reasons

davedx · 2025-12-11T08:49:16 1765442956

> Instead, I'd love for Google to understand me well enough to show me which restaurants I would disproportionately love compared to other people based on its understanding of my taste profiles.

I mean... this sounds like the perfect use case for a third party app like "My taste restaurant finder"? There are undoubtedly apps out there like this.

I don't think Google Maps (a general purpose maps app) should try to be everything for everyone. It's good enough for what it is.