PeterCorless · 2025-04-26T20:11:20 1745698280

Can you "chunkify" output so that you can rate different elements independently? Like "This part of the answer is totally cool" and "Wait. This part right here includes a hallucination."

Then allow for feedback to be provided if an issue is spotted.

PeterCorless · 2025-04-25T21:34:52 1745616892

Great work Silas! Can this also be trained on externalized sites, such as human-written docs, our web site, public-facing Google docs, etc?

silasalberti · 2025-04-25T22:59:50 1745621990

Currently it uses only the codebase itself. However, we've been thinking about adding other sources like docs. Can't promise any timelines yet.

PeterCorless · 2025-04-26T16:25:09 1745684709

Another idea: allow more "chunkified" approvals, and verbatim feedback:

"This part is correct, but this next paragraph? You're hallucinating / misinterpreting. What you really need to say is 'x'."

Also, I suggested that feedback also get scored in some way. For instance, what if someone is sending malicious feedback?

How do you discourage or de-weight someone downvoting good answers while upvoting wrong answers?
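One way to picture the per-chunk ratings and the de-weighting question above is the rough sketch below. It is not how DeepWiki or Devin handles feedback; the `Vote` record, the `trust` table, the starting trust of 0.5, and the 0.2 step are all assumptions made up for illustration. The idea is simply that each vote targets one chunk, each rater's vote counts in proportion to a trust score, and trust moves up or down when a vote agrees or disagrees with a chunk that someone has independently verified.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Vote:
    rater: str      # hypothetical rater id
    chunk_id: int   # which chunk of the answer is being rated
    value: int      # +1 = "this part is correct", -1 = "this part is wrong"
    note: str = ""  # optional verbatim correction from the rater


def chunk_scores(votes, trust, default_trust=0.5):
    """Return a trust-weighted score in [-1, 1] for each rated chunk."""
    weighted = defaultdict(float)
    total = defaultdict(float)
    for v in votes:
        w = trust.get(v.rater, default_trust)
        weighted[v.chunk_id] += w * v.value
        total[v.chunk_id] += w
    # Chunks whose raters all have zero trust are left unscored.
    return {c: weighted[c] / total[c] for c in weighted if total[c] > 0}


def update_trust(votes, trust, verified, step=0.2, default_trust=0.5):
    """Nudge rater trust by agreement with independently verified chunks.

    `verified` maps chunk_id -> +1/-1 for chunks that have been checked
    (an assumption: some maintainer or reviewer provides these labels).
    """
    new_trust = dict(trust)
    for v in votes:
        if v.chunk_id not in verified:
            continue
        current = new_trust.get(v.rater, default_trust)
        delta = step if v.value == verified[v.chunk_id] else -step
        new_trust[v.rater] = min(1.0, max(0.0, current + delta))
    return new_trust


# Chunk 0 is actually correct; chunk 1 contains a hallucination.
votes = [
    Vote("alice", 0, +1),
    Vote("alice", 1, -1, "What you really need to say is 'x'."),
    Vote("mallory", 0, -1),  # downvotes the good chunk
    Vote("mallory", 1, +1),  # upvotes the wrong chunk
]
trust = update_trust(votes, {}, verified={0: +1})
scores = chunk_scores(votes, trust)
```

With equal trust the two raters would cancel out on both chunks. After the single verified chunk, mallory's votes count for less, so chunk 0 ends up with a positive score and chunk 1 with a negative one. A real system would need far more than this (sybil resistance, decay, appeal paths), so treat it as a conversation starter only.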

PeterCorless · 2025-04-25T21:26:11 1745616371

I have tried this so far for the following repos of interest:

• Apache Pinot - https://deepwiki.com/apache/pinot

• Apache Pulsar - https://deepwiki.com/apache/pulsar

• Apache Superset - https://deepwiki.com/apache/superset

• Grafana - https://deepwiki.com/grafana/grafana

• K8sGPT - https://deepwiki.com/k8sgpt-ai/k8sgpt

Seems, at least at a cursory glance, to produce decent output.

PeterCorless · 2025-04-25T21:33:36 1745616816

I also asked Devin a question:

Can I use a star-tree index with upserts?

I was pleasantly surprised by the answer:

https://deepwiki.com/search/can-i-use-a-startree-index-wit_a...

Briefly: "No, you cannot use a star-tree index with upserts in Apache Pinot. This is a built-in restriction that is explicitly enforced in the system."

Why This Restriction Exists

When upserts are enabled for a table in Pinot, the system explicitly checks and prevents the use of star-tree indexes. This validation happens during table configuration validation.

The code in TableConfigUtils.java explicitly enforces this restriction:

[snip]

At a cursory glance, I did not detect a hallucination. The answer was true (AFAIK), clear, and objective. I also see that you can peek into other resources to get some additional contextual information.

Very impressive work.

PeterCorless · 2025-04-17T15:01:56 1744902116

My favorite part is how incompetent they were in handling the redaction:

"But when the Kentucky AG’s office was preparing to post their brief against TikTok, whoever was in charge of doing the redaction simply covered the relevant text with black rectangles. Even though you can’t see the text while reading the PDF, you can just use your cursor to select each black section, copy it, and then paste it into another file to read the hidden text. It is great fun to do this — try it yourself! Or just read our version of the brief in which we have done this for you."

fny · 2025-04-17T15:52:35 1744905155

I'd venture to guess this was deliberate. What would you do if you want to convince the public but can't technically share the evidence?

PeterCorless · 2025-04-17T18:59:30 1744916370

Hanlon's razor: "never attribute to malice that which can be adequately explained by neglect, ignorance or incompetence."

https://modelthinkers.com/mental-model/hanlons-razor

PeterCorless · 2025-03-13T17:45:51 1741887951

The picture was clearly based on El Castillo [Temple of Kukulcán, in Chichén Itzá], which is on the Yucatan Peninsula. Note the chamber on top.

It looks nothing like the Aztec Pyramid of Teotihuacan, which is flat on top and has no structure.

In other words, this is AI slop that makes something that looks plausible, and is utterly misleading.

As someone who has spent decades of my life on history, this makes me weep for humanity.

wswope · 2025-03-13T18:14:12 1741889652

TIL the Nika riots took place in the Roman colosseum, and the blues and greens cheered as lone individuals from each deme were sent down for slaughter. Yeah, this is hot garbage in terms of accuracy.

@Samplank2 - this may hurt to hear, but your assertions that better models and pipeline improvements will solve this are pure cope. What you really need to do here is manually curate and tune the prompts, then cherry-pick with a fine eye for detail. There’s no substitute for actual effort and knowledge, but you seem disinterested in that part.

PeterCorless · 2025-03-13T22:17:17 1741904237

It's very cool, so long as you don't care it's eye-wateringly wrong.

PeterCorless · 2025-03-03T19:11:31 1741029091

Yeah. This is going to be bad. This retraction is driven by the stark drop-off in consumer sentiment. Intelligent billionaires like Buffett, etc., take that into consideration. You know, all that big-brained macroeconomy stuff, like IS-LM curves. Grown-up stuff of real economists.

Too-cool-for-school sociopathic Pepe avatar tech dudebros don't know how the actual economy works, and worse, don't care.

World Beware.

PeterCorless · 2025-02-27T20:42:32 1740688952

I didn't even read the article, but I love the comments on the thread.

Yes. The implementation language of a system should not matter to people in the least. However, languages are used as a form of prestige by developers and, sometimes, as a consumer warning label by practitioners.

"Ugh. This was written in <language-I-hate>."

"Ooo! This was written in <language-I-love>!"

jchw · 2025-02-28T01:11:27 1740705087

There's certainly some aspect of that going on, but I think mainly it's just notable when you write something in a programming language that is relatively new.

Does it matter? In theory no, since you can write pretty much anything in pretty much any language. In practice... It's not quite that black and white. Some programming languages have better tooling than others; like, if a project is written in pure Go, it's going to be a shitload easier to cross compile than a C++ project in most cases. A memory-safe programming language like Go or Rust will tell you about the likely characteristics of the program: the bugs are not likely to be memory or stack corruption bugs since most of the code can't really do that. A GC'd language like Go or Java will tell you that the program will not be ideal for very low latency requirements, most likely. Some languages, like Python, are languages that many would consider easy to hack on, but on the other hand a program written in Python probably doesn't have the best performance characteristics, because CPython is not the fastest interpreter. The discipline that is encouraged by some software ecosystems will also play a role in the quality of software; let's be honest, everyone knows that you CAN write quality software in PHP, but the fact that it isn't easy certainly says something. There's nothing wrong with Erlang but you may need to learn about deploying BEAM in production before actually using Erlang software, since it has its own unique quirks.

And this is all predicated on the idea that nobody ever introduces a project as being "written in C." While it's definitely less common, you definitely do see projects that do this. Generally the programming language is more of a focus for projects that are earlier in their life and not as refined as finished products. I think one reason why it was less common in the past is because writing that something is written in C would just be weird. Of course it's written in C, why would anyone assume otherwise? It would be a lot more notable, at that point, if it wasn't.

I get why people look at this in a cynical way but I think the cynical outlook is only part of the story. In actuality, you do get some useful information sometimes out of knowing what language something is written in.

PeterCorless · 2025-03-05T18:01:14 1741197674

I do know of a shop where an OSS database written in Java was chosen over one written in C++ because of the ability of the internal team to read the code, modify it, troubleshoot it, etc. That makes sense. The choice was driven by pragmatics, namely maintainability. Not simply bias, or aesthetics or "rule of cool."

PeterCorless · 2025-02-27T20:00:09 1740686409

World beware.

PeterCorless · 2025-02-12T22:48:22 1739400502

"Why do we want better artificial intelligence when we have all this raw human stupidity as an abundant renewable resource we haven't yet harnessed?"

PeterCorless · 2025-01-10T23:41:23 1736552483

This is worthy of a bookmark. (If HN supported bookmarks.)

mdaniel · 2025-01-11T03:37:53 1736566673

https://news.ycombinator.com/item?id=42658095#:~:text=favori... and then https://news.ycombinator.com/favorites?id=PeterCorless&comme...

PeterCorless · 2025-01-15T00:05:13 1736899513

Thanks! TIL.

pdimitar · 2025-01-11T19:24:19 1736623459

You can favorite comments. (And posts.)