Hacker News new | past | comments | ask | show | jobs | submit | ncomments login

congrats on the launch win! thrilled to see your product launch after following your journey from the start.

Very nice! I'm personally looking into bot account detection for my own service and have come up with very similar heuristics (albeit simpler ones since I'm doing this at scale) so I will provide some additional ones that I have discovered:

1. Fork to stars ratio. I've noticed that several of the "bot" repos have the same number of forks as stars (or rather, most ratios are above 0.5). Typically a project doesn't have nearly as many forks as stars.

2. Fake repo owners clone real projects and push them directly to their account (not fork) and impersonate the real project to try and make their account look real.

Example bot account with both strategies employed: https://github.com/algariis


Hey HN,

I recently built File Decomposer — a tool that takes files like PDFs, DOCX, EPUB, HTML, etc., and converts them into structured Markdown or JSON.

The main goal is to help devs, AI engineers, and indie hackers who deal with unstructured documents and want clean, usable data to:

feed into LLMs (for RAG/chatbots)

create searchable knowledge bases

automate workflows

or just stop wasting time copying/pasting from PDFs

It handles:

Large files (multi-hundred-page PDFs, technical docs, books)

Structure preservation (headings, lists, sections)

JSON formatting that’s easy to parse or feed into your pipelines

Built this after wasting too much time cleaning up documents to get usable data into my AI projects. Happy to answer questions, and very open to feedback or edge cases you'd like supported.

You can try it here: https://filedecomposer.com/

Would love to hear how others are tackling this problem, or if there are ways I can make this tool more useful for your workflows.


My mother and I recently did an oral glucose tolerance test while wearing CGMs and discovered that in us, the CGM measurements (Dexcom G7) and the reference instrument measurements (whatever Quest Diagnostics uses) virtually _never_ lined up, even when including the G7's advertised error margins. The blood glucometer readings in me didn't line up either. Admittedly this was on the first day using the G7, when it's apparently less accurate, but due to this I've been doubtful of CGM readings in me/her. I'm sure CGMs work on a population level, but for us, on that day? Nope.

My graph: https://i.imgur.com/FzPdH1g.png

Mom's graph: https://i.imgur.com/5DR1G30.png

Discussion: https://reddit.com/r/PeterAttia/comments/1k301o4/my_ogtt_exp...


I'm a native, but I was talking with someone a little while back who echoed that sentiment. For him, it was all about avoiding degradation and feeling like he could sound "like myself" in English.

>price control strategy to pit these suppliers against each other to lower costs

Apparently that rabbit hole goes super deep in which the large auto manufacturers in the US throw their weight around and force suppliers into selling parts at cost or with razor thin profit margins. And on top of that, they force the suppliers to eat the loss when it comes to cyclical business demand (e.g. storage costs for over-producing during low demand and increased labor costs during times to under producing from high demand)


It's worth noting that companies that are too big to fail (as I assume credit bureaus are considered) are great places to park money.

100% fair; I was referring to Spain and Portugal FYI, but I've heard very good things about Italy as well in this regard.

TBH, Uber wouldn't be so much of a problem with decent competition. But the critical mass required for a transit app is quite difficult to establish, and Lyft offers essentially the same deal with a slightly better margin for drivers. It's a de-facto duopoly here in the US where the two companies are incentivized to drive prices as high as the market will sustain, and the local apps (for instance we have one here in DC that is a co-op) are often legislated out of existence.

> Taxis in Shanghai have been largely replaced by app-related taxis.

The APP itself isn't the issue, it's the pricing. But it sounds like China has a much healthier market than we have.

> 1. It's now possible to get a taxi when it's raining.

Why would this make so much of a difference? I'm very curious.


Hm, fair enough. Oddly Dulles is my local airport and I wasn't able to find any taxis the last time I flew in. Clearly I'm looking in the wrong place.

Happened with Firefox. Mozilla now has unlimited rights to use and publish, anything you see, type or upload in Firefox. And it is still marketed as "privacy" friendly. Even Micro$ft has better license in their browser!

Somewhat downplaying it. Charles is easily the most popular tool for reverse engineering client-server communications in mobile apps.

Certificate pinning frustrates Charles by hampering MITM attempts. It can be difficult to extract/replace pinned certificates from the latest versions of Android/iOS apps. Often you can extract them from older versions using specialized tools, if old-enough versions exist and those certificates are still valid for API endpoints of interest.


'propaganda'?

First of, i do know his history very well, i'm quite aware that he was not a saint before but that doesn't change the fact what he is currently doing and it is very good.

Do you have anything real? Like real talking points? Real sources? aything besides just shitting at him?


OAuth exists and can be used to confirm someone's identity by linking their Google account.

Thank you

Interesting approach, but how well does this actually work?

Full-Stack Developer (React/Node) – Help Build a Stealth E-Com Platform (Remote US, Equity+Cash)

We're building a new rich media platform that’s currently in stealth mode. Looking for a co-founder with a track record of building successful E-Com or Social platforms. You’re teaming up with world class talent, to deliver on a proven model that’s already created revenue.

Do you have an interest in culture, or storytelling? Do you cruise social for the next big thing? Do you have a die hard fan mentality? If so, you’re a perfect fit

What You’ll Do - Build and own features across both front-end and back-end - Knowledge of AI Integration is a plus - Architect the foundation of the digital product - Help design user flows. - Work closely with the team to ship and iterate fast - Contribute to both technical decisions and early product feedback What We’re Looking For - Strong full-stack engineering ability, you’ve shipped real products end-to-end with a focus on mobile - Experience with front-end frameworks (React, Vue, etc.) / back-end stacks (Node.js, Python, Go, etc.) - Familiar with GraphQL or REST APIs, and managing databases (Postgres, MongoDB, etc.) - Solid understanding of UI/UX principles and a good eye for product design - Independent builder mindset, self-driven, comfortable with ambiguity, and fast execution - Extra bonus: experience in the development of creator tools

Compensation - Hybrid role: equity + cash - Fully remote, timezone-flexible - Tons of ownership and creative freedom - Early builder status: You will help craft the vision, the product, the roadmap, and the company overall


Hey, I made easy to use background tasks extension in Django. I decided to do it because sometimes I need really straightforward tasks in my private projects but I don't need something as advanced as Celery.

Hey fund Viralia get a unique piece of art

To each their own :). One of my favourite places in London.

The Barbican was never built as social housing - the intended occupants were always central London professional workers and they charged market rates.

Here is CoCalc by SageMath, Inc.'s latest open-source contribution!

http-proxy is one of the oldest and most famous nodejs modules, and it gets downloaded around 15 million times a week. (And we've loved using it for years.)

Unfortunately, it is unmaintained, it has significant leaks that regularly crash production servers, and is written in ancient untyped JavaScript. The maintainers have long since stopped responding, so there is no choice but to fork and start over. This version is currently being used in production on cocalc.com.

We wanted to do our part to help maintain the open-source ecosystem, hence this library. We hope you find it useful!


Seems they have a big opportunity to harness all those books for a gamechanging teaching tool.

Hi, I'm the first author of the paper. Thank you so much for your comment, it means the world to me!

congrats - this seems really cool. just tested it out and can see the value of this!

I wish the universe would give me the option not to restart and apply this update right in the middle of a project! It's not like I was working on something or had my browser tabs the way that I like them.

Hi, I'm the first author of the article. Thank you very much for your interest in the method!

For the 2 spheres, it's as you wish ! It depends on where you place the uv system in the CSG tree, if it is above a single sphere then it will remain on it, if it's above the union, the field should extent on both.

The main interest of the method is the fact that the output remains implicit so one can easily plug it in a CSG pipeline.


very nice

Hi, thanks for your very helpful comments! I just updated the website so you can't choose dates before today, and I also added a delete function. Now if you're the creator or have the password, you can delete the event. Otherwise, you can't delete it.

For the minute-level time selection, I agree it would be very useful, especially for busy people. I'm thinking about the best way to implement this meanwhile keep a clean interface.


That's the best thing to do for the moment.

Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: