I'm working on ScaleDown [1], a context pruning API.
Over the past few years, I've watched contexts steadily grow in AI apps. And while LLM context windows have also been growing, the effective limit is still roughly 200k tokens; performance drops off a cliff beyond that (you may have noticed this in long AI chats).
It is a simple API that prunes away the parts of a context that are irrelevant to a given prompt, a.k.a. context-aware pruning. Integration is super simple: just one extra API call before the final LLM call. You can get an API key from the website.
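To make the integration pattern concrete, here's a minimal sketch. The `prune_context` function below is just a toy keyword-overlap stand-in, not ScaleDown's actual algorithm or API; the point is only the shape of the flow: prune first, then send the smaller context to the LLM.

```python
def prune_context(context: str, prompt: str) -> str:
    """Toy stand-in for a context-pruning call: keep only the
    sentences that share a keyword with the prompt."""
    keywords = {w.lower().strip("?.,") for w in prompt.split() if len(w) > 3}
    sentences = context.split(". ")
    kept = [
        s for s in sentences
        if {w.lower().strip("?.,") for w in s.split()} & keywords
    ]
    return ". ".join(kept)

context = ("Our billing system runs on Postgres. "
           "The office plants need watering on Fridays. "
           "Invoices are generated nightly by a cron job.")
prompt = "How are invoices generated in the billing system?"

# Step 1: prune the context before the expensive LLM call.
pruned = prune_context(context, prompt)
# The off-topic plant-watering sentence is dropped;
# the billing/invoice sentences survive.

# Step 2 (not shown): pass `pruned` + `prompt` to your LLM of choice.
```

In a real integration you'd replace `prune_context` with the HTTP call to the pruning service and feed the result into your existing LLM request.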
I would love to chat if this is something that is relevant to you and if you have any feedback on what we are building!
[self promotion alert] The "you are not alone" point really resonated with me. When I lost my job, I felt alone, helpless, and unsure what the next steps were. That's why I tried to create a community of people willing to support and be a listening ear for others going through job loss and this tough job market. It's at layoff.supprt. Honestly, I haven't been maintaining it for a while, but if you find it helpful and would like more features, do let me know!
Really interesting. What did the original prompt look like? Perhaps the original prompt just wasn't very good? I feel like the changes Claude suggested (except maybe a couple) are already well-known prompt engineering practices.
I realised that I'm a horrible prompt writer, and so are many other people. So I created this extension to help me write better prompts using templates, save prompts, and optimize them for better performance. No login, no payment, just a useful little extension. Check it out!