Hacker Newsnew | past | comments | ask | show | jobs | submit | acchow's commentslogin

> Every MCP server injects its full tool schemas into context on every turn

I consider this a bug. I'm sure the chat clients will fix this soon enough.

Something like: on each turn, a subagent searches available MCP tools for anything relevant. Usually, nothing helpful will be found and the regular chat continues without any MCP context added.


Absoultely.

I'll add to your comment that it isn't a bug of MCP itself. MCP doesn't specify what the LLM sees. It's a bug of the MCP client.

In my toy chatbot, I implement MCP as pseudo-python for the LLM, dropping typing info, and giving the tool infos as abruptly as possible, just a line - function_name(mandatory arg1 name, mandatory arg2 name): Description

(I don't recommend doing that, it's largely obsolete, my point is simply that you feed the LLM whatever you want, MCP doesn't mandate anything. tbh it doesn't even mandate that it feeds into a LLM, hence the MCP CLIs)


Yup, routing is key. Just like how we've had RAG so we don't have to add every biz doc to the context.

I agree with the general idea that models are better trained to use popular cli tools like directory navigation etc, but outside of ls and ps etc the difference isn't really there, new clis are just as confusing to the model as new mcps.


You’re spot on. Anthropic blogs talk about a ToolSearchTool to solve this problem - https://www.anthropic.com/engineering/advanced-tool-use

> > Every MCP server injects its full tool schemas into context on every turn

> I consider this a bug. I'm sure the chat clients will fix this soon enough.

ANTHROP\C's Claudes manage/minimize/mitigate this reaonably.


That’s a trade off, now you need multiple model calls for every single request

Yes we just RAG to be applied on tools. Very simple to implement.

I don’t think so. Without a list of tools in context the ai can’t even know what options it has, so a RAG like search doesn’t feel like it would be anywhere near as accurate

The RAG helps select the tool needed for the task at hand. Semantic search returns only the tools that match. Very efficient.

But MCP uses Oauth. That is not a "worse version" of API keys. It is better.

The classic "API key" flow requires you to go to the resource site, generate a key, copy it, then paste it where you want it to go.

Oauth automates this. It's like "give me an API key" on demand.


Interesting pricing differential. Seems in your country, that IdeaPad is significantly cheaper than the price in the US. But for your Macbook Neo, it's the other way around.

Wonder why that is.


No idea. Maybe Lenovo includes purchasing power in the price calculation for some reason, such as making more money in the U.S. while gaining market share here in Czechia, where purchasing power is lower. Apple may be able to afford not to do that.

I had the opposite issue. Trackpoints stated hurting my hand because it requires significantly more force than the Mac's touchpad.

We probably shouldn't deprecate AM for emergency broadcasts given you can listen to AM radio with grass https://www.youtube.com/watch?v=b9UO9tn4MpI

Only at the point of emission however...

> Claude code always give me rate limits

> 50+ tps with qwen3.5 35b a4b on a 4090

But qwen3.5 35b is worse than even Claude Haiku 4.5. You could switch your Claude Code to use Haiku and never hit rate limits. Also gets similar 50tps.


I haven't tried 4.5 haiku much, but i was not impressed with previous haiku versions.

My goto proprietary model in copilot for general tasks is gemini 3 flash which is priced the same as haiku.

The qwen model is in my experience close to gemini 3 flash, but gemini flash is still better.

Maybe it's somewhat related to what we're using them for. In my case I'm mostly using llms to code Lua. One case is a typed luajit language and the other is a 3d luajit framework written entirely in luajit.

I forgot exactly how many tps i get with qwen, but with glm 4.7 flash which is really good (to be local) gets me 120tps and a 120k context.

Don't get me wrong, proprietary models are superior, but local models are getting really good AND useful for a lot of real work.


But in your country (Spain), Telefónica de España laid off 3649 workers in Dec 2023 (about 40% of that unit) despite growing net income by 17% that year.


Nice googling, but that’s just an example that proves my point.

They had to go through a process extensively justifying losses (mostly that certain jobs were no longer relevant as they were pre-digital workforce), negotiate with unions and offer voluntary leaving conditions.

The resulting offer was good enough that more workers applied to be fired than were necesssary. For context, the deal was basically to pay them 70% of their current salary from the dismissal moment until their retirement at 63.


What was the average age of the worker that took that deal?


Can you point to the laws in your country that would make this illegal? I’m skeptical


I can, easily. Speaking for my country and assuming you are not on a fixed-term contract, you can only get fired for one of the three reasons:

1. Company's in financial trouble and forced to downsize.

2. The position becomes obsolete and there's no option to transition you into some other role. In this case, the company can't hire anyone with a similar-enough skillset to yours for at least a year (or maybe even longer, I'm not sure).

3. Gross incompetence, in which case you need to be given an opportunity to course correct via a few documented warnings before being fired. Every warning requires your signature so that the company can't just make them up and backdate them.

That said, you don't become a permanent employee on day one, a company can issue up to three fixed-term contract before being forced to give you a permanent one. If you're on a fixed-term contract, they can just not extend it without having to satisfy any of the criteria above. But after a maximum of 3 years at the same company (as the maximum length for a fixed-term contract is one year), the criteria for firing you increases drastically.

So, the only way this could happen in my country is if the company stops renewing fixed-term contracts for recent employees, but then it wouldn't all be at the same time and you'd get the hint before the time comes to plan accordingly.


Wouldn't 2 be applicable to Block? Dorsey is pretty much saying AI's made these positions obsolete.


>Speaking for my country

I like how none of you ever reveal this mysterious country you're from, probably so you don't get called on the claims you're making.

Anyway, in my country unemployment is 0% and everyone is rich and there are no problems. Why can't your country achieve the same?


As an example, the comment you're replying to is true for The Netherlands



People don't want to leak their personal information, duh. The country they are from is another bit of information that you can't take back from the internet once published. Why would you do that?


It takes a couple of minutes to find a country with the same laws via Google. Takes a second for LLM.


> Anyway, in my country unemployment is 0% and everyone is rich and there are no problems. Why can't your country achieve the same?

Where? I want to move there immediately. I bet you’re lying though.


> But with CCTV and computer vision it's getting increasingly cheap.

The barrier in the US isn't cost. It's a right to privacy and a culture of distrust of government.


I think that barrier may be weakening. I reckon that the people most concerned about crime are willing to sacrifice their privacy and defer to their government to prevent crime.


Some people, sure.

But I think the shenanigans of ICE are making people more aware of the importance of privacy. Look at the backlash Ring (lost dog superbowl ad) and Discord (age verification) and Nest (Guthrie case) received just this year.


The people I have in mind are the rich, poor and those who fantasize becoming wealthy and fear going broke. I’m uncertain how much these demographics account for the US population and empirically speaking I’m unsure of the gravity of the PR stirs you named. I really don’t know if privacy is the foremost concern when the types of people I’m thinking of consider ICE either.

It really seems to boil down to whether these types of people can be effectively sold on the virtue of tearing down the barriers of privacy and government. If they aren’t already implicitly sold to that then all it takes is for the powers that be to do a better job at marketing their initiative.


> In my experience LLM’s can make new things when they are some linear combination of existing things

It seems to me that all “new ideas” are basically linear combinations of existing things with exceeding rare exceptions…

Maybe Godel’s Incompleteness?

Darwinian evolution?

General Relativity?

Buddhist non-duality?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: