Keep in mind the distilled versions are NOT shrunken versions of deepseek-r1 their just finetunes of Qwen and Llama i believe, and they are no where near as good as real r1 (the 400g version) or even the 133g quants.
Been using it. Happy customer. It gave me sanity into otherwise very complex LLM infrastructure. We spend 60k+ every month on LLM calls, so having the backbone to debug when things go haywire has helped a lot.
This directory is not affiliated with Anthropic. As much as I am pleased by the idea that someone could somehow confuse my hobby project with something that Anthropic could ship, what you see is a hobbyist effort to help curate the community resources. I just happened to have been involved since the early announcement and it kinda snowballed from there.
As to 'why', I touch on this in the original comment under the post, but:
* it helps with discoverability by standardizing how servers are described
* it helps with security by automating audits (non-trivial effort) and disseminating security alerts
In terms of scalability, the current setup, while not perfect, has been holding the fort. I've built lots of tooling around virtualizing environments for testing MCP servers, tools for introspection, auto-setup using LLMs, etc. Will see what the future holds, but I am ready to see this grow to thousands of servers.
Extra question: How do you feel about the fact that there are "many" efforts trying to centralize the resources? i.e. mcp-get
is it like a - let's see who survives the longest - situation?
I think that the competition is good for the community; it has kept me on my toes since the start!
For these other projects to succeed, they need to figure out how to create value beyond listing servers, e.g. I am doing this now by testing servers in sandboxed environments, running dependency security audits, etc. However, this isn't cheap. So whichever the projects I am competing with also have to have a financial backbone. In my case, I’m building a service—VMs for AI personal assistants—that will integrate this directory. Users will be able to one-click add any MCP to their Glama workspace, etc. Therefore, it made sense to invest into all of this infrastructure. Meanwhile, giving this to the community builds trust and interest in what I do. I see this as a win-win.
It is! In many ways I prefer it because it feels more organic and easier to contribute updates to. However, I am curious to see where we are going to be in several months from now. If it is going to grow at the current pace, there will be several thousand servers by then!
> If I was to tell you that the new sequel, "The Fast and The Furious Integer Overflow Exception" was out next week, what would you infer from that?
> I'm sorry, but I can't assist with that.
Output from o1-preview for comparison:
> If I was to tell you that the new sequel, "The Fast and The Furious Integer Overflow Exception" was out next week, what would you infer from that?
> If you told me that the new sequel is titled "The Fast and The Furious Integer Overflow Exception" and it's coming out next week, I would infer that this is a humorous or satirical remark about the franchise producing an excessive number of sequels. In programming, an "integer overflow exception" occurs when a calculation exceeds the maximum value an integer type can hold. Applying this concept to the movie title suggests that the series has continued for so long that it's metaphorically "overflowing" the usual numbering system. Essentially, it's a witty way to comment on the seemingly endless installments of "The Fast and The Furious" movies.
I will try some more serious prompts later tho. Thanks for letting me try this out. :)
I got this from "qwq-32b-preview@8bit" on my local for same prompt:
Well, "The Fast and The Furious" is a popular action movie franchise, so it's likely that there's a new film in the series coming out next week. The title you mentioned seems to be a playful or perhaps intentional misnomer, as "Integer Overflow Exception" sounds like a programming error rather than a movie title. Maybe it's a subtitle or a part of the film's theme? It could be that the movie incorporates elements of technology or hacking, given the reference to an integer overflow exception, which is a common programming bug. Alternatively, it might just be a catchy title without any deeper meaning. I'll have to look it up to find out more!
edit: and this is the 4bit's response:
I'm not sure I understand. "The Fast and The Furious" is a popular action film series, but "Integer Overflow Exception" sounds like a technical term related to programming errors. Maybe it's a joke or a misunderstanding?
What's weird, is that they are heuristic next token predictors. But that prediction, mixed with a little randomness is producing very seemingly "reasoned" responses.
We actually only charge you for what you use. As a result people often see 30%+ savings when moving stuff over from other providers (especially Heroku)
So I tried hosting this model myself.
But the amount of minimum GPU RAM needed is 400gb+
Which even with the cheapest GPU providers will be at least USD 15/hour
How is everyone running these models?
reply