MCP server for Ghidra

randomtoast · 2025-03-27T12:35:50 1743078950

I hope that one day we have a tool that can convert any proprietary binary to source code with a single click. It would be so much fun to have an "open source" version of all games. Currently, there are projects like https://github.com/Try/OpenGothic and https://github.com/SFTtech/openage, but these require years of community effort.

airza · 2025-03-27T15:15:43 1743088543

Current SOTA models are really bad at RE and i don't really expect this to improve through training on open data.

There are just not a lot of high quality examples on the internet, and more importantly the people writing this code are doing their best to make it actively more difficult.

sebzim4500 · 2025-03-27T16:13:24 1743092004

It is quite easy to produce high quality synthetic data to train reverse engineering. Just take any open source project and ask the model to produce the code (or something equivalent) given the binary.

ai-christianson · 2025-03-27T17:49:39 1743097779

Right. You could even run it through code obfuscators and such to create more diverse, realistic examples.

gus_massa · 2025-03-27T15:26:31 1743089191

You can't open source code that is not yours. They are implementing a clean new version.

On the other direction, a company can't pick a GPL project, uncompile the code and release it as proprietary.

randomtoast · 2025-03-27T15:41:57 1743090117

> They are implementing a clean new version.

Much of reverse engineering involves analyzing existing code, and this is not a secret. There are forums where people discuss and share their reverse engineering findings. Without this, creating a nearly 100% compatible clone, such as one that can use the original game files, would be nearly impossible.

Xx_crazy420_xX · 2025-03-27T07:40:34 1743061234

For LLMs to solve code I think they should be AST-native. Code is a tree, not a sequence — yet we feed it to models linearly, with no explicit structure. Todays models lack recurrence or true memory, so they can’t reason over hierarchical structures effectively.

Nesco · 2025-03-27T08:00:07 1743062407

LLMs are autoregressive models. However, the notion of order in ASTs might be nonexistent, especially for parallel branches of computation/control flow. You could attempt to untangle each branch into N sequences, but this would erase control-flow information.

Even when there is an objective ordering of the children of every node, you still have four traversal options: {preorder, postorder} × {BF, DF}.

Note: For children lacking an objective ordering, you might apply generic rules to define a traversal order, but you’d end up with as many depth-first traversals as there are possible orders—essentially a crude heuristic. If you want the evaluation order to be dynamic at each step (e.g., using RL), the complexity grows geometrically worse. That’s been my experience tinkering with a custom AST DSL for ARC-AGI.

Xx_crazy420_xX · 2025-03-27T08:40:55 1743064855

Cool to hear you've worked on ARC-AGI — I poked with it too. You’re totally right about the messy traversal space, especially with parallel branches. What feels ambiguous at the token level becomes structured ambiguity in the AST — and that’s progress.

My hunch is that LLMs don’t need to solve the whole traversal space — they just need a clean, abstract interface. Even parallel branches can be normalized into a schema that the model can reason over consistently. And in practice, you rarely need full recursion or a complete tree walk to understand a node — but having that option unlocks deeper comprehension when it counts.

This kind of structural understanding would also massively improve Copilot-style tools, especially for less popular libraries where token-level familiarity breaks down. If models could reason over types and structure instead of guessing based on frequency, completions would be a lot more reliable outside the top 1% of APIs.

dragonwriter · 2025-03-27T08:39:59 1743064799

> LLMs are autoregressive models.

Most LLMs are autoregressive models, but exceptions exist, e.g., Mercury [0] is a diffusion LLM.

[0] https://www.inceptionlabs.ai/news

Nesco · 2025-03-28T10:05:27 1743156327

Well, from my very limited comprehension of diffusion models, they apply to fixed length structure, mostly from a continuous space. Maybe a way to make them work with tree structures could be found - that's no trivial task

dragonwriter · 2025-03-28T15:24:20 1743175460

Autoregressive LLMs don't usually work on tree structures, they work on capped-length linear token sequences, which are isomorphic to fixed-length sequences.

I'm not sure why you think working on tree structures rather than fixed length sequences would be necessary for diffusion language models—which, again, actually exist; aside from Mercury which is proprietary, there is also LLaDA: https://ml-gsai.github.io/LLaDA-demo/

gnfargbl · 2025-03-27T12:09:04 1743077344

Has there been much work on reversing binaries into an AST form? It seems like something that somebody would have thought of researching, but I've not come across any efforts.

Is it something you can do generically, or do you need to know the specific compiler? Do you need to know the specific language, even, or could you perhaps create some other hypothetical AST in a different language that would have led to the same binary?

lmeyerov · 2025-03-27T16:40:56 1743093656

The graph part , more so than the ast part, makes sense to me. We reason over programs as hairy dataflow/controlflow/etc dependency graphs that happen to originally be encoded as some sort of text->ast.

GNNs went down some roads here, but never felt like a path to reasoning. So how to get an RL reasoner flow to do what is easy for datalog, natively and/or as a tool?

pilooch · 2025-03-27T14:03:47 1743084227

Or just we could forget about code and have model act directly :) That's my bet.

otabdeveloper4 · 2025-03-27T08:26:01 1743063961

LLMs process information in a strictly sequential manner. It's their core capability and what makes them feel so anthropomorphic.

dragonwriter · 2025-03-27T08:35:54 1743064554

> LLMs process information in a strictly sequential manner.

"LLMs" as a class do not. Most LLMs, because most LLMs are autoregressive models, but diffusion LLMs exist and are not sequential in the way that autoregressive models are.

> It's their core capability

Being sequential is not a capability at all, much less a core one defining Large Language Models.

> and what makes them feel so anthropomorphic.

I disagree with this, too; I think what makes LLMs "feel so anthropomorphic" is the fact that most humans are very focused on language in perceiving other humans as human, and LLMs' output (as their name suggests) models human use of language, directly targeting a key feature used to identify something as human-like.

otabdeveloper4 · 2025-03-27T10:51:32 1743072692

The gimmick of the LLM is that it outputs text sequentially, as if it is talking to us. That's what makes them feel "alive" and "intelligent" to us. (And yes, ironically it's this sequential nature that actually limits their intelligence in practice, but whatever. The AI hype is about appearances, not facts.)

lucianbr · 2025-03-27T13:13:00 1743081180

> That's what makes them feel "alive" and "intelligent" to us.

What is the basis for this claim? Seems like "A" (chatbots output text sequentially) is true, and "B" (they feel intelligent to us) is true, and you're claiming "A causes B" without any support at all. Just because they happen to both be true and you personally feel there is a causal relationship, which proves nothing.

dragonwriter · 2025-03-27T18:33:55 1743100435

> The gimmick of the LLM is that it outputs text sequentially, as if it is talking to us. That's what makes them feel "alive" and "intelligent" to us.

Yes, I got that that was the original claim. I still disagree with us. What makes them feel alive and intelligent is that they produce human-like language output, not that the process by which they construct that output is sequential. Non-autoregressive LLMs of equal output quality would (do) appear just as alive and intelligent as autoregressive LLMs. An autoregressive LLM behind a non-streaming request/response interface where the token-by-token sequencing of the response is not exposed to the user still seems just as intelligent as one where the output is streamed to the user.

rowanG077 · 2025-03-27T11:28:09 1743074889

Are you saying that if visually LLMs would not output text sequentially but at once they would not be as successful as they are?

otabdeveloper4 · 2025-03-28T08:38:22 1743151102

Yes. Human speech is sequential (we make sounds one by one), and when LLMs mimic this with token-by-token autocomplete they seem more anthropomorphic to us.

(I take issue with the word "successful", though. Selling LLMs as a human-like intelligence is a gimmick and a borderline scam.)

mike_hearn · 2025-03-27T08:51:41 1743065501

Not fully.

The point of transformer attention is cross-wise processing of tokens that computes their relationship to each other at multiple levels of abstraction. That's why LLMs can read so fast: they're processing all the input tokens in parallel.

LLMs emit tokens in a sequential manner at the level of the outer loop, but clearly inside the activations is a non-sequential map of the entire planned output, otherwise they wouldn't be able to make coherent sentences or speak German (which puts verbs at the end).

qwertox · 2025-03-26T19:08:20 1743016100

Which tools can currently invoke MCP? I have read only a little about MCP and got to know that Claude's desktop application is capable of using MCP locally.

Are there any chat interfaces which allow using MCP remotely?

I would like to be able to specify MCP endpoints and the functions they offer in ChatGPT's, Claude's and Gemini's web interfaces so that I can have them call my servers remotely. A bit like "GPTs" and "Gems".

lauriewired · 2025-03-26T19:23:43 1743017023

I touch on this briefly in the video, beside Claude Desktop, 5ire is a fairly model-agnostic local MCP client, I'm sure there are others.

sama also recently mentioned ChatGPT Desktop is getting MCP client functionality "soon".

As for remote clients, Cloudflare has some really useful tooling, look at their "AI Playground".

jauntywundrkind · 2025-03-26T19:43:40 1743018220

OpenAI just announced support in their Agents SDK. https://news.ycombinator.com/item?id=43485566 https://openai.github.io/openai-agents-python/mcp/

electroly · 2025-03-26T23:13:31 1743030811

I use them in Cursor. Writing an MCP server is trivial, just ask Cursor to put one together in TypeScript. You would use your local MCP server to call whatever remote API you want (or perform some other task). The MCP server uses stdin/stdout to talk to Cursor.

efunnekol · 2025-03-26T19:56:05 1743018965

You can use MCP servers in SAM (Solace Agent Mesh). That has a chat interface and can be run remotely. Perhaps the easiest way to do it remotely is to use a Slack integration to SAM with a free Slack workspace, which doesn't require poking a hole to serve the browser UI

https://github.com/SolaceLabs/solace-agent-mesh

jevyjevjevs · 2025-03-27T11:04:42 1743073482

I'm using Librechat which I've found to be quite feature complete. I updated an Obsidian MCP to get my most recent journal entries to act like a therapist. Example setup here: https://www.jevy.org/articles/obsidian-mcps-to-work-with-not...

dockerd · 2025-03-27T11:39:03 1743075543

@jevyjevjevs,

Can you add rss feed to your site blog? I found few of the articles interesting and helpful. I would like to subscribe but I don't see rss or email subscription.

nekitamo · 2025-03-26T19:42:11 1743018131

I had the same question as you, and some quick Googling led me to this list here:

https://github.com/punkpeye/awesome-mcp-clients

lordviet · 2025-03-26T21:46:20 1743025580

and the list of servers - https://github.com/punkpeye/awesome-mcp-servers

salgorithm · 2025-03-26T20:00:35 1743019235

Block has an open source tool called Goose that invokes MCP. https://block.github.io/goose/

hedgehog · 2025-03-27T02:15:38 1743041738

Is there a trick to making it work well? I tried Goose briefly but it seemed very flaky compared to Open Web UI with hand-configured tool calling.

fixprix · 2025-03-26T22:23:30 1743027810

Unity, Blender and Photoshop all have rough MCP integrations available. You can find them on GitHub.

mettamage · 2025-03-26T19:59:48 1743019188

If you run some proxy server, you could run MCP servers remotely

asphodel_gray · 2025-03-26T20:10:41 1743019841

Cursor has support for it I believe

mdaniel · 2025-03-26T14:25:32 1742999132

Her previous integration with Ghidra and an LLM had a good video, too: https://news.ycombinator.com/item?id=42860849

Malimite – iOS and macOS Decompiler - https://news.ycombinator.com/item?id=42829402 - Jan, 2025 (37 comments)

sorenjan · 2025-03-26T21:54:18 1743026058

If you haven't watched her Youtube channel before I recommend checking it out. Besides the technical content I think the editing with retro OS graphics are fun.

foooorsyth · 2025-03-27T02:46:49 1743043609

It's really impressive. Technical content, GitHub repos that go along with the videos, set design, retro editing -- much higher quality than a lot of stuff out there from major studios

npace12 · 2025-03-26T20:04:27 1743019467

Also one for radare2:

https://github.com/dnakov/radare2-mcp

ngneer · 2025-03-27T01:07:37 1743037657

Thought experiment. Suppose all binaries could be instantly reverse engineered to perfection. How would that change security?

LegionMammal978 · 2025-03-27T02:43:24 1743043404

Everyone would just replace all their proprietary programs with dumb clients that communicate with a server. Either that, or they'd go all in on homomorphic encryption.

ynniv · 2025-03-27T02:45:03 1743043503

Only formally proven systems will be secure

xeckr · 2025-03-27T01:31:56 1743039116

Everything is open source is you speak assembly.

gosub100 · 2025-03-27T14:35:04 1743086104

Secure enclaves would appear in most computers. Nothing would be run without everything being encrypted.

brokensegue · 2025-03-26T03:01:59 1742958119

my experience with just copying and pasting things from ghidra into LLMs and asking it to figure it out wasn't so successful. it'd be cool to have benchmarks for this stuff though.

Everdred2dx · 2025-03-26T05:27:38 1742966858

I actually have only tried this once but had the opposite experience. Gave it 5 or so related functions from a ps2 game and it correctly inferred they were related to graphics code, properly typing and naming the parameters. I’m sure this sort of thing is extremely hit or miss though

strstr · 2025-03-27T04:30:01 1743049801

Had the same experience. Took the janky decompilation from ghidra, and it was able to name parameters and functions. Even figured out the game based on a single name in a string. Based in my read of the labeled decompilation, it seemed largely correct. And definitely a lot faster than me.

Even if I weren’t to rely on it 100% it was definitely a great draft pass over the functions.

cedws · 2025-03-26T06:45:26 1742971526

Most likely there was just a mangled symbol somewhere that it recognised from its training data.

rowanG077 · 2025-03-26T17:52:43 1743011563

Where is that coming from? The chances that some random ps2 games code symbols are in the training data are infinitesimal. It's much more likely that it can understand code and rewrite it. Basically what LLM have been capable of for years now.

sitkack · 2025-03-26T18:02:46 1743012166

Parent is supposing w/o any experience. LLMs can see in hex, bytecode and base64, rot13, etc. I use LLMs to decompile bytecode all the time.

rfoo · 2025-03-26T18:22:28 1743013348

I've been thinking on how to build a benchmark for this stuff for a while, and don't have a good idea other than LLM-as-judge (which quickly gets messy). I guess there's a reason why current neural decompilation attempts are all evaluated on "seemingly meaningless" benchmarks like "can it recompile without syntax error" or "functional equivalence of recompilation" etc.

vessenes · 2025-03-26T20:16:39 1743020199

Hmm, specifically when it comes to reverse engineering, you have the best benchmark ever - you can check the original code, no?

brokensegue · 2025-03-27T02:58:09 1743044289

that requires LLM as judge

dataangel · 2025-03-27T11:40:19 1743075619

no it doesn't, you just diff against the real source code. probably something more fuzzy/continuous than actual diff, but still

rfoo · 2025-03-30T09:58:39 1743328719

Besides functional equivalence, a significant part of the value in neural decompilation is the symbol (function names, variable names, struct definition including member names) it recovered. So, if the LLM predicted "FindFirstFitContainer" for a function originally called "find_pool", is this correct? Wrong? 26.333% correct?

brokensegue · 2025-03-27T12:28:17 1743078497

Proving that two pieces of code are equivalent sounds very hard (incomputable)

Everdred2dx · 2025-03-27T04:22:31 1743049351

Is anyone working on a "catalog" of MCP servers? Searching on Github is not exactly the best way to discover these.

meander_water · 2025-03-27T04:58:07 1743051487

I've noticed a lot of websites popping up recently which is basically just a list of MCP servers. Some examples:

- https://mcpservers.org/

- https://glama.ai/mcp/servers

- https://www.claudemcp.com/servers

Not to mention the usual GitHub ones:

- https://github.com/punkpeye/awesome-mcp-servers

The hype is real.

knowaveragejoe · 2025-03-27T06:27:32 1743056852

To clarify somewhat, while they all index MCP servers out there, some of them also will _host_ the MCP server remotely as well. Glama, mcp.run and just recently Cloudflare have offerings in this realm.

Klaster_1 · 2025-03-28T13:20:24 1743168024

Do these MCP registries expose an MCP server too, so the client can do MCP server auto discovery based on registry?

dSebastien · 2025-03-27T06:37:47 1743057467

There are multiple directories already. I listed some in my notes: https://notes.dsebastien.net/30+Areas/33+Permanent+notes/33....

cocoflunchy · 2025-03-27T10:01:18 1743069678

https://www.mcpt.com/

celesian · 2025-03-27T03:26:27 1743045987

This is very cool but it would be nice to have more features on the MCP server, such as arbitrary read and write of programs. For example, I was working on a self-unpacking CTF challenge which XORed instructions. It would be nice to have it be able to read the values at the addresses it xored.

dang · 2025-03-26T17:45:20 1743011120

Related (but merged hither):

GhidraMCP: Now AI can reverse malware [video] - https://news.ycombinator.com/item?id=43475025

userbinator · 2025-03-27T04:12:49 1743048769

RE is exactly the sort of work that requires precision and careful reasoning, not hallucinatory statistical inference. Seeing how LLMs stumble very heavily on the former makes it clear that AI will not replace us.

iugtmkbdfil834 · 2025-03-27T04:38:54 1743050334

I hate to be that guy, but one does not follow the other. To some, just the initial appearance of 'acceptable'/'good enough' is, well, good enough. Current set of LLMs can absolutely replace us while breaking a lot in the process.

enigma101 · 2025-03-27T11:47:50 1743076070

You just opened pandora's box lady wired

dprophecyguy · 2025-03-27T03:42:37 1743046957

i love you lauriewired.