I would love to learn more about their challenges, as I have been working on an Excel AI add-in for quite some time and have followed Ask Rosie almost from the start.
That they have now gone through the whole cycle worries me that I'm too slow as a solo founder building on the side in these fast-paced times.
That seems to be true for any startup that offers a wrapper around existing AIs rather than an AI of its own. The lucky ones might get bought, but many, if not most, will perish trying to compete with companies that actually create AI models and with companies large enough to integrate their own wrappers.
Sorry to hear that. Curious whether the product's approach was a "fuck you" to Workday, as in you can't even put them side by side and compare (they are so different), or simply "Workday sucks, we will do better."
Paul Graham also has a good way of framing this, which perhaps I should have touched upon.
"A principle for taking advantage of thresholds has to include a test to ensure the game is worth playing. Here's one that does: if you come across something that's mediocre yet still popular, it could be a good idea to replace it. For example, if a company makes a product that people dislike yet still buy, then presumably they'd buy a better alternative if you made one."
I definitely wrote it by hand, no LLM used if that is what you are insinuating.
Uber might have turned to the bad practices of taxis now that they are focused on extracting more and more value. However, the point of the piece was the earlier days. Back then, they genuinely tried to embody price transparency and a customer-focused experience.
If you are arguing that they never did, I don't see how they grew into a $200B company.
They did it because they were losing money to capture the market.
The smart places knew it and regulated Uber or killed it, keeping local taxis working.
It's kind of like a country subsidizing its car industry, exporting tons of cars, and killing domestic production in another country. Then it can jack up prices and profit on its own terms.
They don't win because their cars are better. They win because they are lying with price.
IMO it would be better to keep the whole memory outside of inference time, where you could use an LLM as a judge to track the chat output and the submitted prompts.
It would sort of work like Grammarly itself, and you could use it for metaprompting.
I find all the memory tooling, even the native ones in Claude and ChatGPT, to be too intrusive.
I've been building exactly this. It's currently a beta feature in my existing product. Can I reach out to you for feedback on the metaprompting/Grammarly aspect of it?
Totally get what you're saying! Having Claude manually call memory tools mid-conversation does feel intrusive, I agree, especially since you need to keep saying yes to tool access.
Your approach is actually really interesting: like a background process watching the conversation and deciding what's worth remembering. More passive, less in-your-face.
I thought about this too. The tradeoff I made:
Your approach (judge/watcher):
- Pro: Zero interruption to conversation flow
- Pro: Can use cheaper model for the judge
- Con: Claude doesn't know what's in memory when responding
- Con: Memory happens after the fact
Tool-based (current Recall):
- Pro: Claude actively uses memory while thinking
- Pro: Can retrieve relevant context mid-response
- Con: Yeah, it's intrusive sometimes
Honestly, both have merit. You could even do both: a background judge for auto-capture, plus tools for when Claude needs to look something up.
The Grammarly analogy is spot on. Passive monitoring vs active participation.
Have you built something with the judge pattern? I'd be curious how well it works for deciding what's memorable vs noise.
Maybe Recall needs a "passive mode" option where it just watches and suggests memories instead of Claude actively storing them. That's a cool idea.
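For what it's worth, that passive mode could be sketched roughly like this, with a pluggable `judge_fn` standing in for a cheap-model LLM call that returns either a distilled memory string or None. All names here are hypothetical, just to show the shape of the judge/watcher pattern:

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

@dataclass
class MemoryStore:
    """Naive in-memory store; a real version would persist to disk."""
    entries: list = field(default_factory=list)

    def add(self, text: str) -> None:
        self.entries.append(text)

def watch_turn(
    user_msg: str,
    assistant_msg: str,
    judge_fn: Callable[[str, str], Optional[str]],
    store: MemoryStore,
) -> Optional[str]:
    """Run the judge after a turn completes, so the main
    conversation never sees a tool call or a permission prompt."""
    memory = judge_fn(user_msg, assistant_msg)
    if memory is not None:  # None means "noise, not worth keeping"
        store.add(memory)
    return memory
```

In practice `judge_fn` would be a single prompt to a small model asking "is there anything durable here worth remembering?", and in a suggest-only passive mode you'd surface `memory` to the user instead of writing it directly.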
OpenCog differentiates between experiential and episodic memory, and various processes rewrite a hypergraph stored in RAM in the AtomSpace. I don't remember how the STM/LTM limit is handled in OpenCog.
So: the MRU/MFU knapsack problem, plus a more predictable primacy/recency bias because of context length limits and context compaction?
> Economic Attention Allocation (ECAN) was an OpenCog subsystem intended to control attentional focus during reasoning. The idea was to allocate attention as a scarce resource (thus, "economic") which would then be used to "fund" some specific train of thought. This system is no longer maintained; it is one of the OpenCog Fossils.
(Smart contracts require funds to execute (redundantly and with consensus), and there, too, resources are scarce.)
Now there are ProxyNode and StorageNode implementations, but Agent is not yet reimplemented in OpenCog?
I built this with OpenAI GPT-5 agentic loops as well, where the agent interacts with a spreadsheet: banker dot so
Some principles I learned:
1. You don't really want to use any framework that wraps the OpenAI or Claude SDKs. You end up fighting them
2. A small number of tools, each with more functions, beats a large number of tools with few functions inside
3. The function definitions you put inside the tools are actually very important and need to be very clear
4. Integrating another agent as a tool of the main agent can cut your main agent's context usage by 5x
5. RAG via vector search alone is mostly unnecessary. Claude Code itself leverages iterative search, which works far better with code repos as well as most documents
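As a concrete illustration of points 2 and 3: here's the general shape of a single consolidated tool in the OpenAI function-calling schema. The tool name, operations, and fields are hypothetical, not Banker's actual definitions, just a sketch of one tool exposing several clearly documented operations instead of a separate tool per operation:

```python
# Hypothetical example in the OpenAI function-calling schema shape:
# one "spreadsheet" tool with an operation enum, rather than three
# separate read/write/formula tools.
spreadsheet_tool = {
    "type": "function",
    "function": {
        "name": "spreadsheet",
        "description": "Read and edit the active spreadsheet. "
                       "Use A1-style ranges everywhere.",
        "parameters": {
            "type": "object",
            "properties": {
                "operation": {
                    "type": "string",
                    "enum": ["read_range", "write_range", "set_formula"],
                    "description": "Which spreadsheet action to perform.",
                },
                "range": {
                    "type": "string",
                    "description": "A1-style range, e.g. 'B2:D10'.",
                },
                "values": {
                    "type": "array",
                    "items": {"type": "array", "items": {"type": "string"}},
                    "description": "Row-major cell values; required for write_range.",
                },
                "formula": {
                    "type": "string",
                    "description": "Formula text, e.g. '=SUM(B2:B10)'; required for set_formula.",
                },
            },
            "required": ["operation", "range"],
        },
    },
}
```

The per-field descriptions are where point 3 pays off: the model routes everything through one well-specified surface instead of guessing among a dozen near-duplicate tools.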
Since our launch 2 years ago, we've focused more on the "agents that code" part of our vision (so that everyone can make software) than on the "training models from scratch" part (because so many good open-source models have been released since then).
This is from our fundraising post 2 years ago:
> Our goal remains the same: to build practical AI agents that can accomplish larger goals and safely work for us in the real world. To do this, we train foundation models optimized for reasoning. Today, we apply our models to develop agents that we can find useful internally, starting with agents that code. Ultimately, we hope to release systems that enable anyone to build robust, custom AI agents that put the productive power of AI at everyone’s fingertips.
Banker (banker.so): An AI spreadsheet that excels at spreadsheet understanding (pun intended).
There are some AI spreadsheet products out there, mostly plugins, along with MS Copilot. However, my experience with them showed that they are bad at understanding spreadsheets.
The reason is that sheets are 2D data models, while LLMs are trained on 1D data (plain text), so translating a 2D data model into a format an LLM can consume is a big context engineering task.
I read and implemented some of the algorithms mentioned in the SpreadsheetLLM paper released by Microsoft. Ironic, isn't it?
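To give a flavor of what that 2D-to-1D translation involves (a toy sketch of my own, not the SpreadsheetLLM method itself): one common trick is to flatten the grid into one line per non-empty cell, keyed by A1-style address, so positions become explicit text instead of implicit layout:

```python
def a1(col: int, row: int) -> str:
    """0-based (col, row) -> A1-style address, e.g. (1, 0) -> 'B1'."""
    letters = ""
    col += 1
    while col:
        col, rem = divmod(col - 1, 26)
        letters = chr(ord("A") + rem) + letters
    return f"{letters}{row + 1}"

def serialize(grid: list) -> str:
    """Flatten a 2D grid into one line per non-empty cell.

    Skipping empty cells keeps sparse sheets cheap in tokens,
    and explicit addresses let the model reason about ranges.
    """
    lines = []
    for r, row in enumerate(grid):
        for c, val in enumerate(row):
            if val not in (None, ""):
                lines.append(f"{a1(c, r)}: {val}")
    return "\n".join(lines)
```

For example, `serialize([["Region", "Sales"], ["EU", 100]])` yields `A1: Region`, `B1: Sales`, `A2: EU`, `B2: 100` on separate lines. The real work (structural compression, merged cells, detecting table boundaries) is what the paper's algorithms handle.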
Got it to a nice working state. Give it a go - if you need more tokens, let me know!