More

thedevindevops · 2026-01-12T16:35:39 1768235739

Fab job, did spark one question though, Re: face labeling, are you trying to to get a model to identify specific individuals of a species?

dewey · 2026-01-12T17:16:50 1768238210

I've thought about it and it would make the whole project even cooler with actual stats of "birdhouse regulars" not just in aggregated form but I don't know if it's possible and if bird faces have enough unique features to differentiate them?

Right now I only use it so that my thumbnails of pictures from the camera are centered on the head in the UI as I couldn't find a pre-existing model that does it for animals. I'm thinking that maybe having this data set of a few hundred bird faces will allow me to train a small one in the future to do it more automatically. If not...I at least learned something new about building models!

thedevindevops · 2025-12-03T18:07:41 1764785261

You used AI to build a spambot?

reliefcrew · 2025-12-03T18:53:07 1764787987

That's actually the point of commercial AI, depending on your definition of spam of course :-)

thedevindevops · 2025-09-22T09:36:50 1758533810

How does what you've described solve the coffee/espresso problem? You can't query SQL such that records like 'espresso' return coffee?

brudgers · 2025-09-22T15:03:50 1758553430

Wouldn’t a beverage LLM would already “know” espresso is coffee?

muzani · 2025-09-23T11:17:16 1758626236

Yup, that's exactly what parent comment is saying.

Let's say your beverage LLM is there to recommend drinks. You once said "I hate espresso" or even something like "I don't take caffeine" at one point to the LLM.

Before recommending coffee, Beverage LLM might do a vector search for "coffee" and it would match up to these phrases. Then the LLM processes the message history to figure out whether this person likes or dislikes coffee.

But searching SQL for `LIKE '%coffee%'` won't match with any of these.

sdesol · 2025-09-24T16:09:10 1758730150

I haven't looked at the code, but it might do what I do with my chat app which is talked about at https://github.com/gitsense/chat/blob/main/packages/chat/wid...

The basic idea is, you don't search for a single term but rather you search for many. Depending on the instructions provided in the "Query Construction" stage, you may end up with a very high level search term like beverage or you may end up with terms like 'hot-drinks', 'code-drinks', etc.

Once you have the query, you can do a "Broad Search" which returns an overview of the message and from there the LLM can determine which messages it should analyze further if required.

Edit.

I should add, this search strategy will only work well if you have a post message process. For example, after every message save/upddate, you have the LLM generate an overview. These are my instructions for my tiny overview https://github.com/gitsense/chat/blob/main/data/analyze/tiny... that is focused on generating the purpose and keywords that can be used to help the LLM define search terms.

adastra22 · 2025-09-24T16:31:24 1758731484

That’s going to be incredibly fragile. You could fix it by giving the query term a bunch of different scores, e.g. its caffeine-ness, bitterness, etc. and then doing a likeness search across these many dimensions. That would be much less fragile.

And now you’ve reinvented vector embeddings.

sdesol · 2025-09-24T16:50:44 1758732644

You could instruct the LLM to classify messages with high level tags like for coffee, drinks, etc. always include beverage.

Given how fast interference has become and given current supported context window sizes for most SOTA models, I think summarizing and having the LLM decide what is relevant is not that fragile at all for most use cases. This is what I do with my analyzers which I talk about at https://github.com/gitsense/chat/blob/main/packages/chat/wid...

adastra22 · 2025-09-24T16:56:41 1758733001

Inference is not fast by any metric. It is many, MANY orders of magnitude slower than alternatives.

sdesol · 2025-09-24T17:11:50 1758733910

Honestly Gemini Flash Lite and models on Cerebras are extremely fast. I know what you are saying. If the goal is to get a lot of results where they may or may not be relevant, then yes, it is an order of a magnitude slower.

If you take into consideration the post analysis process, which is what inference is trying to solve, is it an order of a magnitude slower?

adastra22 · 2025-09-25T00:09:10 1758758950

More like 6-8 orders of magnitude slower. That’s a very nontrivial difference in performance!

sdesol · 2025-09-25T01:14:08 1758762848

How are you quantify the speed at which results are reviewed?

adastra22 · 2025-09-25T01:21:57 1758763317

It’s not speed, but cost to compute.

9rx · 2025-09-24T17:12:39 1758733959

It has become fast enough that another call isn't going to overwhelm your pipeline. If you needed this kind of functionality for performance computing perhaps it wouldn't be feasible, but it is being used to feed back into an LLM. The user will never notice.

Noumenon72 · 2025-09-24T16:49:11 1758732551

Your readmes did a great job at answering my question "why is this file called 1.md? What calls this?" when I searched for "1.md". (The answer is 1=user, 2=assistant, and it allows adding other analyzers with the same structure.)

sdesol · 2025-09-24T17:05:42 1758733542

I'm guessing you are referring to https://github.com/gitsense/chat/tree/main/data/analyze or https://github.com/gitsense/chat/tree/main/packages/chat/wid...

The number is actually the order in the chat so 1.md would be the first message, 2.md would be the second and so forth.

If you goto https://chat.gitsense.com and click on the "Load Personal Help Guide" you can see how it is used. Since I want you to be able to chat with the document, I will create a new chat tree and use the directory structure and the 1,2,3... markdown files to determine message order.

Noumenon72 · 2025-09-25T05:00:04 1758776404

https://github.com/gitsense/chat/blob/129210302ec06985bbd103... also says "put a 1.md here and the modular plugin structure will know to call it".

9rx · 2025-09-24T14:16:57 1758723417

If an LLM understands that coffee and expresso are both relevant, like the earlier comment suggests, why wouldn't it understand that it should search for something like `foo LIKE '%coffee%' OR foo LIKE '%expresso%'`?

In fact, this is what ChatGPT came up with:

   SELECT *
   FROM documents
   WHERE text ILIKE '%coffee%'
      OR text ILIKE '%espresso%'
      OR text ILIKE '%latte%'
      OR text ILIKE '%cappuccino%'
      OR text ILIKE '%americano%'
      OR text ILIKE '%mocha%'
      OR text ILIKE '%macchiato%';

(I gave it no direction as to the structure of the DB, but it shouldn't be terribly difficult to adapt to your exact schema)

jimbokun · 2025-09-24T14:23:13 1758723793

You are slowly approaching the vector solution.

There are an unlimited number of items to add to your “like” clauses. Vector search allows you to efficiently query for all of them at once.

9rx · 2025-09-24T14:30:16 1758724216

The handwavvy assertion was that relational database solutions[1] work better in practice.

[1] Despite also somehow supporting MongoDB...

mr_toad · 2025-09-24T14:42:28 1758724948

Implementations that use vector database do not use LLMs to generate queries against those databases. That would be incredibly expensive and slow (and yes there is a certain irony there).

Main advantages of a vector lookup are built-in fuzzy matching and the potential to keep a large amount of documentation in memory for low latency. I can’t see an RDMS being ideal for either. LLMs are slow enough already, adding a slow document lookup isn’t going to help.

9rx · 2025-09-24T15:03:18 1758726198

The main disadvantage of vector lookup, allegedly, is that it doesn't work as well in practice. Did you, uh, forget to read the thread?

cluckindan · 2025-09-24T22:01:51 1758751311

What does ”doesn’t work as well” mean here? From my experience, vector lookup via HNSW is fast and accurate enough for practical purposes.

cluckindan · 2025-09-25T11:34:15 1758800055

You could ask an LLM to provide categorizarions for nouns and verbs, and store those. For ”I don’t like cappuccino”, you’d get back ”self”, ”human”, etc. for ”I”; ”negation” etc. for ”don’t”; ”preference”, ”trait” etc. for ”like”; ”coffee”, ”hot”, ”drink”, ”beverage” etc. for ”cappuccino”.

It would become unwieldy real fast, though. Easier to get an embedding for the sentence.

muzani · 2025-09-24T23:17:49 1758755869

An actual use case I had for vector DBs was when users were using "credit card", "kredit kad", "kad kredit", "kartu" interchangeably.

If you're matching ("%card%" OR "%kad%"), you'll also match with things like virtual card, debit card, kadar (rates), akad (contract). The more languages you support, the more false hits you get.

Not to say SQL is wrong, but 30 year old technology works with 30 year old interfaces. It's not that people didn't imagine this back then. It's just that you end up with interfaces similar to dropdown filters and vending machines. If you're giving the user the flexibility of a LLM, you have to support the full range of inputs.

9rx · 2025-09-25T09:29:15 1758792555

> The more languages you support, the more false hits you get.

Certainly you're at the mercy of what the LLM constructs. But if understands that, say, "debt card" isn't applicable to "card" it can add a negation filter. Like has already been said, you're basically just reinventing a vector database in 'relational' (that somehow includes MongoDB...) approach anyway.

But what is significant is the claim that it works better. That is a bold claim that deserves a closer look, but I'm not sure how you've added to that closer look by arbitrarily sharing your experience? I guess I've missed what you're trying to say. Everyone and their brother knows how a vector database works by this point.

esafak · 2025-09-24T14:47:37 1758725257

The negation part is a query understanding problem. https://en.wikipedia.org/wiki/Query_understanding

brudgers · 2025-09-23T13:39:16 1758634756

I think the problem being addressed is

   A. Last month user fd8120113 said “I don’t like coffee”
   B. Today they are back for another beverage recommendation

SQL is the place to store the relevant fact about user fd8120113 so that you can retrieve it into the LLM prompt to make a new beverage recommendation, today.

It’s addressing the “how many fucking times do I fucking need to tell you I don’t like fucking coffee” problem, not the word salad problem.

The ggp comment is strawmanning.

shepardrtc · 2025-09-24T15:27:06 1758727626

Right but if the user hates espresso but loves black coffee, how do you properly store that in SQL?

"I hate espresso" "I love coffee"

What if the SQL query only retrieves the first one?

brudgers · 2025-09-24T15:41:29 1758728489

Good queries are hard. Database design is hard. System architecture is hard.

My comment described the problem.

The solution is left as an exercise for the reader.

Keep in mind that people change their minds, misspeak, and use words in peculiar ways.

thedevindevops · 2025-07-29T10:28:47 1753784927

Just to be clear, are you looking for a provider to process these requests on your behalf for 3rd parties or are you wanting to ask Claude to process your Erasure request to Calude's owner Anthropic, for example?

pera · 2025-07-29T16:36:53 1753807013

The latter. They currently provide a method for requests related to account information but not for the information contained in their models or training datasets.

kasey_junk · 2025-07-29T15:14:19 1753802059

I think they just want to submit requests the old fashioned way (email, web form, phone, etc) and have the provider comply.

thedevindevops · on Oct 10, 2024

Figure out how AI can help the blind and the deaf. Take up ceramics. Write a travel book. Tinker with low-cost robotics. Volunteer for some environmental restoration project - the riverbank ones seem interesting.

thedevindevops · on June 6, 2024

Most - if not all - https://en.m.wikipedia.org/wiki/Pruning_(artificial_neural_n... methods use a form of FEA.

thedevindevops · on May 21, 2024

We have several processes which involve the upload of multiple different images, which the user is walked through with baby steps, I.e. 'take this exact picture of the asset' -> take the next, etc. Etc.

We're currently working on an image classification model to automatically tag these images with what they are and just providing a visual sample gallery of 'take these pictures' with the sample image border turning green when that type of image is identified. We expect this will reduce a multi-step error prone process with a more streamline experience.

thedevindevops · on April 21, 2024

Can you make me some paperclips?

thedevindevops · on April 6, 2024

You probably don't want to hear this but that problem doesn’t require AI to solve. We've had asset tags and QR codes for decades and won't require re-training if a new equipment manufacturer comes into play.

quirk · on April 6, 2024

I very much want to hear this! The issue we're facing is that the maintenance staff is very undertrained on a specific subset of equipment. But that equipment is actually very critical. Also I was vague in the first post - in reality this is for hospitals within a system, not one single building.

When you say "won't require retraining if a new equipment manufacturer comes into play" I don't really follow.

Edit: I get it now - you mean by using QR we wouldn't need to retrain the image recognition AI. It's a good concern, although in this specific area there is not a ton of innovation or newcomers. Worth keeping in mind though.

thedevindevops · on March 28, 2024

The most modern idea seems to be a Dyson swarm, which is not solid and consists of solar panels and habitats orbiting at a comfortable distance. Power is relayed using microwave transmitters and receivers and conceptually a large portion would be fed into particle accelerators for the manufacture of Anti-Matter on industrial scale.