Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

First, love your work. Sadly, I don't think this path would work for me as for what I do, I need the selectors as part of the workflow.

Roughly, my end goal is to do a single or multi-shot with the following information HTML differential (could be selectors, xpaths, data regions, differentials of any of the above, etc...), code stacktrace, related code, and prompt.

For this example, let's consider that the flow involves the bot to login to a website. I have selectors for the `.username` and `.password` inputs and then a selector for the login button as `.login-btn`.

1. The site updates their page and changes up all their IDs, but keeps the same structure. 2. The site updates their page and changes up all their IDs, but changes the structure and the form is named something different and is somewhere else in the DOM. 3. many... many other examples.

Trying to figure out how to minimize the tokens, but keep the needed context to regenerate the selectors that are needed to maintain the workflow.



Yeah, I've been thinking a bit about that kind of problem too.

My hunch is you could do it with a much more complex setup involving OpenAI functions - by trying different things (like "list just input elements with their names and associated labels") in a loop with the LLM where it gets to keep asking follow-up questions of the DOM until it finds the right combination.


Would love to connect with you and discuss this further.


I’m also working on this issue. Open to brainstorming some approaches together


Lets! Would love to hear your thoughts.


z (at) 6bow.com


I’d like to join the discussion if I can, I’m actively working in this area.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: