Maybe an easier way to go is to record it in structured form up front (it’s already structured in the original application source anyway). That makes it much easier to record efficiently (so you can record more data) and also much easier to query efficiently. For example, you could invest time in machine learning on the logical data instead of having to mess around with text.
That’s what we do here anyway, it’s worked well for us:
https://github.com/Morgan-Stanley/hobbes/blob/master/README....