You may want to do some "AND NOT" matching on the description, e.g.
https://www.ebay.com/itm/235440904571 which has "title_cleaned": "MSI RTX 3070 Ti Ventus 3X 8G OC" but the description is "MSI RTX 3070 Ti VENTUS 3X 8G OC Graphics Card Support Bracket"
And, up to you, but doing money math with floats is a well known road-to-ruin: "price_with_shipping": 18.990000000000002,
Unfortunately I can't access descriptions without another API call.
Thanks for pointing out the floats issue. It'll probably be OK because I round to the nearest integer when displaying and only use the extra precision for sorting, but I might round to 2dp on the server side to avoid wasting bandwidth on all those zeroes - I presume floating point imprecision compresses poorly.
Thanks for the script Alex, it's a very usable site much more so than the topic one. The site navigation, and sane options instead of clicking 512mb, 64mb and 1mb ram was a welcome feature. What did you do to generate this site? I noticed the was another one yesterday with prices as well, is it a new scraper or framework?
I wish I could say it's some lovely architecture, but it's just some glued together python scripts that get the data from the ebay API, categorise the GPUs (I initially tried local LLMs, but hard coded rules work far better), then generate a static site using jinja2 templating. My frontend "framework" is simple vanilla HTML, CSS, and JS.
I suppose I'm living up to the username! I do know how to write decent programs and in other languages, but I'm just so much faster when bodging.
I'm thinking of making similar website for more things, in which case I'd clean up the architecture a bit, but my exams are in a few months and this hasn't been particularly successful so we'll see. I probably shouldn't be spending my weekends programming right now :)
Thank you for telling me about the behind the scenes creation of your site, I will show it to my friends. It is a very lightweight, easy to use and mobile friendly site. I hope you will post it in it's own topic and it gets more interest.
I submitted it a few hours ago to two upvotes - I think it might be because I didn't include the link directly, and apparently that downranks it in the algorithm. I've emailed dang to see if it's OK to repost it.