Oh I've been thinking about this a lot, from a different perspective.
Now that we can get highly detailed Gaussian splats to represent spaces in 3D, there has been great work on segmenting these datasets. There's a lot of momentum behind both of these ideas.
The technology is very nearly at the point where you could scan your home with your phone and get a detailed, segmented map of everything you own.
I believe I've also seen someone feed a video into Gemini and ask for a list of all the products in it. Some combination of these ideas, really.