this post was submitted on 17 Apr 2024
Technology
I worked in the object recognition and computer vision industry for almost a decade. That stuff works. Really well, actually.
But this checkout thing from Amazon always struck me as odd. It has the same problem as those "take a photo of your fridge and the system will tell you what you can cook" apps: they don't work well because items can be hidden in the back.
The biggest challenge in computer vision is occlusion, followed by resolution (with surveillance cameras, you're lucky to get 200x200 pixels for smaller objects). They would have had a really hard, if not impossible, time getting clear shots of everything.
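To make the occlusion point concrete, here's a toy sketch. It's purely my own illustration, not anything Amazon's system is known to do. It assumes axis-aligned bounding boxes in pixel coordinates, and the `visible_fraction` helper is a made-up name. It just computes how much of an item's box is still visible when something else (a hand, another product) overlaps it.

```python
def visible_fraction(obj, occluder):
    """Fraction of obj's box area not covered by occluder.

    Boxes are (x1, y1, x2, y2) in pixels, with x2 > x1 and y2 > y1.
    Hypothetical helper for illustration only.
    """
    ox1, oy1, ox2, oy2 = obj
    cx1, cy1, cx2, cy2 = occluder
    # Overlap width and height, clamped to zero when the boxes don't intersect.
    overlap_w = max(0, min(ox2, cx2) - max(ox1, cx1))
    overlap_h = max(0, min(oy2, cy2) - max(oy1, cy1))
    obj_area = (ox2 - ox1) * (oy2 - oy1)
    return 1.0 - (overlap_w * overlap_h) / obj_area


# A 200x200 px item with a hand covering its right half.
item = (0, 0, 200, 200)
hand = (100, 0, 300, 200)
print(visible_fraction(item, hand))  # 0.5
```

So an item that starts at 200x200 pixels and is half covered leaves a recognizer roughly 100x200 pixels to work with, and real occluders are rarely this tidy.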
My gut instinct tells me that they intended to build a huge training set over time using this real-world setup, hoping that the sheer amount of training data would help overcome at least some of the occlusion issues.