This year’s European Conference on Computer Vision (ECCV) showcases 1,360 papers – 15 of them from the Machine Learning Center at Georgia Tech (ML@GT.) The papers cover a vast array of topics including an idea on how to improve vision and language navigation and a new model that is learning to generate grounded visual captions… Continue reading ML@GT Makes a Strong Showing at Premier European Computer Vision Conference
Category: ECCV
New Algorithm Follows Human Intuition to Make Visual Captioning More Grounded
Annotating and labeling datasets for machine learning problems is an expensive and time-consuming process for computer vision and natural language scientists. However, a new deep learning approach is being used to decode, localize, and reconstruct image and video captions in seconds, making the machine-generated captions more reliable and trustworthy. To solve this problem, researchers at… Continue reading New Algorithm Follows Human Intuition to Make Visual Captioning More Grounded