ML@GT Makes a Strong Showing at Premier European Computer Vision Conference

This year’s European Conference on Computer Vision (ECCV) showcases 1,360 papers, 15 of them from the Machine Learning Center at Georgia Tech (ML@GT). The papers cover a vast array of topics, including an idea for improving vision-and-language navigation and a new model that learns to generate grounded visual captions…

New Algorithm Follows Human Intuition to Make Visual Captioning More Grounded

Annotating and labeling datasets for machine learning problems is an expensive and time-consuming process for computer vision and natural language scientists. However, a new deep learning approach is being used to decode, localize, and reconstruct image and video captions in seconds, making the machine-generated captions more reliable and trustworthy. To solve this problem, researchers at…