The following describes work by Ching-An Cheng and Byron Boots, which was awarded Best Paper at the Further details and proofs are available at The 21st International Conference on Artificial Intelligence and Statistics (AISTATS). The paper can be found here: https://arxiv.org/abs/1801.07292. Learning to make sequential decisions is a fundamental topic in designing automatic agents with artificial intelligence. … Continue reading Convergence of Value Aggregation for Imitation Learning
Video understanding tasks such as action recognition and caption generation are crucial for various real-world applications in surveillance, video retrieval, human behavior understanding, etc. In this work, we present a generic recurrent module to detect relationships and interactions between arbitrary object groups for fine-grained video understanding. Our work is applicable to various open domain video … Continue reading From Object Interactions to Fine-grained Video Understanding
A fundamental question in Natural Language Processing (NLP) is how to represent words. If we have a paragraph we want to translate, or a product review we want to determine whether is positive or negative, or a question we want to answer, ultimately the easiest building block to start from is the individual word. The … Continue reading Learning to Represent Words by how They’re Spelled
Georgia Tech's Research Horizons Magazine has done a very nice write-up of the ML@GT center, featuring many of our research projects. Machine learning has been around for decades, but the advent of big data and more powerful computers has increased its impact significantly — moving machine learning beyond pattern recognition and natural language processing into a … Continue reading The Minds of the New Machines | Research Horizons | Georgia Tech’s Research News
Everyday skills, such as making your bed or even pressing a doorbell, might seem trivial to us, but are actually quite complicated for today’s robots. Think about your performance the first time you tried a sport. Did you seek help from a peer or coach? Did you perform better after that? Most probably you answered yes. It … Continue reading Robust Skill Generalization Using Probabilistic Inference
Machine learning at Georgia Tech was in the spotlight recently as The Center for Machine Learning at Georgia Tech (ML@GT) hosted its spring seminar on Feb. 22 in the Klaus Advanced Computing Building.Billed as a “day of discussions around machine learning,” more than 200 students and faculty from across campus registered for the daylong event.“AI is … Continue reading Ethics Highlight ‘Day of Machine Learning Discussion’ | College of Computing
Embodied Question Answering is a new AI task where an agent is spawned at a random location in a 3D environment and asked a question ("What color is the car?"). In order to answer, the agent must first intelligently navigate to explore the environment, gather information through first-person (egocentric) vision, and then answer the question ("orange").