Learning Rigidity and Scene Flow Estimation

By Zhaoyang Lv We live in a three-dimensional (3D), dynamic world every day. Being able to perceive 3D high-resolution motion is a fundamental ability of our perception system, which enables us to perform versatile jobs. At the age when we are building intelligent robots, autonomous vehicles, and augmented reality toolkits, how can we also enable … Continue reading Learning Rigidity and Scene Flow Estimation

Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance

By: Prithvijit Chattopadhyay and Ramprasaath R. Selvaraju (Paper authors include Ramprasaath R. Selvaraju, Prithvijit Chattopadhyay, Mohamed Elhoseiny, Tilak Sharma, Dhruv Batra, Devi Parikh, and Stefan Lee) Deep Neural Networks have pushed the boundaries of standard image-classification tasks in the past few years, with performance on many challenging benchmarks reaching near human-level accuracies. One of the … Continue reading Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance

What is Graph R-CNN?

By Jianwei Yang and Jiasen Lu The following post breaks down Graph R-CNN for Scene Graph Generation, which will be presented at the European Conference on Computer Vision 2018 (ECCV). The conference takes place September 8th through 14th in Munich, Germany.  Visual scene understanding has traditionally focused on identifying objects in images -- learning to predict their … Continue reading What is Graph R-CNN?

SHIELD: Defending Deep Neural Networks from Adversarial Attacks

by Nilaksh Das, PhD student at Georgia Institute of Technology in the School of Computational Science and Engineering. Das is advised by Polo Chau. “SHIELD is a fast and practical approach to defend deep neural networks from adversarial attacks. This work proposes a multifaceted framework which incorporates compression, randomization, model-retraining, and ensembling to make computer vision models robust to adversarial … Continue reading SHIELD: Defending Deep Neural Networks from Adversarial Attacks

Learning to Cluster

“Can machines categorize new things by learning how to group similar things together?” The following describes work by Yen-Chang Hsu, Zhaoyang Lv, and Zsolt Kira, which will be presented at the 2018 International Conference on Learning Representations (ICLR) in Vancouver. Read the paper here. Clustering is the task of partitioning data into groups, so that … Continue reading Learning to Cluster

Convergence of Value Aggregation for Imitation Learning

The following describes work by Ching-An Cheng and Byron Boots, which was awarded Best Paper at the Further details and proofs are available at The 21st International Conference on Artificial Intelligence and Statistics (AISTATS). The paper can be found here: https://arxiv.org/abs/1801.07292. Learning to make sequential decisions is a fundamental topic in designing automatic agents with artificial intelligence. … Continue reading Convergence of Value Aggregation for Imitation Learning

From Object Interactions to Fine-grained Video Understanding

Video understanding tasks such as action recognition and caption generation are crucial for various real-world applications in surveillance, video retrieval, human behavior understanding, etc. In this work, we present a generic recurrent module to detect relationships and interactions between arbitrary object groups for fine-grained video understanding. Our work is applicable to various open domain video … Continue reading From Object Interactions to Fine-grained Video Understanding