Action & Activity Understanding

What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning

Chi-Hsi Kung, Frangil M Ramirez, Juhyung Ha, Yi-Hsuan Tsai, Yi-Ting Chen, David J. Crandall
ICCV 2025
[paper]

LoCoNet: Long-Short Context Network for Active Speaker Detection

Xizi Wang, Feng Cheng, Gedas Bertasius
CVPR 2024
[paper] [video]

Fusing Personal and Environmental Cues for Identification and Segmentation of First-Person Camera Wearers in Third-Person Views

Ziwei Zhao, Yuchen Wang, Chuhua Wang
CVPR 2024
[paper]

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, et al.
CVPR 2024
[paper] [website] [video]

A Survey on Deep Learning Techniques for Video Segmentation

Tianfei Zhou, Fatih Porikli, David J. Crandall, Luc Van Gool, Wenguan Wang
PAMI 2023
[paper]

Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds

Junbo Yin, Jianbing Shen, Xin Gao, David Crandall, Ruigang Yang
PAMI 2023
[paper]

VindLU: A Recipe for Effective Video-and-Language Pretraining

Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius
CVPR 2023
[paper] [video]

Zero-Shot Video Object Segmentation with Co-Attention Siamese Networks

Xiankai Lu, Wenguan Wang, Jianbing Shen, David Crandall, Jiebo Luo
PAMI 2022

Action Recognition based on Cross-Situational Action-object Statistics

Satoshi Tsutsui, Xizi Wang, Guangyuan Weng, Yayun Zhang, David Crandall, Chen Yu
ICDL 2022
[paper]

Can Gaze Inform Egocentric Action Recognition?

Zehua Zhang, David Crandall, Michael Proulx, Sachin Talathi, Abhishek Sharma
ETRA 2022
[paper]

Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning

Zehua Zhang, David Crandall
WACV 2022
[paper] [video]

Learning Video Object Segmentation from Unlabeled Videos

Xiankai Lu, Wenguan Wang, Jianbing Shen, Yu-Wing Tai, David Crandall, Steven Hoi
CVPR 2020
[paper]

A Self Validation Network for Object-Level Human Attention Estimation

Zehua Zhang, Chen Yu, David Crandall
NeurIPS 2019
[paper] [website]

Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks

Wenguan Wang, Xiankai Lu, Jianbing Shen, David Crandall, Ling Shao
ICCV 2019
[paper]

Unsupervised Traffic Accident Detection in First-Person Videos

Yu Yao, Mingze Xu, Yuchen Wang, David Crandall, Ella Atkins
IROS 2019
[paper]

Observing Pianist Accuracy and Form with Computer Vision

Jangwon Lee, Bardia Doosti, Yupeng Gu, David Cartledge, David J. Crandall, Christopher Raphael
WACV 2019
[paper]

Joint Person Segmentation and Identification in Synchronized First- and Third-person Videos

Mingze Xu, Chenyou Fan, Yuchen Wang, Michael Ryoo, David Crandall
ECCV 2018
[paper] [website]

From Coarse Attention to Fine-Grained Gaze: A Two-stage 3D Fully Convolutional Network for Predicting Eye Gaze in First Person Video

Zehua Zhang, Sven Bambach, David Crandall, Chen Yu
BMVC 2018
[paper] [website]

Fully-Coupled Two-Stream Spatiotemporal Networks for Extremely Low Resolution Action Recognition

Mingze Xu, Aidean Sharghi, Xin Chen, David Crandall
WACV 2018
[paper]

Identifying First-Person Camera Wearers in Third-Person Videos

Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David Crandall, Michael Ryoo
CVPR 2017
[paper] [website]

Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions

Sven Bambach, Stefan Lee, David Crandall, Chen Yu
ICCV 2015
[paper] [website]

Viewpoint Integration for Hand-Based Recognition of Social Interactions from a First-Person View

Sven Bambach, David Crandall, Chen Yu
ICMI 2015
[paper]

A Framework for Reliable Text-Based Indexing of Video

Rangachar Kasturi, Sameer Antani, David Crandall
Symposium on Document Image Understanding Technology 2001
[paper]

Robust Extraction of Text in Video

Sameer Antani, David Crandall, Rangachar Kasturi
ICPR 2000
[paper]

Evaluation of Methods for Detection and Localization of Text from Video

Sameer Antani, David Crandall, Anand Narasimhamurthy, Vladimir Y. Mariano, Rangachar Kasturi
IAPR Workshop on Document Analysis Systems 2000
[paper]
