What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning. Chi-Hsi Kung, Frangil M Ramirez, Juhyung Ha, Yi-Hsuan Tsai, Yi-Ting Chen, David J. Crandall. ICCV 2025. [paper]
LoCoNet: Long-Short Context Network for Active Speaker Detection. Xizi Wang, Feng Cheng, Gedas Bertasius. CVPR 2024. [paper] [video]
Fusing Personal and Environmental Cues for Identification and Segmentation of First-Person Camera Wearers in Third-Person. Ziwei Zhao, Yuchen Wang, Chuhua Wang. CVPR 2024. [paper]
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives. Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, et al. CVPR 2024. [paper] [website] [video]
A Survey on Deep Learning Techniques for Video Segmentation. Tianfei Zhou, Fatih Porikli, David J. Crandall, Luc Van Gool, Wenguan Wang. PAMI 2023. [paper]
Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds. Junbo Yin, Jianbing Shen, Xin Gao, David Crandall, Ruigang Yang. PAMI 2023. [paper]
VindLU: A recipe for Effective Video-and-Language Pretraining. Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius. CVPR 2023. [paper] [video]
Zero-Shot Video Object Segmentation with Co-Attention Siamese Networks. Xiankai Lu, Wenguan Wang, Jianbing Shen, David Crandall, Jiebo Luo. PAMI 2022.
Action Recognition based on Cross-Situational Action-object Statistics. Satoshi Tsutsui, Xizi Wang, Guangyuan Weng, Yayun Zhang, David Crandall, Chen Yu. ICDL 2022. [paper]
Can Gaze Inform Egocentric Action Recognition? Zehua Zhang, David Crandall, Michael Proulx, Sachin Talathi, Abhishek Sharma. ETRA 2022. [paper]
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning. Zehua Zhang, David Crandall. WACV 2022. [paper] [video]
Learning Video Object Segmentation from Unlabeled Videos. Xiankai Lu, Wenguan Wang, Jianbing Shen, Yu-Wing Tai, David Crandall, Steven Hoi. CVPR 2020. [paper]
A Self Validation Network for Object-Level Human Attention Estimation. Zehua Zhang, Chen Yu, David Crandall. NeurIPS 2019. [paper] [website]
Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks. Wenguan Wang, Xiankai Lu, Jianbing Shen, David Crandall, Ling Shao. ICCV 2019. [paper]
Unsupervised Traffic Accident Detection in First-Person Videos. Yu Yao, Mingze Xu, Yuchen Wang, David Crandall, Ella Atkins. IROS 2019. [paper]
Observing Pianist Accuracy and Form with Computer Vision. Jangwon Lee, Bardia Doosti, Yupeng Gu, David Cartledge, David J. Crandall, Christopher Raphael. WACV 2019. [paper]
Joint Person Segmentation and Identification in Synchronized First- and Third-person Videos. Mingze Xu, Chenyou Fan, Yuchen Wang, Michael Ryoo, David Crandall. ECCV 2018. [paper] [website]
From Coarse Attention to Fine-Grained Gaze: A Two-stage 3D Fully Convolutional Network for Predicting Eye Gaze in First Person Video. Zehua Zhang, Sven Bambach, David Crandall, Chen Yu. BMVC 2018. [paper] [website]
Fully-Coupled Two-Stream Spatiotemporal Networks for Extremely Low Resolution Action Recognition. Mingze Xu, Aidean Sharghi, Xin Chen, David Crandall. WACV 2018. [paper]
Identifying first-person camera wearers in third-person videos. Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David Crandall, Michael Ryoo. CVPR 2017. [paper] [website]
Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions. Sven Bambach, Stefan Lee, David Crandall, Chen Yu. ICCV 2015. [paper] [website]
Viewpoint Integration for Hand-Based Recognition of Social Interactions from a First-Person View. Sven Bambach, David Crandall, Chen Yu. ICMI 2015. [paper]
A Framework for Reliable Text-Based Indexing of Video. Rangachar Kasturi, Sameer Antani, David Crandall. Symposium on Document Image Understanding Technology 2001. [paper]
Evaluation of Methods for Detection and Localization of Text from Video. Sameer Antani, David Crandall, Anand Narasimamurthy, Vladimir Y. Mariano, Rangachar Kasturi. IAPR Workshop on Document Analysis Systems 2000. [paper]