Generalization to Video Capsules - From Convolutional to Video Capsule Networks
University of Central Florida via YouTube
Overview
Syllabus
Intro
Computational Cost of Capsule Voting
Conventional Convolutional Layers
Convolutional Capsule Layers
Capsule Pooling
Video Capsule Networks
Video Action Detection Networks
VideoCapsuleNet Architecture
Coordinate Addition
Capsule Masking
VideoCapsuleNet Training
Action Localization Accuracy
Qualitative Results - Entire Videos
Synthetic Dataset Experiments
Summary
Capsules in Multiple Modalities
Combining Video and Text
Overall Approach
Multi-modal Capsule Routing Algorithm
Full Architecture
Sentence Encoder
Merging Modalities and Masking
Upsampling Network
Quantitative Results - A2D Dataset
Semi-Supervised Video Object Segmentation
VOS using Capsules
Attention Routing
Video Encoder
Frame Encoder with Memory Module
Conv Capsule Layer and Decoder Network
Objective Function
Quantitative Results - Speed Analysis
Qualitative Results - Single Object
Qualitative Results - Multiple Objects
Effect of Memory Module
Effect of the Zooming Module
Effect of Zooming Module
Taught by
UCF CRCV