Temporal Convolutional Networks for Action Segmentation and Detection
Citations
2,776 citations
1,061 citations
Cites methods from "Temporal Convolutional Networks for..."
...Motivated by the temporal convolutional network (TCN) [23], [24], [25], we propose a fully convolutional separation module that consists of stacked 1-D dilated convolutional blocks, as shown in Figure 1 B....
...a replacement for RNNs in various tasks [23], [24], [25]....
...This method is motivated by the success of temporal convolutional network (TCN) models [23], [24], [25], which allow parallel processing on consecutive...
647 citations
471 citations
Cites background or methods from "Temporal Convolutional Networks for..."
...A D-dimensional feature vector, whether it is a deep feature from a spatial CNN such as fc7 activation of AlexNet [19] or a set of kinematic features [20], is extracted per each video frame....
...In this section, we provide a brief overview of the structure of a TCN as provided in the original paper [20]....
...In this light, we propose Temporal Convolutional Neural Networks (TCN) [20] applied to 3D Human Action Recognition....
...Moreover, by model design based on temporal convolutions [20] and residual connections [12], we can begin to directly interpret what our model parameters and features represent....
385 citations
Additional excerpts
...Hand Tracking: U-Net [29], 10x, 1x; Image Model-1: GoogLeNet [30], 100x, 1x; Image Model-2: ShuffleNet [27], 10x, 2x; Pose Estimation: Mask-RCNN [28], 100x, 4x; Segmentation: TCN [31], 1x, 1....
References
23,486 citations
7,091 citations
"Temporal Convolutional Networks for..." refers background in this paper
...Large-scale Recognition: There has been substantial work on spatiotemporal models for large scale video classification and detection [31, 11, 12, 27, 33, 23, 19]....
5,566 citations
"Temporal Convolutional Networks for..." refers methods in this paper
...The Encoder-Decoder TCN is most similar to SegNet [2] whereas the Dilated TCN is most similar to the Multi-Scale Context model [38]....
4,876 citations
"Temporal Convolutional Networks for..." refers background in this paper
...Large-scale Recognition: There has been substantial work on spatiotemporal models for large scale video classification and detection [31, 11, 12, 27, 33, 23, 19]....
3,487 citations
"Temporal Convolutional Networks for..." refers methods in this paper
...Rohrbach et al. [26] used Dense Trajectories [37] and human pose features on the MPII Cooking dataset....