Graduate student: 登蘇馬
Raden Hadapiningsyah Kusumoseniarto
Thesis title: 用於動作識別的兩流3D卷積注意力網絡
Two-Stream 3D Convolution Attentional Network for Action Recognition
Advisor: 項天瑞
Tien-Ruey Hsiang
Oral defense committee: 鄧惟中
Wei-Chung Teng
Hsing-Kuo Pao
Degree: Master
Department: College of Electrical Engineering and Computer Science - Department of Computer Science and Information Engineering
Year of publication: 2020
Academic year of graduation: 108
Language: English
Number of pages: 39
Chinese keywords: 3D convolution, attention module, action recognition
English keywords: 3D convolution, attention module, action recognition
We propose a new method for recognizing actions in videos. It uses a two-stream 3D convolution network to capture rich spatial and temporal information, then processes the resulting features with an attention module to capture long- and short-term dependencies. By taking advantage of 3D convolutions, the network obtains not only spatial information but also the movement in the videos, which is captured as temporal information. We consider long-term temporal dependencies because they are important for identifying actions in videos. The bidirectional self-attention network uses forward and backward masks to encode temporal order information, and applies attention to the sequence of 3D convolution features. The experimental results indicate that the proposed method achieves performance comparable to state-of-the-art work on the HMDB-51 dataset while using a less complex process.
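The forward/backward masking idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. It is not the thesis implementation: the function names, the mask convention (each position may attend to itself and to earlier positions in the forward direction, or to itself and later positions in the backward direction), the use of identity query/key/value projections, and the concatenation of the two directions are all assumptions made for illustration. The input `x` stands in for a sequence of per-clip 3D convolution features.

```python
import numpy as np

def directional_mask(n, direction):
    """Additive mask: 0 where attention is allowed, -inf where it is blocked.

    Assumed convention: "forward" lets position i attend to j <= i,
    "backward" lets position i attend to j >= i.
    """
    idx = np.arange(n)
    if direction == "forward":
        allowed = idx[None, :] <= idx[:, None]
    elif direction == "backward":
        allowed = idx[None, :] >= idx[:, None]
    else:
        raise ValueError("direction must be 'forward' or 'backward'")
    return np.where(allowed, 0.0, -np.inf)

def masked_self_attention(x, mask):
    """Scaled dot-product self-attention without learned projections.

    x has shape (n, d): n time steps, d features per step.
    Returns the attended features and the attention weights.
    """
    d = x.shape[-1]
    scores = x @ x.T / np.sqrt(d) + mask
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)                      # exp(-inf) == 0 for blocked pairs
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ x, weights

def bidirectional_self_attention(x):
    """Run forward- and backward-masked attention and concatenate the results."""
    n = x.shape[0]
    fwd, _ = masked_self_attention(x, directional_mask(n, "forward"))
    bwd, _ = masked_self_attention(x, directional_mask(n, "backward"))
    return np.concatenate([fwd, bwd], axis=-1)

# Hypothetical input: 8 time steps of 16-dimensional features.
features = np.random.default_rng(0).normal(size=(8, 16))
encoded = bidirectional_self_attention(features)  # shape (8, 32)
```

Because each mask blocks one temporal direction, the two attention passes see the sequence in opposite orders, which is one way order information can be injected into an otherwise order-agnostic attention operation.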

