1. Carreira, J., & Zisserman, A. (2017a). Quo vadis, action recognition? a new model and the kinetics dataset. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6299–6308).
2. Carreira, J., & Zisserman, A. (2017b). Quo vadis, action recognition? a new model and the kinetics dataset. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6299–6308).
3. Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks;Chattopadhay,2018
4. Diba, A., Fayyaz, M., Sharma, V., Hossein Karami, A., Mahdi Arzani, M., Yousefzadeh, R., et al. (2018). Temporal 3d convnets using temporal transition layer. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops (pp. 1117–1121).
5. Feichtenhofer, C. (2020). X3d: Expanding architectures for efficient video recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 203–213).