- Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning, NeurIPS, 2024. [PDF]
- Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer, ACM Multimedia, 2024. [PDF]
- Semi-Supervised Multimodal Emotion Recognition with Expression MAE, ACM Multimedia, 2023. [PDF]
- Suppressing Mislabeled Data via Grouping and Self-Attention. ECCV, 2020. [PDF] [CODE]
- Suppressing Uncertainties for Large-Scale Facial Expression Recognition. CVPR, 2020. [PDF] [CODE]
- Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition. IEEE Transactions on Image Processing, 2020. [PDF] [CODE]
- Mutual Component Convolutional Neural Networks for Heterogeneous Face Recognition. IEEE Transactions on Image Processing, 2019. [PDF]
- DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction. ICCV, 2019. [PDF] [CODE]
- Residual Compensation Networks for Heterogeneous Face Recognition. AAAI, 2019. [PDF] [CODE]
- Frankenstein: Learning Deep Face Representations using Small Data. IEEE Transactions on Image Processing, 2018. [PDF]
- Multi-region two-stream R-CNN for action detection. ECCV, 2016. [PDF] [CODE]
- Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice. Computer Vision and Image Understanding (CVIU), 2016. [PDF]
- Action Recognition with Stacked Fisher Vectors. ECCV, 2014. [PDF]
Full list in Google Scholar
[Representative Publications]
- Facial Action Units as A Bridge of Joint Dataset Training for Facial Expression Recognition, IEEE Transactions on Multimedia, 2024. [PDF]
- Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning, NeurIPS, 2024. [PDF]
- Invisible Gas Detection: An RGB-Thermal Cross Attention Network and A New Benchmark, Computer Vision and Image Understanding (CVIU), 2024. [PDF]
- Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer, ACM Multimedia, 2024. [PDF]
- SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition, ACM Multimedia, 2024. [PDF]
- Graph Attentive Dual Ensemble learning for Unsupervised Domain Adaptation on point clouds, Pattern Recognition, 2024. [PDF]
- NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling, ICLR, 2024. [PDF]
- MIPS at SemEval-2024 Task 3: Conversational Emotion-Cause Pair Analysis with Multimodal LLM, NAACL, 2024. [PDF]
- A Challenge Dataset and Effective Models for Conversational Stance Detection, COLING, 2024. [PDF]
- Semi-Supervised Multimodal Emotion Recognition with Expression MAE, ACM Multimedia, 2023. [PDF]
- Real-time UAV Localization and Tracking in Multi-Weather Conditions using Multispectral Image Analysis, IEEE International Conference on Real-time Computing and Robotics (RCAR), 2023. [PDF]
- Cascaded Vehicle Matching and Short-Term Spatial-Temporal Network for Smoky Vehicle Detection. Applied Sciences, 2023, 13(8), 4841. [PDF]
- DB-Net: Detecting Vehicle Smoke with Deep Block Networks. Applied Sciences, 2023, 13(8), 4941. [PDF]
- Rail Detection: An Efficient Row-based Network and A New Benchmark. ACM Multimedia, 2022. [PDF]
- Video Frame Interpolation Based on Deformable Kernel Region. IJCAI, 2022. [PDF]
- AU-Guided Unsupervised Domain-Adaptive Facial Expression Recognition. Applied Sciences. 2022, 12(9), 4366. [PDF]
- An Efficient Training Approach for Very Large Scale Face Recognition. CVPR, 2022. [PDF]
- Unsupervised person re-identification with multi-label learning guided self-paced clustering. Pattern Recognition, 2022. (IF: 7.74) [PDF]
- A Comprehensive Study on Temporal Modeling for Online Action Detection. Complex & Intelligent Systems, 2021. (IF: 4.927) [PDF]
- Sequential Interactive Biased Network for Context-Aware Emotion Recognition. IJCB, 2021. [PDF]
- Detecting Human-Object Interaction via Fabricated Compositional Learning. CVPR, 2021. [PDF]
- Affordance Transfer Learning for Human-Object Interaction Exploration. CVPR, 2021. [PDF]
- TTPP: Temporal Transformer with Progressive Prediction for Efficient Action Anticipation. Neurocomputing, 2021. [PDF]
- Learning Category Correlations for Multi-label Image Recognition with Graph Networks. Pattern Recognition Letters, 2020. [BibTeX][PDF]
- Finding Hard Faces with better Proposals and Classifier. Machine Vision and Applications, 2020. [BibTeX][PDF]
- Product Image Recognition with Guidance Learning and Noisy Supervision. Computer Vision and Image Understanding (CVIU), 2020. [BibTeX][PDF]
- Cascade Multi-Head Attention Networks for Action Recognition. Computer Vision and Image Understanding (CVIU), 2020. [BibTeX][PDF]
- Suppressing Mislabeled Data via Grouping and Self-Attention. ECCV, 2020. [PDF] [CODE]
- Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition. ECCV, 2020. [PDF]
- Visual Compositional Learning for Human Object Interaction Detection. ECCV, 2020. [PDF] [CODE]
- Suppressing Uncertainties for Large-Scale Facial Expression Recognition. CVPR, 2020. [PDF] [CODE]
- Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition. IEEE Transactions on Image Processing, 2020. [PDF] [CODE]
- Learning Discriminative Representation for Facial Expression Recognition from Uncertainties. ICIP, 2020. [PDF]
- Multiple Transfer Learning and Multi-label Balanced Training Strategies for Facial AU Detection In the Wild. CVPRW, 2020. [BibTeX][PDF]
- Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition. International Conference on Multimodal Interaction (ICMI’19) [PDF]
- Exploring Regularizations with Face, Body and Image Cues for Group Cohesion Prediction. International Conference on Multimodal Interaction (ICMI’19) [PDF]
- Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression. International Conference on Multimodal Interaction (ICMI’19) [PDF]
- Mutual Component Convolutional Neural Networks for Heterogeneous Face Recognition. IEEE Transactions on Image Processing, 2019. [PDF]
- DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction. ICCV, 2019. [PDF] [CODE]
- Residual Compensation Networks for Heterogeneous Face Recognition. AAAI, 2019. [PDF] [CODE]
- AnoPCN: Video Anomaly Detection via Deep Predictive Coding Network. ACM Multimedia, 2019. [PDF]
- Frame Attention Networks for Facial Expression Recognition in Videos. ICIP, 2019. [PDF] [CODE]
- Visual-Textual Sentiment Analysis in Product Reviews. ICIP, 2019. [PDF]
- Face Detection, Alignment, Quality Assessment and Attribute Analysis with Multi-Task Hybrid Convolutional Neural Networks. ZTE Communications, 2019. [PDF]
- Recurrent Metric Networks and Batch Multiple Hypothesis for Multi-Object Tracking. IEEE Access, 2019. [PDF]
- Deep Recurrent Multi-instance Learning with Spatio-temporal Features for Engagement Intensity Prediction. International Conference on Multimodal Interaction (ICMI’18) [PDF]
- Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues. International Conference on Multimodal Interaction (ICMI’18) [PDF]
- Frankenstein: Learning Deep Face Representations using Small Data. IEEE Transactions on Image Processing, 2018. [PDF]
- Multi-region two-stream R-CNN for action detection. ECCV, 2016. [PDF] [CODE]
- Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice. Computer Vision and Image Understanding (CVIU), 2016. [PDF] [CODE]
- Action Recognition with Stacked Fisher Vectors. ECCV, 2014. [PDF]
- Boosting VLAD with Supervised Dictionary Learning and High-Order Statistics. ECCV, 2014. [PDF]
- Multi-View Super Vector for Action Recognition. CVPR, 2014. [PDF]
BibTeX:
@article{li2020learning, title={Learning label correlations for multi-label image recognition with graph networks}, author={Li, Qing and Peng, Xiaojiang and Qiao, Yu and Peng, Qiang}, journal={Pattern Recognition Letters}, volume={138}, pages={378--384}, year={2020}, publisher={Elsevier} }
BibTeX:
@article{zeng2020finding, title={Finding hard faces with better proposals and classifier}, author={Zeng, Xiaoxing and Peng, Xiaojiang and Wang, Yali and Qiao, Yu}, journal={Machine Vision and Applications}, volume={31}, number={7}, pages={1--15}, year={2020}, publisher={Springer} }
BibTeX:
@article{li2020product, title={Product image recognition with guidance learning and noisy supervision}, author={Li, Qing and Peng, Xiaojiang and Cao, Liangliang and Du, Wenbin and Xing, Hao and Qiao, Yu and Peng, Qiang}, journal={Computer Vision and Image Understanding}, pages={102963}, year={2020}, publisher={Elsevier} }
BibTeX:
@article{wang2020cascade, title={Cascade multi-head attention networks for action recognition}, author={Wang, Jiaze and Peng, Xiaojiang and Qiao, Yu}, journal={Computer Vision and Image Understanding}, volume={192}, pages={102898}, year={2020}, publisher={Elsevier} }
BibTeX:
@INPROCEEDINGS{9150797, author={S. {Ji} and K. {Wang} and X. {Peng} and J. {Yang} and Z. {Zeng} and Y. {Qiao}}, booktitle={CVPRW}, title={Multiple Transfer Learning and Multi-label Balanced Training Strategies for Facial AU Detection In the Wild}, year={2020}, volume={}, number={}, pages={1657-1661}, doi={10.1109/CVPRW50498.2020.00215}}