I am Zhaopeng Qiu (邱昭鹏), a Senior Deep Learning Solutions Architect in NVIDIA. Before joining NVIDIA, I worked as a Senior Researcher at Career Science Lab of BOSS Zhipin and Jarvis Lab of Tencent. I obtained Master in Computer Software and Theory from Peking University in 2018, under supervision of Prof. Yasha Wang. Before that, I obtained Bachelor in Software Engineering from Beihang University (BUAA) in 2014. My research works are focused on recommendation, NLP, medical data mining, and LLM post-training infrastructure.
🔥 News
- 2026.01: Our FP8-RL technical report is released, a practical FP8 inference stack for LLM reinforcement learning.
- 2025.05: 🎉 One tutorial is accepted by KDD 2025.
- 2025.01: 🎉 One paper is accepted by Frontiers of Computer Science.
- 2024.10: 🎉 Our LLM4Rec survey is accepted by World Wide Web journal.
- 2024.04: 🎉 One paper are accepted by JAMIA (IF=6.4).
- 2024.04: 🎉 One paper are accepted by TKDE.
- 2024.03: 🎉 One paper are accepted by TOIS.
- 2024.01: 🎉 Three papers are accepted by WWW 2024.
📝 Publications
* → Equal contribution; The intern under my supervision
KDD 2025Practical Guidance and Tutorial on Incentivizing Reasoning in LLMs using Distillation and Reinforcement Learning, Zhaopeng Qiu, Jingqi Zhang, Shuang Yu, Shuai Zhang, Junjie LaiFCSExploiting Large Language Model with Reinforcement Learning for Generative Job Recommendations, Zhi Zheng*, Zhaopeng Qiu*, Chen Zhu, Xiao Hu, Likang Wu, Yang Song, Hengshu Zhu, Hui XiongJAMIALarge language models leverage external knowledge to extend clinical insight beyond language boundaries Jiageng Wu, Xian Wu, Zhaopeng Qiu, Minghui Li, Shixu Lin, Yingying Zhang, Yefeng Zheng, Changzheng Yuan, Jie YangTKDEBilateral Multi-Behavior Modeling for Reciprocal Recommendation in Online Recruitment Zhi Zheng, Xiao Hu, Zhaopeng Qiu, Yuan Cheng, Shanshan Gao, Yang Song, Hengshu Zhu, Hui XiongTOISDistributional Fairness-aware Recommendation, Hao Yang, Xian Wu, Zhaopeng Qiu, Yefeng Zheng, Xu ChenWWWJA Survey on Large Language Models for Recommendation, Likang Wu*, Zhi Zheng*, Zhaopeng Qiu*, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong ChenWWW 2024Harnessing Large Language Models for Text-Rich Sequential Recommendation, Zhi Zheng, Wen Shuo Chao, Zhaopeng Qiu, Hengshu Zhu, Hui XiongWWW 2024Causally Debiased Time-aware Recommendation, Lei Wang, Chen Ma, Xian Wu, Zhaopeng Qiu, Yefeng Zheng, Xu ChenWWW 2024GraphLeak: Patient Record Leakage through Gradients with Knowledge Graph , Xi Zhang, Weifan Guan, Jiahao Lu, Zhaopeng Qiu, Jian Cheng, Xian Wu and Yefeng ZhengAAAI 2024A Cross-View Hierarchical Graph Learning Hypernetwork for Skill Demand-Supply Joint Prediction, Wenshuo Zhao, Zhaopeng Qiu, Likang Wu, Zhuoning Guo, Zhi Zheng, Hengshu Zhu, Hao LiuAAAI 2024Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations, Likang Wu, Zhaopeng Qiu, Zhi Zheng, Hengshu Zhu, Enhong ChenCIKM 2023REST: Drug-Drug Interaction Prediction via Reinforced Student-Teacher Curriculum Learning, Xinhang Li, Zhaopeng Qiu, Xiangyu Zhao, Yong Zhang, Chunxiao Xing, Xian WuSIGIR 2023Distributionally Robust Sequential Recommnedation, Rui Zhou, Xian Wu, Zhaopeng Qiu, Yefeng Zheng and Xu ChenTOISConditional Cross-Platform User Engagement Prediction, Xinhang Li, Zhaopeng Qiu, Jiacheng Jiang, Yong Zhang, Chunxiao Xing and Xian WuCOLING 2022DeltaNet: Conditional Medical Report Generation for COVID-19 Diagnosis, Xian Wu, Shuxin Yang, Zhaopeng Qiu, Shen Ge, Yangtian Yan, Xingwang Wu, Yefeng Zheng, S. Kevin Zhou and Li XiaoCIKM 2022Gromov-Wasserstein Guided Representation Learning for Cross-Domain Recommendation, Xinhang Li, Zhaopeng Qiu, Xiangyu Zhao, Zihao Wang, Yong Zhang, Chunxiao Xing and Xian WuKDD 2022DDR: Dialogue Based Doctor Recommendation for Online Medical Service, Zhi Zheng, Zhaopeng Qiu, Hui Xiong, Xian Wu, Tong Xu, Enhong Chen, Xiangyu ZhaoNAACL 2022(Findings) Denoising Neural Network for News Recommendation with Positive and Negative Implicit Feedback, Yunfan Hu, Zhaopeng Qiu, Xian WuTKDDGraph Neural News Recommendation with User Existing and Potential Interest Modeling, Zhaopeng Qiu, Yunfan Hu, Xian WuWWW 2022CBR: Context Bias aware Recommendation for Debiasing User Modeling and Click Prediction, Zhi Zheng, Zhaopeng Qiu, Tong Xu, Xian Wu, Xiangyu Zhao, Enhong Chen, Hui XiongWWW 2022Conditional Generation Net for Medication Recommendation, Rui Wu, Zhaopeng Qiu, Jiacheng Jiang, Guilin Qi, Xian WuAAAI 2021U-BERT: Pre-training User Representations for Improved Recommendation, Zhaopeng Qiu, Xian Wu, Jingyue Gao, Wei FanCOLING 2020Automatic Distractor Generation for Multiple Choice Questions in Standard Tests, Zhaopeng Qiu, Xian Wu, Wei FanCIKM 2019Question Difficulty Prediction for Multiple Choice Problems in Medical Exams, Zhaopeng Qiu, Xian Wu, Wei FanTMC 2018HyTasker: Hybrid Task Allocation in Mobile Crowd Sensing, Jiangtao Wang, Feng Wang, Yasha Wang, Leye Wang, Zhaopeng Qiu, Daqing Zhang, Bin Guo, Q. Lv
Pre-Prints
- FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning, Zhaopeng Qiu, Shuang Yu, Jingqi Zhang, Shuai Zhang, Xue Huang, Jingyi Yang, Junjie Lai
🎖 Honors and Awards
- 2019.10 Second prize in ICDM 2019 Knowledge Graph Contest.
📖 Educations
- 2015.09 - 2018.07, Master, Peking University.
- 2010.09 - 2014.06, Bachelor, Beihang University (BUAA).
💻 Work Experience
- 2023.09 - present, NVIDIA, Senior Deep Learning Solutions Architect @ AI/ML specialist team.
- 2023.04 - 2023.08, BOSS Zhipin, Senior Researcher @ Career Science Lab.
- 2018.07 - 2023.04, Tencent, Senior Researcher @ Jarvis Lab.