Selected Publications

[16] Chen, Z., Xiang, Z., Xiao, C., Song, D., & Li, B.. (2024). AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases, in Proceeding of the Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, Canada, Dec 9. [arXiv preprint], [project page], [github repo].

[8] Chen, Z., Zhao, Z., Luo, H., Yao, H., Li, B., & Zhou, J. (2024). HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding, in Proceeding of the Forty-first International Conference on Machine Learning (ICML 2024), Vienna, Austria, July 2024. [arXiv preprint], [project page], [github repo].

[7] Chen, Z., Zhao, Z., Zhu, Z., Zhang, R., Li, X., Raj, B., & Yao, H. (2024). AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition, in Proceeding of 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024), Mexico City, Mexico, Jun 2024. [arXiv preprint]

[5] Chen, Z., Zhao, Z., He, T., Chen, B., Zhao, X., Gong, L., & Liu C. (2024). Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards, in Proceeding of 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024), Abu Dhabi ,UAE, October 2024. [arXiv preprint]. (Oral).

[15] Chen, Z., Du, Y., Wen, Z., Zhou, Y., Cui, C., Weng, Z., Tu, H., Wang, C., Tong, Z., Huang, Q., Chen, C., Ye, Q., Zhu, Z., Zhang, Y., Zhou, J., Zhao, Z., Rafailov, R., Finn, C., & Yao, H. (2024). MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?. arXiv preprint arXiv:2407.04842. [arXiv preprint], [project page], [github repo], [huggingface].

Publications

[14] Zhu, Z., Cheng, X., Chen, Z., Chen, Y., Zhang, Y., Wu, X., Zheng, Y., & Xing, B. (2024). InMu-Net: Advancing Multi-modal Intent Detection via Information Bottleneck and Multi-sensory Processing. In Proceeding of 2024 ACM Multimedia Conference (ACM MM 2024), Melbourne, Australia, Oct 2024. (Oral).

[13] Zhou, Y., Fan, Z., Cheng, D., Yang, S., Chen, Z., Cui, C., Wang, X., Li, Y., Zhang, L., & Yao, H. (2024). Calibrated Self-rewarding Vision Language Models, in Proceeding of the Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, Canada, Dec 9. [arXiv preprint], [github repo].

[12] Zhu, Z., Cheng, X., Zhang, Y., Chen, Z., Long, Q., Li, H., Huang, Z., Wu, X., & Zheng, Y. (2024). Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation. In Proceeding of 2024 International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2024), Marrakesh, Morocco, Oct 2024.

[11] Zhang, Y., Zhao, Z., Chen, Z., Feng, Z., Ding, Z., & Sun, Y. (2024). RankCLIP: Ranking-Consistent Language-Image Pretraining. [arXiv preprint].

[10] Chen, Z., Zhao, Z., Qu, W., Wen, Z., Han, Z., Zhu, Z., Zhang, J., & Yao, H. (2024). PANDORA: Detailed LLM Jailbreaking via Collaborated Phishing Agents with Decomposed Reasoning. Short version presented at ICLR 2024 Workshop on Secure and Trustworthy Large Language Models (SeT LLM@ICLR2024). [paper link].

[9] Yang, X., Wen, Z., Qu, W., Chen, Z., Xiang, Z., Chen, B., & Yao, H. (2024). Memorization and Privacy Risks in Domain-Specific Large Language Models. In ICLR 2024 Workshop on Reliable and Responsible Foundation Models (R2-FM@ICLR2024).

[6] Hai, X., Liu, X., Chen, Z., Tan, Y., Zhang, H., Liu, G., Zhou, R., & Zhou, X. (2024). Ghost-in-Wave: How Speaker-Irrelative Features Interfere Deepfake Voice Detectors. In Proceeding of 2024 IEEE International Conference on Multimedia and Expo (ICME 2024), Niagra Falls, Canada, Jul 2024. (Oral).

[4] Xie, S., Gong, L., Chen, Z., & Chen, B. (2023, July). Simulation of Real-time Collision-Free Path Planning Method with Deep Policy Network in Human-Robot Interaction Scenario. In 2023 International Conference on Advanced Robotics and Mechatronics (ICARM) (pp. 360-365). IEEE. [paper link]

[3] Chen, Z., Chen, B., Xie, S., Gong, L., Liu, C., Zhang, Z., & Zhang, J. (2021, September). Efficiently training on-policy actor-critic networks in robotic deep reinforcement learning with demonstration-like sampled exploration. In 2021 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT) (pp. 292-298). IEEE. [paper link] (Best Paper Award).

[2] Sun, T., Gong, L., Li, X., Xie, S., Chen, Z., Hu, Q., & Filliat, D. (2021). Robotdrlsim: A real time robot simulation platform for reinforcement learning and human interactive demonstration learning. In Journal of Physics: Conference Series (Vol. 1746, No. 1, p. 012035). IOP Publishing. [paper link]

[1] Zhao, L., Gong, L., Li, X., Yang, C., Chen, Z., Huang, Y., & Liu, C. (2019, July). A Bionic Arm Mechanism Design and Kinematic Analysis of the Humanoid Traffic Police. In 2019 IEEE 9th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER) (pp. 1606-1611). IEEE. [paper link]