Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Portfolio item number 2
Short description of portfolio item number 2 
publications
BRACE: A Benchmark for Robust Audio Caption Quality Evaluation
Published in NeurIPS 2025, 2025
Recommended citation: T Guo*, H Chen*, H Liang*, M Qiang, B Zeng, L Sun, B Cui, W Zhang. (2025). "BRACE: A Benchmark for Robust Audio Caption Quality Evaluation." NeurIPS 2025. (Project Leader)
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
Published in ACL 2025, 2025
Recommended citation: T Zhang, C Zhu, Y Shen, W Luo, Y Zhang, H Liang, F Yang, M Lin, Y Qiao, et al. (2025). "CFBench: A Comprehensive Constraints-Following Benchmark for LLMs." ACL 2025.
MathScape: Evaluating MLLMs in Multimodal Math Scenarios through a Hierarchical Benchmark
Published in ACM MM 2025, 2025
Recommended citation: H Liang*, L Sun*, M Zhou*, T Li, Z Wu, M Lin, L Sun, Y Zhou, Y Zhang, et al. (2025). "MathScape: Evaluating MLLMs in Multimodal Math Scenarios through a Hierarchical Benchmark." ACM MM 2025.
MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification
Published in ACL 2025, 2025
Recommended citation: L Sun*, H Liang*, J Wei, B Yu, T Li, F Yang, Z Zhou, W Zhang. (2025). "MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification." ACL 2025.
Facilitating Multi-Turn Function Calling for LLMs via Compositional Instruction Tuning
Published in ICLR 2025, 2025
Recommended citation: M Chen, H Sun, T Li, F Yang, H Liang, K Lu, B Cui, W Zhang, Z Zhou, et al. (2025). "Facilitating Multi-Turn Function Calling for LLMs via Compositional Instruction Tuning." ICLR 2025.
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Published in ICDE 2025, 2025
Recommended citation: M Zheng*, H Liang*, F Yang, H Sun, T Li, L Xiong, Y Zhang, Y Wu, K Li, et al. (2025). "PAS: Data-Efficient Plug-and-Play Prompt Augmentation System." ICDE 2025.
QAEncoder: Towards Aligned Representation Learning in Question Answering Systems
Published in ACL 2025 (Oral), 2025
Recommended citation: Z Wang, Q Yu, S Wei, Z Li, F Xiong, X Wang, S Niu, H Liang, W Zhang. (2025). "QAEncoder: Towards Aligned Representation Learning in Question Answering Systems." ACL 2025 (Oral).
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
Published in ACM MM 2025, 2025
Recommended citation: Z Liu*, H Liang*, B Li, T Bai, W Xiong, C Chen, C He, W Zhang, B Cui. (2025). "SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models." ACM MM 2025.
UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens
Published in NeurIPS 2025, 2025
Recommended citation: R An, S Yang, R Zhang, Z Shen, M Lu, G Dai, H Liang, Z Guo, S Yan, et al. (2025). "UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens." NeurIPS 2025.
Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge
Published in Technical Report (1st Place Winner of ICML SeePhys Challenge 2025), 2025
Recommended citation: H Liang, R Wu, B Zeng, J Niu, W Zhang, B Dong. (2025). "Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge."
Data Preparation for Large Language Models
Published in Journal of Computer Science and Technology (JCST), 2026
Recommended citation: H Liang, ZH Wong, R Liu, Y Wang, M Qiang, Z Zhao, C Shen, C He, et al. (2026). "Data Preparation for Large Language Models." Journal of Computer Science and Technology.
Learning What Reinforcement Learning Can’t: Interleaved Online Fine-Tuning for Hardest Questions
Published in ICLR 2026, 2026
Recommended citation: L Ma, H Liang, M Qiang, L Tang, X Ma, ZH Wong, J Niu, C Shen, R He, et al. (2026). "Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions." ICLR 2026.
Let’s Verify Math Questions Step by Step
Published in KDD 2026, 2026
Recommended citation: C Shen*, ZH Wong*, R He*, H Liang*, M Qiang, Z Meng, Z Zhao, B Zeng, et al. (2026). "Let's Verify Math Questions Step by Step." KDD 2026. (Project Leader)
LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts
Published in WWW 2026, 2026
Recommended citation: H Liang*, Q Cai*, H Dong, M Qiang, R An, Z Han, Z Zhu, B Cui, W Zhang. (2026). "LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts." WWW 2026.
Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL
Published in ICDE 2026, 2026
Recommended citation: Q Cai*, H Liang*, C Xu*, T Xie, W Zhang, B Cui. (2026). "Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL." ICDE 2026.
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Teaching experience 2
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.
