Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Posts

Future Blog Post

less than 1 minute read

Published: January 01, 2199

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published: August 14, 2015

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published: August 14, 2014

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published: August 14, 2013

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published: August 14, 2012

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

publications

BRACE: A Benchmark for Robust Audio Caption Quality Evaluation

Published in NeurIPS 2025, 2025

Recommended citation: T Guo*, H Chen*, H Liang*, M Qiang, B Zeng, L Sun, B Cui, W Zhang. (2025). "BRACE: A Benchmark for Robust Audio Caption Quality Evaluation." NeurIPS 2025. (Project Leader)

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

Published in ACL 2025, 2025

Recommended citation: T Zhang, C Zhu, Y Shen, W Luo, Y Zhang, H Liang, F Yang, M Lin, Y Qiao, et al. (2025). "CFBench: A Comprehensive Constraints-Following Benchmark for LLMs." ACL 2025.

MathScape: Evaluating MLLMs in Multimodal Math Scenarios through a Hierarchical Benchmark

Published in ACM MM 2025, 2025

Recommended citation: H Liang*, L Sun*, M Zhou*, T Li, Z Wu, M Lin, L Sun, Y Zhou, Y Zhang, et al. (2025). "MathScape: Evaluating MLLMs in Multimodal Math Scenarios through a Hierarchical Benchmark." ACM MM 2025.

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification

Published in ACL 2025, 2025

Recommended citation: L Sun*, H Liang*, J Wei, B Yu, T Li, F Yang, Z Zhou, W Zhang. (2025). "MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification." ACL 2025.

Facilitating Multi-Turn Function Calling for LLMs via Compositional Instruction Tuning

Published in ICLR 2025, 2025

Recommended citation: M Chen, H Sun, T Li, F Yang, H Liang, K Lu, B Cui, W Zhang, Z Zhou, et al. (2025). "Facilitating Multi-Turn Function Calling for LLMs via Compositional Instruction Tuning." ICLR 2025.

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

Published in ICDE 2025, 2025

Recommended citation: M Zheng*, H Liang*, F Yang, H Sun, T Li, L Xiong, Y Zhang, Y Wu, K Li, et al. (2025). "PAS: Data-Efficient Plug-and-Play Prompt Augmentation System." ICDE 2025.

QAEncoder: Towards Aligned Representation Learning in Question Answering Systems

Published in ACL 2025 (Oral), 2025

Recommended citation: Z Wang, Q Yu, S Wei, Z Li, F Xiong, X Wang, S Niu, H Liang, W Zhang. (2025). "QAEncoder: Towards Aligned Representation Learning in Question Answering Systems." ACL 2025 (Oral).

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

Published in ACM MM 2025, 2025

Recommended citation: Z Liu*, H Liang*, B Li, T Bai, W Xiong, C Chen, C He, W Zhang, B Cui. (2025). "SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models." ACM MM 2025.

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

Published in NeurIPS 2025, 2025

Recommended citation: R An, S Yang, R Zhang, Z Shen, M Lu, G Dai, H Liang, Z Guo, S Yan, et al. (2025). "UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens." NeurIPS 2025.

Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge

Published in Technical Report (1st Place Winner of ICML SeePhys Challenge 2025), 2025

Recommended citation: H Liang, R Wu, B Zeng, J Niu, W Zhang, B Dong. (2025). "Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge."

Data Preparation for Large Language Models

Published in Journal of Computer Science and Technology (JCST), 2026

Recommended citation: H Liang, ZH Wong, R Liu, Y Wang, M Qiang, Z Zhao, C Shen, C He, et al. (2026). "Data Preparation for Large Language Models." Journal of Computer Science and Technology.

Learning What Reinforcement Learning Can’t: Interleaved Online Fine-Tuning for Hardest Questions

Published in ICLR 2026, 2026

Recommended citation: L Ma, H Liang, M Qiang, L Tang, X Ma, ZH Wong, J Niu, C Shen, R He, et al. (2026). "Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions." ICLR 2026.

Let’s Verify Math Questions Step by Step

Published in KDD 2026, 2026

Recommended citation: C Shen*, ZH Wong*, R He*, H Liang*, M Qiang, Z Meng, Z Zhao, B Zeng, et al. (2026). "Let's Verify Math Questions Step by Step." KDD 2026. (Project Leader)

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts

Published in WWW 2026, 2026

Recommended citation: H Liang*, Q Cai*, H Dong, M Qiang, R An, Z Han, Z Zhu, B Cui, W Zhang. (2026). "LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts." WWW 2026.

Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL

Published in ICDE 2026, 2026

Recommended citation: Q Cai*, H Liang*, C Xu*, T Xie, W Zhang, B Cui. (2026). "Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL." ICDE 2026.

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Hao Liang

Sitemap

Pages

Posts

portfolio

publications

talks

teaching