đ About Me
My research interests lie in agents based on large models (LLMs/MLMs). Specifically, I am interested in:
- The integration of LMs with APIs (tools).
- Retrieval augmented generation.
- Multi-Agent Scaling.
Recently, I aim to explore the integration of LLM inference scaling with Tools.
My research group primarily focuses on system software, including machine learning systems, web systems, software engineering, and etc. However, during my second year of PhD studies, I discovered a greater interest in AI, so my current focus is primarily on AI :-).
đ Publications
- Z. Tao, âŠ,H. Shen, et al. âWebShaper: Agentically Data Synthesizing via Information-Seeking Formalizationâ, arXiv (arXivâ25).
- Z. Shao*, H. Shen*, et al. âGrounding AI Explanations in Experience: A Reflective Cognitive Architecture for Clinical Decision Supportâ, arXiv (arXivâ25).
- H. Shen*, H. Yan*, et al. âRAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimizationâ, arXiv (arXivâ25).
- T. Guo*, H. Shen*, et al. âMASS: Multi-Agent Simulation Scaling for Portfolio Constructionâ, arXiv, (arXivâ25).
- Q. Yang, W. B, H.Shen et al. âPixelWeb: The First Web GUI Dataset with Pixel-Wise Labelsâ, arXiv (arXivâ25).
- Z. Chen, Y. Ma, H.Shen et al. âWeInfer: Unleashing the Power of WebGPU on LLM Inference in Web Browsersâ, the Web Conference (WWWâ25).
- H. Shen, Y. Li et al. âShortcutsbench: A large-scale real-world benchmark for API-based agentsâ the International Conference on Learning Representations (ICLRâ25).
- M. Liu, H.Shen et al. âWebAssembly for Container Runtime: Are We There Yet?â Transactions on Software Engineering Methodology (TOSEMâ24).
-
H. Shen, Y. Ma et al. âAdpal: Automatic detection of troubled users in online service systems via page access logsâ, IEEE International Conference on Web Services (ICWSâ23).
đ Experience
- Intern, with Alibab Tongyi DeepResearch Group (2025.06 ~ now)
- WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization.
- Tongyi DeepResearch https://github.com/Alibaba-nlp/DeepResearch, which has gained over 10k Starâ and was trending on GitHub.
- TBD.
- Intern, with Miracleplus \& with Hang Yan in Shanghai AI Lab. (2024.11 - 2025.05)
- Outstanding Research Award, Peking University (2022-2023)
-
Ph.D. Candidate in Computer Science and Technology, School of Computer Science at Peking University (2022.09 - now)
- Ph.D. comprehensive exam: LLM, MLM, and LLM-based agent
-
Researcher, Alibaba Innovative Researcher in Technical & Quality at Fliggy, Alibaba (2022.08 - 2022.10)
- Research topic: Anomaly detection using page access logs
- National Scholarship, First Class Scholarship, and WU Yajun Scholarship, Northwestern Polytechnical University (2019 ~ 2021)
- B.Sc. in Computer Science and Technology, School of Computer Science at Northwestern Polytechnical University (2018.09 - 2022.07)