Publications

Publications by category in reverse chronological order. * denotes equal contribution.

2026

  1. Tencent HY
    Stabilizing RLVR via Token-level Gradient Diagnosis and Layerwise Clipping
    Guanhua Huang*, Tingqiang Xu*, Jinbo Wang*, Guangming Sheng, Siheng Li, Evander Yang, Kejiao Li, Yunxiang Li, Zenan Xu, Qi Yi, Kyrierl Deng, Ziyuan Nan, Yuhao Jiang, Chenchen Zhang, Taiqiang Wu, Feiyuan Zhang, Junhao Wang, Bo Zhou, Alex Chen, Di Wang, and Shunyu Yao
    Tencent HunYuan Research Blog, 2026
  2. ICLR
    Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws
    Jinbo Wang*, Binghui Li*, Zhanpeng Zhou, Mingze Wang, Yuxuan Sun, Jiaqi Zhang, Xunliang Cai, and Lei Wu
    In International Conference on Learning Representations (ICLR), 2026
  3. ICLR Workshop
    Scaling-Law Analysis of SignSGD: From Feature-Space Linear Regression to LLM Pre-training
    Binghui Li*, Jianan Wang*, Jinbo Wang*, Lean Wang*, Zilin Wang*, and Lei Wu
    In ICLR 2026 Workshop on Scientific Methods for Understanding Deep Learning (Sci4DL), in submission (authors in alphabetical order), 2026
  4. ACL Findings
    SWE-Mutation: Can LLMs Generate Reliable Test Suites in Software Engineering?
    Yuxuan Sun, Yuze Zhao, Yufeng Wang, Yao Du, Zhiyuan Ma, Jinbo Wang, Mengdi Zhang, Kai Zhang, and Zhenya Huang
    In Findings of the Association for Computational Linguistics: ACL 2026, 2026
  5. ICML
    GradPower: Powering Gradients for Faster Language Model Pre-Training
    Jinbo Wang*, Mingze Wang*, Jiaqi Zhang, Wei Wang, Peng Pei, Xunliang Cai, Weinan E, and Lei Wu
    In International Conference on Machine Learning (ICML), 2026

2025

  1. Preparation
    How Does Local Landscape Geometry Evolve in Language Model Pre-Training?
    Zhanpeng Zhou, Yuhan Sun, Bingrui Li, Jinbo Wang, Huaijin Wu, Lei Wu, and Junchi Yan
    Preparation, In submission, 2025
  2. ICML
    The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training
    In International Conference on Machine Learning (ICML), 2025

2024

  1. NeurIPS
    Improving Generalization and Convergence by Enhancing Implicit Regularization
    Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, and Lei Wu
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  2. JML
    Memory³: Language Modeling with Explicit Memory
    Hongkang Yang, Zehao Lin, Wenjin Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, Jinbo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, and Weinan E
    Journal of Machine Learning, 2024

2023

  1. ICONIP
    Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study
    Zeping Min and Jinbo Wang
    In International Conference on Neural Information Processing (ICONIP), 2023