Publication MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu Thirty-eighth Annual Conference on Neural Information Processing System (NeurIPS 2024) | December 2024 spotlight Github Project
Publication Uncovering Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor Siran Liu, Chengxiang Qi, Ying Cao, Chao Yang, Weifang Hu, Xuanhua Shi, Fan Yang, Mao Yang SOSP | November 2024
Publication WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin 2024 Meeting of the Association for Computational Linguistics | October 2024
Publication Uncover Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor Ying Cao, Fan Yang, Mao Yang September 2024
Publication Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Zhenting Qi, Mingyuan Ma, Jiahang Xu, Li Lyna Zhang, Fan Yang, Mao Yang Arxiv | August 2024
Publication Scaling Deep Learning Computation over the Inter-Core Connected Intelligence Processor Yiqi Liu, Yu Xue, Yu Cheng, Lingxiao Ma, Ziming Miao, Jilong Xue, Jian Huang SOSP 2024 | August 2024
Publication Uncovering Milestone Papers: A Network Diffusion and Game Theory Approach Wei Zhang, Juyang Cao, Manuel Sebastian Mariani, Zhen-Zhen Wang, Mingyang Zhou, Wei Chen, Hao Liao Journal of Informetrics | August 2024, Vol 18(3)
Publication LordNet: An efficient neural network for learning to solve parametric partial differential equations without simulated data Xinquan Huang, Wenlei Shi, Xiaotian Gao, Xinran wei, Jia Zhang, Jiang Bian, Mao Yang, Tie-Yan Liu Neural Networks | August 2024
Publication Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Hongyu Wang, Shuming Ma, Ruiping Wang, Furu Wei July 2024
Publication Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond Xutong Liu, Siwei Wang, Jinhang Zuo, Han Zhong, Xuchuang Wang, Zhiyong Wang, Shuai Li, Mohammad Hajiesmaili, John C.S. Lui, Wei Chen Proceedings of the 41st International Conference on Machine Learning (ICML) | July 2024