| 
 Tao Sun (孙涛) |   | Associate   Professor(副研究员,硕导),College of Computer Science and Technology, National University of Defense Technology (国防科技大学计算机学院)
 Changsha, Hunan, China
 I am seeking self-motivated students with strong mathematical skills and/or programming expertise. If you are interested in optimization, please do not hesitate to contact me (没人呀,招人呐).
 E-mail:suntao.saltfish@outlook.com;  nudtsuntao@163.com(Previous)
 | 
 About meI am now an associate professor in a research group led by Prof.  Xinwang Liu. News Starting in September 2024, I will update the news section to collect the rejection experiences from my academic journey.  2025-06 One paper was rejected by ICCV.  2025-05 Many papers were rejected by ICML.  2025-03 One paper was rejected by PAMI.  2025-02 Two papers were rejected by CVPR.  2025-01 Good month. No paper was rejected.  2024-12 One paper was rejected by JMLR.  2024-11 One paper was rejected by NC.  2024-10 Two papers were rejected by AAAI in the first round.  2024-9 One paper was rejected by NeurIPS, and another was withdrawn before. As a wise person once said, "As long as the withdrawal is done quickly, it won't be rejected." (no reference)  EducationPh.D., Computational Mathematics, National University of Defense Technology, 12.2018 M.S., Computational Mathematics, National University of Defense Technology, 12.2014 B.S., Applied Mathematics,  National University of Defense Technology, 06.2012 ExperienceAssociate professor, National University of Defense Technology, 12.2022--Now Assistant professor, National University of Defense Technology, 03.2019--12.2022 ResearchMy research interests include:  
Machine Learning  Deep LearningOptimizationDistributed Learning Selected Conference Papers 
        L. Shen, A. Tang, Y. Luo,  T. Sun, H. Hu, X. Cao, , "Targeted Low-rank Refinement: Enhancing Sparse Language Model with Precision.", ICML,  2025. T. Sun*, Y. Huang, L. Shen, K. Xu, B. Wang, "Investigating the Role of Weight Decay in Enhancing Nonconvex SGD.", CVPR,  2025. X. Deng**, T. Sun*,  et al., "Stability and Generalization of Asynchronous SGD: Sharper Bounds Beyond Lipschitz and Smoothness.", NeurIPS,  2024. X. Pan, X. Li, J. Liu, T. Sun, K. Sun, L. Chen, Z. Qu, "Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-) Convex One to K-Level Stochastic Optimizations.", ICML,  2024. X. Deng**, T. Sun*,  et al., "Exploring the Inefficiency of Heavy Ball as Momentum Parameter Approaches 1.", IJCAI,  2024. T. Sun,  et al., "Momentum Ensures Convergence of SIGNSGD under Weaker Assumptions.", ICML,  2023.X. Deng**, T. Sun*, S. Li,  et al.,  "Stability-Based Generalization Analysis of the Asynchronous Decentralized SGD.", AAAI,  2023.T. Sun,  et al.,  "Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks." Advances in Neural Information Processing Systems, 2022.T. Sun,  et al.,  "Adaptive Random Walk Gradient Descent for Decentralized Optimization." International Conference on Machine Learning, 2022.T. Sun,  et al.,  "Stability and Generalization of the Decentralized Stochastic Gradient Descent." Proceedings of the AAAI Conference on Artificial Intelligence 35, pp. 9756-9764 2021.T. Sun,  et al.,  "Heavy-ball Algorithms Always Escape Saddle Points". Proceedings of the International Joint Conference on Artificial Intelligence, pp.3520-3526, 2019. T. Sun, P. Yin, et al.,  "Non-ergodic Convergence Analysis of Heavy-ball Algorithms." Proceedings of the AAAI Conference on Artificial Intelligence 33, pp. 5033-5040, 2019.T. Sun, Y. Sun,  et al., "General Proximal Incremental Aggregated Gradient Algorithms: Better and Novel Results under General Scheme", Advances in Neural Information Processing Systems 32, 2019.T. Chen, G. Giannakis, T. Sun,  W. Yin, "LAG: Lazily Aggregated Gradient for Communication-Efficient Distributed Learning.", Advances in Neural Information Processing Systems 31, 2018.T. Sun, Y. Sun, W. Yin, "On Markov Chain Gradient Descent", Advances in Neural Information Processing Systems 31, 2018.T. Sun, R. Hannah, W. Yin, "Asynchronous Coordinate Descent under More Realistic Assumptions", Advances in Neural Information Processing Systems 30, 2017. Selected Journal Papers 
  Y. Lei,   T. Sun, M. Liu, "Minibatch and Local SGD: Algorithmic Stability and Linear Speedup in Generalization", Applied and Computational Harmonic Analysis,  2025.  P. Luo**, X. Deng**, Z. Wen**, T. Sun*, D. Li, "BHerd: Accelerating Federated Learning by Selecting Beneficial Herd of Local Gradients", IEEE Transactions on Computers ,  2025.  X. Deng**, L. Shen, S. Li,  T. Sun*, D. Li, D. Tao, "Towards understanding the generalizability of delayed stochastic gradient descent", IEEE Transactions on Pattern Analysis and Machine Intelligence ,  2025.   T. Sun, L. Shen, X. Liu, "On Nonconvex SGD under Unbounded Noise with Weak Gradient Lipschitz and Delayed Stochastic Gradient", IEEE Transactions on Pattern Analysis and Machine Intelligence ,  2025. S. Chen, X. Deng**, D. Xu*, T. Sun*,   et al., "Decentralized stochastic sharpness-aware minimization algorithm", Neural Networks Journal,  2024. T. Sun, Q. Wang, Y. Lei,  et al., "Pairwise Learning with Adaptive Online Gradient Descent", Transactions on Machine Learning Research,  2023. T. Sun,  et al., "On the Decentralized Stochastic Gradient Descent with Markov Chain Sampling", IEEE Transactions on Signal Processing ,  2023. T. Sun,  et al., "Decentralized Federated Averaging.", IEEE Transactions on Pattern Analysis and Machine Intelligence ,  2022. T. Sun,  et al., "General Nonconvex Total Variation and Low-Rank Regularizations: Model, Algorithm and Applications.", Pattern  Recognition Journal ,  2022. T. Sun,  et al., "Sign Stochastic Gradient Descents without Bounded Gradient Assumption for the Finite Sum Minimization.", Neural Networks Journal ,  2022.B. Wang#, T. M. Nguyen#, T. Sun#, A. L. Bertozzi, R. G. Baraniuk, S. J. Osher, "Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent.", SIAM J. Imaging Sciences ,  2021.T. Sun, H. Shen, T. Chen,  et al.,"Adaptive Temporal Difference Learning with Linear Function Approximation.", IEEE Transactions on Pattern Analysis and Machine Intelligence ,  2021.T. Sun, L. Qiao, Q. Liao,  et al., "Novel Convergence Results of Adaptive Stochastic Gradient Descent.", IEEE Transactions on Image Processing,  2020.T. Sun, L. Qiao,  et al., "Non-ergodic Complexity of Proximal Inertial Gradient Descents.", IEEE Transactions on Neural Networks and Learning Systems,  2020.T. Sun, K. Tang,  et al., "Gradient Descent Learning with Floats.", IEEE Transactions on Cybernetics,  2020.T. Sun,  et al., "Capri: Consensus Accelerated Proximal Reweighted Iteration for A Class of Nonconvex Minimizations.", IEEE Transactions on Knowledge and Data Engineering,  2020.T. Sun, Y. Sun, Y. Xu, W. Yin, "Markov Chain Block Coordinate Descent.",  Computational Optimization and Applications, pp. 35-61, 2020.T. Sun, R. Barrio, M. Rodriguez, H. Jiang, "Inertial Nonconvex Alternating Minimizations for the Image Deblurring.",  IEEE Transactions on Image Processing, pp. 6211-6224, 2019.T. Sun, P. Yin, H. Jiang, W. Zhu, "Iteratively Linearized Reweighted Alternating Direction Method of Multipliers for A Class of Nonconvex Problems.", IEEE Transactions on Signal Processing, pp.5380-5391, 2018.T. Sun, P. Yin, H. Jiang, L. Cheng, "Alternating Direction Method of Multipliers with Difference of Convex Functions.", Advances in Computational Mathematics, pp.723-744, 2018.T. Sun,  H. Jiang, L. Cheng, "Convergence of Proximal Iteratively Reweighted Nuclear Norm Algorithm for Image Processing.", IEEE Transactions on Image Processing , pp. 5632-5644, 2017.T. Sun,  H. Jiang, L. Cheng, "Global Convergence of Proximal Iteratively Reweighted Algorithm", Journal of Global Optimization, pp. 815-826, 2017. Note: *indicates the corresponding author, # denotes equal contributions,** denotes my student. Full list of publications. Academic serviceEditorial Board 
   Research, Science (Youth Editorial Board)Neural Networks, Elsevier Reviewer 
NeurIPS, ICML, ICLR, ECML, TMLR, AAAI, IJCAI, TPAMI Invited Talks OthersGrants (Chinese) Awards (Chinese) 
    CCF优博提名奖2020ACM中国新星奖(长沙分会,排名第一),2023国防科大青年科技创新奖2023湖南省优秀博士论文奖2021 |