l Distributed Computing Task Scheduling and Fault-tolerant for Machine Learning/Deep Learning Clusters, Cloud System, and Data Processing System l High Performance Computing Development and Optimization of HPC applications for Meteorology, Task Scheduling for HPC Systems |
长期从事云计算及高性能计算领域研究工作,包括云和数据中心等分布式系统以及新型分布式深度学习系统中的资源调度和实时容错,从事气候模拟等领域的高性能计算软件优化和研发。