[SIST Seminar] Routing and Scheduling in Optical Data Center Networks for Emerging Cloud Applications

ON2024-02-23TAG: ShanghaiTech UniversityCATEGORY: Lecture

Topic: Routing and Scheduling in Optical Data Center Networks for Emerging Cloud Applications

Speaker: Dr. LI Jialong, Max Planck Institute for Informatics (MPI-INF)

Date and time: 10:00–11:00, February 28

Venue: Room 1A-200, SIST

Host: YANG Zhice


Abstract:

Optical data center networks show promise to serve as the next-generation cloud infrastructure with their cost and power benefits. As circuit-switched networks, optical data center networks set up dedicated optical circuits between endpoints before they can exchange data. On one hand, this property makes optical data center networks a good fit for bulk data transfers in critical cloud applications, such as machine learning and parallel computing. On the other hand, this nature deviates from traditional packet-switched networks and brings unique challenges for routing and task scheduling. In this talk, we present our solutions to routing and tasking scheduling in optical data center networks. We first introduce Hop-On Hop-Off (HOHO) routing that leverages programmable switches to accelerate flow transmission by up to 35%, and then give an overview to Network-Aware GPU Sharing (NAGS) that allocates distributed training jobs on GPUs to minimize training time and maximize GPU utilization.


Biography:

LI Jialong is a postdoctoral researcher at Max Planck Institute for Informatics (MPI-INF). Before joining MPI-INF, he received his BE and PhD degrees in Electronic Engineering from Tsinghua University in 2016 and 2021, respectively. His research interests include optical networks, optical data center networks, and network-accelerated machine learning systems. He has published more than ten papers in journals and conferences such as JOCN, IEEE/ACM Transactions on Networking, APNet, OFC, and served as a reviewer for journals including JOCN, IEEE/ACM Transactions on Networking, Mathematics.