Publications

Highlights

For a full list see below or go to Google Scholar.

Aeolus: A Building Block for Proactive Transport in Datacenters

As datacenter network bandwidth keeps growing, proactive transport becomes attractive, where bandwidth is proactively allocatedas “credits” to senders who then can send “scheduled packets” at a right rate to ensure high link utilization, low latency, and zero packet loss. We present Aeolus, a solution focusing on “pre-credit” packet transmission as a building block for proactive transports.

Shuihai Hu, Wei Bai, Gaoxiong Zeng, Zilong Wang, Baochen Qiao, Kai Chen, Kun Tan, Yi Wang

SIGCOMM 2020, PDF

AuTO: Scaling Deep Reinforcement Learning to Enable Datacenter-Scale Automatic Traffic Optimization

Traffic optimizations (TO, e.g. flow scheduling, load balancing) in datacenters are difficult online decision-making problems. Leveraging the long-tail distribution of datacenter traffic, we develop a two-level DRL system, AuTO, mimicking the Peripheral & Central Nervous Systems in animals, to solve the scalability problem.

Li Chen, Justinas Lingys, Kai Chen, Feng Liu

SIGCOMM 2018, PDF

PowerMan: An Out-of-Band Management Network for Datacenters using Power Line Communication

Management tasks in datacenters are usually executed in-band with the data plane applications, making them susceptible to faults and failures in the data plane. We design PowerMan, a novel datacenter management network that can be readily built into existing datacenter power systems

Li Chen, Jiacheng Xia, Bairen Yi, Kai Chen

NSDI 2018, PDF

Resilient Datacenter Load Balancing in the Wild

Production datacenters operate under various uncertainties such as traffic dynamics, topology asymmetry, and failures. Therefore, datacenter load balancing schemes must be resilient to these uncertainties. We introduce Hermes, a datacenter load balancer that is resilient to uncertainties.

Hong Zhang, Junxue Zhang, Wei Bai, Kai Chen, Mosharaf Chowdhury

SIGCOMM 2017, PDF

Enabling Wide-spread Communications on Optical Fabric with MegaSwitch

Existing wired optical interconnects face a challenge of supporting wide-spread communications in production clusters. We present MegaSwitch, a multi-fiber ring optical fabric that exploits space division multiplexing across multiple fibers to deliver rearrangeably non-blocking communications to 30+ racks and 6000+ servers.

Li Chen, Kai Chen, Joshua Zhu, Minlan Yu, George Porter, Chunming Qiao, Shan Zhong

NSDI 2017, PDF

 

Full List

AutoByte: Automatic Configuration for Optimal Communication Scheduling in DNN Training
Yiqing Ma, Hao Wang, Yiming Zhang, Kai Chen
INFOCOM 2022

Cutting Tail Latency in Commodity Datacenters with Cloudburst
Gaoxiong Zeng, Li Chen, Bairen Yi, Kai Chen
INFOCOM 2022

Addressing Network Bottlenecks with Divide-and-Shuffle Synchronization for Distributed DNN Training
Weiyan Wang, Cengguang Zhang, Liu Yang, Kai Chen, Kun Tan
INFOCOM 2022

Sphinx: Enabling Privacy-Preserving Online Learning over the Cloud
Han Tian, Chaoliang Zeng, Zhenghang Ren, Di Chai, Junxue Zhang, Kai Chen, Qiang Yang
S&P 2022

Improving Availability of Vertical Federated Learning
Zhenghang Ren, Liu Yang, Kai Chen
ACM Transactions on Intelligent Systems and Technology 2021

Efficient Federated Matrix Factorization against Inference Attacks
Di Chai, Leye Wang, Kai Chen, Qiang Yang
ACM Transactions on Intelligent Systems and Technology 2021

FlashPass: Proactive Congestion Control for Shallow-buffered WAN
Gaoxiong Zeng, Jianxin Qiu, Yifei Yuan, Hongqiang Liu, Kai Chen
ICNP 2021, PDF

HAFLO: GPU-Based Acceleration for Federated Logistic Regression
Xiaodian Cheng, Wanhang Lu, Xinyang Huang, Shuihai Hu and Kai Chen
FTL-IJCAI 2021, PDF

Aegis: A Trusted, Automatic and Accurate Verification Framework for Vertical Federated Learning
Cengguang Zhang, Junxue Zhang, Di Chai and Kai Chen
FTL-IJCAI 2021, PDF

Tiara: A Scalable and Efficient Hardware Acceleration Architecture for Stateful Layer-4 Load Balancing
Chaoliang Zeng, Layong Luo, Teng Zhang, Zilong Wang, Luyang Li, Wenchen Han, Nan Chen, Lebing Wan, Lichao Liu, Zhipeng Ding, Xiongfei Geng, Tao Feng, Feng Ning, Kai Chen, Chuanxiong Guo
NSDI 2022, PDF

Enabling Edge-Cloud Video Analytics for Robotic Applications
Yiding Wang, Weiyan Wang, Duowen Liu, Xin Jin, Junchen Jiang, Kai Chen
INFOCOM 2021, PDF

Secure efficient Federated KNN for Recommendation Systems
Zhaorong Liu, Leye Wang, Kai Chen
ICNC-FSKD 2020, PDF

Accelerating Intra-Party Communication in Vertical Federated Learning with RDMA
Duowen Liu
DistributedML 2020, PDF

Federated Recommendation Systems
Liu Yang, Ben Tan, Vincent W. Zheng, Kai Chen, Qiang Yang
Federated Learning, PDF

Exploring Clustering of Bandits for Online Recommendation System
Liu Yang, Bo Liu, Leyu Lin, Feng Xia, Kai Chen, Qiang Yang
RecSys 2020, PDF

Secure Federated Matrix Factorization
Di Chai, Leye Wang, Kai Chen, Qiang Yang
IEEE Intelligent Systems, PDF

FPGA-Based Hardware Accelerator of Homomorphic Encryption for Efficient Federated Learning
Zhaoxiong Yang, Shuihai Hu, Kai Chen
FL-IJCAI 2020, PDF

Aeolus: A Building Block for Proactive Transport in Datacenters
Shuihai Hu, Wei Bai, Gaoxiong Zeng, Zilong Wang, Baochen Qiao, Kai Chen, Kun Tan, Yi Wang
SIGCOMM 2020, PDF

RAT - Resilient Allreduce Tree for Distributed Machine Learning
Xinchen Wan, Hong Zhang, Hao Wang, Shuihai Hu, Junxue Zhang, Kai Chen
APNet 2020, PDF

One More Config is Enough: Saving (DC)TCP for High-speed Extremely Shallow-buffered Datacenters
Wei Bai, Shuihai Hu, Kai Chen, Kun Tan, Yongqiang Xiong
INFOCOM 2020, PDF

Enabling ECN for Datacenter Networks with RTT Variations
Junxue Zhang, Wei Bai, Kai Chen
CoNEXT 2019, PDF

Congestion Control for Cross-Datacenter Networks
Gaoxiong Zeng, Wei Bai, Ge Chen, Kai Chen, Dongsu Han, Yibo Zhu, Lei Cui
ICNP 2019, PDF

Rethinking Transport Layer Design for Distributed Machine Learning
Jiacheng Xia, Gaoxiong Zeng, Junxue Zhang, Weiyan Wang, Wei Bai, Junchen Jiang, Kai Chen
APNet 2019, PDF

Quantifying the Performance of Federated Transfer Learning
Qinghe Jing, Weiyan Wang, Junxue Zhang, Han Tian, Kai Chen
FL-IJCAI 2019, the 1st Intl. Workshop on Federated Learning for User Privacy & Data Confidentiality (Best Student Paper Award), PDF

Secure Federated Matrix Factorization
Di Chai, Leye Wang, Kai Chen, Qiang Yang
FL-IJCAI 2019, the 1st Intl. Workshop on Federated Learning for User Privacy & Data Confidentiality, PDF

Bridging the Edge-Cloud Barrier for Real-time Advanced Vision Analytics
Yiding Wang, Weiyan Wang, Junxue Zhang, Junchen Jiang, Kai Chen
HotCloud 2019, PDF

Tagger: Practical PFC Deadlock Prevention in Data Center Networks
Shuihai Hu, Yibo Zhu, Peng Cheng, Chuanxiong Guo, Kun Tan, Jitendra Padhye, Kai Chen
IEEE/ACM Transactions on Networking, 2019, PDF

Providing Bandwidth Guarantees, Work Conservation and Low Latency Simultaneously in the Cloud
Shuihai Hu, Wei Bai, Kai Chen, Chen Tian, Ying Zhang, Haitao Wu
IEEE Transactions on Cloud Computing, 2019, PDF

AuTO: Scaling Deep Reinforcement Learning to Enable Datacenter-Scale Automatic Traffic Optimization
Li Chen, Justinas Lingys, Kai Chen, Feng Liu
SIGCOMM 2018, PDF

Augmenting Proactive Congestion Control with Aeolus
Shuihai Hu, Wei Bai, Baochen Qiao, Kai Chen, Kun Tan
APNet 2018, PDF

Pas de deux: Shape the Circuits, and Shape the Apps too!
Hong Zhang, Kai Chen, Mosharaf Chowdhury
APNet 2018, PDF

BDS: A Centralized Near-Optimal Overlay Network for Inter-Datacenter Data Replication
Yuchao Zhang, Junchen Jiang, Ke Xu, Xiaohui Nie, Martin Reed, Haiyang Wang, Guang Yao, Miao Zhang, Kai Chen
EuroSys 2018, PDF

PowerMan: An Out-of-Band Management Network for Datacenters using Power Line Communication
Li Chen, Jiacheng Xia, Bairen Yi, Kai Chen
NSDI 2018, PDF

Enabling Work-conserving Bandwidth Guarantees for Multi-tenant Datacenters via Dynamic Tenant-Queue Binding
Zhuotao Liu, Kai Chen, Haitao Wu, Shuihai Hu, Yih-Chun Hu, Yi Wang, Gong Zhang
INFOCOM 2018, PDF

Tagger: Practical PFC Deadlock Prevention in Data Center Networks
Shuihai Hu, Yibo Zhu, Peng Cheng, Chuanxiong Guo, Kun Tan, Jitendra Padhye, Kai Chen
CoNEXT 2017, PDF

Resilient Datacenter Load Balancing in the Wild
Hong Zhang, Junxue Zhang, Wei Bai, Kai Chen, Mosharaf Chowdhury
SIGCOMM 2017, PDF

Combining ECN and RTT for Datacenter Transport
Gaoxiong Zeng, Wei Bai, Ge Chen, Kai Chen, Dongsu Han, Yibo Zhu
APNet 2017, PDF

Congestion Control for High-speed Extremely Shallowbuffered Datacenter Networks
Wei Bai, Kai Chen, Shuihai Hu, Kun Tan, Yongqiang Xiong
APNet 2017, PDF

Information-Agnostic Flow Scheduling for Commodity Data Centers
Wei Bai, Li Chen, Kai Chen, Dongsu Han, Chen Tian, Hao Wang
IEEE/ACM Transactions on Networking (ToN), 2017, PDF

Towards A Scalable, Fault-tolerant, High-performance Optical Data Center Architecture
Kai Chen, Xitao Wen, Xingyu Ma, Yan Chen, Yong Xia, Chengchen Hu, Qunfeng Dong, Yongqiang Liu
IEEE/ACM Transactions on Networking (ToN), 2017, PDF

Rate-Aware Flow Scheduling for Commodity Data Center Networks
Ziyang Li, Wei Bai, Kai Chen, Dongsu Han, Yiming Zhang, Dongsheng Li, Hongfang Yu
INFOCOM 2017, PDF

Enabling Wide-spread Communications on Optical Fabric with MegaSwitch
Li Chen, Kai Chen, Joshua Zhu, Minlan Yu, George Porter, Chunming Qiao, Shan Zhong
NSDI 2017, PDF

Enabling ECN over Generic Packet Scheduling
Wei Bai, Kai Chen, Li Chen, Changhoon Kim, Haitao Wu
CoNEXT 2016, PDF

Stream: Decentralized Opportunistic Inter-Coflows Scheduling for Datacenter Networks
Hengky Susanto, Hao Jin, Kai Chen
ICNP 2016, PDF

Guaranteeing Deadlines for Inter-Datacenter Transfers
Hong Zhang, Kai Chen, Wei Bai, Dongsu Han, Chen Tian, Hao Wang, Haibing Guan, Ming Zhang
IEEE/ACM Transactions on Networking (ToN), 2016, PDF

CODA: Toward Automatically Identifying and Scheduling Coflows in the Dark
Hong Zhang, Li Chen, Bairen Yi, Kai Chen, Mosharaf Chowdhury, Yanhui Geng
SIGCOMM 2016, PDF

Scheduling Mix-flows in Commodity Datacenters with Karuna
Li Chen, Kai Chen, Wei Bai, Mohammad Alizadeh
SIGCOMM 2016, PDF

Enabling ECN in Multi-Service Multi-Queue Data Centers
Wei Bai, Li Chen, Kai Chen, Haitao Wu
NSDI 2016, PDF

Providing Bandwidth Guarantees, Work Conservation and Low Latency Simultaneously in the Cloud
Shuihai Hu, Wei Bai, Kai Chen, Chen Tian, Ying Zhang, Haitao Wu
INFOCOM 2016, PDF

OPTAS: Decentralized Flow Monitoring and Scheduling for Tiny Tasks
Ziyang Li, Yiming Zhang, Dongsheng Li, Kai Chen, Yuxing Peng
INFOCOM 2016, PDF

Explicit Path Control in Commodity Data Centers: Design and Applications
Shuihai Hu, Kai Chen, Haitao Wu, Wei Bai, Chang Lan, Hao Wang, Hongze Zhao, Chuanxiong Guo
IEEE/ACM Transactions on Networking (ToN), 2015, PDF

Towards Comprehensive Traffic Forecasting in Cloud Computing: Design and Application
Yang Peng, Kai Chen, Guohui Wang, Wei Bai, Yangming Zhao, Hao Wang, Yanhui Geng, Zhiqiang Ma, Lin Gu
IEEE/ACM Transactions on Networking (ToN), 2015, PDF

FlowProphet: Generic and Accurate Traffic Prediction for Data-parallel Cluster Computing
Hao Wang, Li Chen, Kai Chen, Ziyang Li, Yiming Zhang, Haibin Guan, Zhengwei Qi, Dongsheng Li, Yanhui Geng
ICDCS 2015, PDF

Information-Agnostic Flow Scheduling for Commodity Data Centers
Wei Bai, Li Chen, Kai Chen, Dongsu Han, Chen Tian, Hao Wang
NSDI 2015, PDF

Explicit Path Control in Commodity Data Centers: Design and Applications
Shuihai Hu, Kai Chen, Haitao Wu, Wei Bai, Chang Lan, Hao Wang, Hongze Zhao, Chuanxiong Guo
NSDI 2015, PDF

Guaranteeing Deadlines for Inter-Datacenter Transfers
Hong Zhang, Kai Chen, Wei Bai, Dongsu Han, Chen Tian, Hao Wang, Haibing Guan, Ming Zhang
EuroSys 2015, PDF

RAPIER: Integrating Routing and Scheduling for Coflow-aware Data Center Networks
Yangming Zhao, Kai Chen, Wei Bai, Minlan Yu, Chen Tian, Yanhui Geng, Yiming Zhang, Dan Li, Sheng Wang
INFOCOM 2015, PDF

Joint VM Placement and Topology Optimization for Traffic Scalability in Dynamic Datacenter Networks
Yangming Zhao, Yifan Huang, Kai Chen, Minlan Yu, Sheng Wang, Dongsheng Li
Computer Networks, 2015, PDF

PIAS: Practical Information-Agnostic Flow Scheduling for Datacenter Networks
Wei Bai, Li Chen, Kai Chen, Dongsu Han, Chen Tian, Weicheng Sun
HotNets 2014, PDF

PAC: Taming TCP Incast Congestion Using Proactive ACK Control
Wei Bai, Kai Chen, Haitao Wu, Wuwei Lan, Yangming Zhao
ICNP 2014, PDF

BitBill: Scalable, Robust, Verifiable Peer-to-Peer Billing for Cloud Computing
Li Chen, Kai Chen
HotCloud 2014, PDF

HadoopWatch: A First Step Towards Comprehensive Traffic Forecasting in Cloud Computing
Yang Peng, Kai Chen, Guohui Wang, Wei Bai, Zhiqiang Ma, Lin Gu
INFOCOM 2014, PDF

Towards Minimal-Delay Deadline-Driven Data Center TCP
Li Chen, Shuihai Hu, Kai Chen, Haitao Wu
HotNets 2013, PDF

OSA: An Optical Switching Architecture for Data Center Networks with Unprecedented Flexibility
Kai Chen, Ankit Singla, Atul Singh, Kishore Ramachandran, Lei Xu, Yueping Zhang, Xitao Wen, Yan Chen
IEEE/ACM Transactions on Networking (ToN), 2013, PDF

OSA: An Optical Switching Architecture for Data Center Networks with Unprecedented Flexibility
Kai Chen, Ankit Singla, Atul Singh, Kishore Ramachandran, Lei Xu, Yueping Zhang, Xitao Wen, Yan Chen
NSDI 2012, PDF

DAC: Generic and Automatic Address Configuration for Data Center Networks
Kai Chen, Chuanxiong Guo, Haitao Wu, Jing Yuan, Zhenqian Feng, Yan Chen, Songwu Lu, Wenfei Wu
IEEE/ACM Transactions on Networking (ToN), 2012, PDF

Generic and Automatic Address Configuration for Data Center Networks
Kai Chen, Chuanxiong Guo, Haitao Wu, Jing Yuan, Zhenqian Feng, Yan Chen, Songwu Lu, Wenfei Wu
SIGCOMM 2010, PDF

Where the Sidewalk Ends: Extending the Internet AS Graph Using Traceroutes From P2P Users
Kai Chen, David Choffnes, Rahul Potharaju, Yan Chen, Fabian Bustamante, Dan Pei, Yao Zhao
CoNEXT 2009, PDF