
Publications

Machine Learning on Commodity Tiny Devices: Theory and Practice. Book, CRC Press, 2022.


PASS: Patch Automatic Skip Scheme for Efficient Real-time Video Perception on Edge Devices. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI-23), to appear, 2022.

Graph Knows Unknowns: Reformulate Zero-shot Learning as Sample-level Graph Recognition. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI-23), to appear, 2022.

Hierarchical Channel-spatial Encoding for Communication-efficient Collaborative Learning. In Proceedings of the 36th Conference on Neural Information Processing Systems (NeurIPS-22), 2022.


A Comprehensive Inspection of the Straggler Problem. IEEE Transactions on Computers (TC), Spotlight Paper, 2021.


Octo: INT8 Training with Loss-aware Compensation and Backward Quantization for Tiny On-device Learning. In Proceedings of the USENIX Annual Technical Conference (ATC-21), 2021.


On-device Learning Systems for Edge Intelligence: A Software and Hardware Synergy Perspective. IEEE Internet of Things Journal, 2021.


Petrel: Heterogeneity-aware Distributed Deep Learning via Hybrid Synchronization. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020.


Canary: Decentralized Distributed Deep Learning via Gradient Sketch and Partition in Multi-interface Networks. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020.


Dual-view Attention Networks for Single Image Super-Resolution. In Proceedings of the 28th ACM International Conference on Multimedia (MM-20), 2020.


Falcon: Addressing Stragglers in Heterogeneous Parameter Server via Multiple Parallelism. IEEE Transactions on Computers (TC), 2020.


Petrel: Community-aware Synchronous Parallel for Heterogeneous Parameter Server. In Proceedings of the 40th IEEE International Conference on Distributed Computing Systems (ICDCS), poster, 2020.


Fast Coflow Scheduling via Traffic Compression and Stage Pipelining in Datacenter Networks. IEEE Transactions on Computers (TC), 2019.


Falcon: Towards Computation-Parallel Deep Learning in Heterogeneous Parameter Server. In Proceedings of the 39th IEEE International Conference on Distributed Computing Systems (ICDCS), 2019.


Cluster Frameworks for Efficient Scheduling and Resource Allocation in Data Center Networks: A Survey. IEEE Communications Surveys & Tutorials, 2018.


Swallow: Joint Online Scheduling and Coflow Compression in Datacenter Networks. In Proceedings of the 32nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2018.


Promoting Security and Efficiency in D2D Underlay Communication: A Bargaining Game Approach. In Proceedings of the IEEE Global Communications Conference (GLOBECOM), 2017.
