Cheng Li
Abdul Dakkak
Latest
The Design and Implementation of a Scalable DL Benchmarking Platform
DLSpec: A Deep Learning Task Exchange Specification
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs
MLModelScope: Evaluate and Introspect Cognitive Pipelines
Accelerating Reduction and Scan Using Tensor Core Units
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function as a Service Environments
Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects
Accelerating Reduction Using Tensor Core Units
SCOPE: C3SR Systems Characterization and Benchmarking Framework
RAI: A Scalable Project Submission System for Parallel Programming Courses