Cheng Li
Home
Publications
Experience
Projects
Talks
Languages
Contact
Algorithms
TOPS
Leveraging NVIDIA’s Tensor Cores to express Collectives with matrix multiplication and exploring the benefits in terms of program simplicity, efficiency, and performance.
Cite
×