Poster
in
Workshop: Machine Learning with New Compute Paradigms
Inference analysis of optical transformers
Xianxin Guo · Chenchen Wang · Djamshid Damry
Abstract:
This paper explores the utilization of optical computing for accelerating inference in transformer models, which have demonstrated substantial success in various applications. Optical computing offers ultra-fast computation and ultra-high energy efficiency compared to conventional electronics. Our findings suggest that optical implementation has the potential to achieve a significant 10-100 times improvement in the inference throughput of compute-limited transformer models.
Chat is not available.