LATTE: Low-Precision Approximate Attention with Head-wise Trainable Threshold for Efficient Transformer
Part Of
2024 IEEE 6th International Conference on AI Circuits and Systems, AICAS 2024 - Proceedings
Journal Volume
32
Start Page
208
End Page
212
ISBN (of the container)
979-8-3503-8363-8
Date Issued
2024-04-22
Author(s)
Event(s)
6th IEEE International Conference on AI Circuits and Systems, AICAS 2024
Publisher
IEEE
Type
conference paper