Michael Zhang
mzhang [at] cs [dot] stanford [dot] edu
About
Posts
Projects
Home
Updated
on
February 28, 2024
(Based) Simple linear attention language models balance the recall-throughput tradeoff