Hacker News
by ashirviskas 14 hours ago
by cookiengineer 12 hours ago
Maybe read up on how transformers, their encoders and decoders, and the attention matrix work?
https://arxiv.org/abs/1706.03762