On page 503, the text “The weights are often visualized as attention maps, like in the following image:” does not have a diagram that follow after it. The next text describes details of the missing diagram. “There’s a column for each token in the input sequence …”
Hey @karakots, I’m sorry the image isn’t showing for you. Is this in the PDF version or the print edition? We’ll be sure to get that fixed in the next update.
In the meantime, here’s what it’s supposed to look like so you can follow along with the chapter. Thanks!