Understanding Neural Network Architectures with Attention and Diffusion — Michal Karzynski