ML [uts-002B]

I don't want the work in Transformers: from self-attention to performance optimizations to be discontinued; [ferrando2024primer] appeared recently on this very topic.
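
For my own reference, the baseline that those performance optimizations build on is scaled dot-product self-attention. A minimal single-head sketch in NumPy (names and shapes are illustrative, not taken from any of the cited papers):

```python
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention.

    x: (seq_len, d_model) input embeddings.
    w_q, w_k, w_v: (d_model, d_head) projection matrices.
    """
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])            # (seq_len, seq_len)
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)          # row-wise softmax
    return weights @ v                                 # (seq_len, d_head)
```

The O(seq_len^2) score matrix here is exactly what the performance-optimization literature, and the linear attention work below, tries to avoid.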

I might need to follow up on the latest developments in linear attention mechanisms [peng2024eagle].
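
To fix the idea before reading further: "linear" attention replaces the softmax with a kernel feature map phi, so causal attention becomes a recurrence with O(seq_len) time and a constant-size state. A minimal sketch of the generic kernelized formulation of Katharopoulos et al. (2020) with phi(x) = elu(x) + 1; note that [peng2024eagle] (RWKV Eagle/Finch) uses a different, decay-based recurrence, so this shows only the basic mechanism:

```python
import numpy as np

def linear_attention(q, k, v):
    """Causal linear attention: a running state replaces the O(N^2)
    attention matrix. Feature map phi(x) = elu(x) + 1.

    q, k: (seq_len, d_key); v: (seq_len, d_val).
    """
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))  # elu(x) + 1 > 0
    q, k = phi(q), phi(k)
    s = np.zeros((k.shape[1], v.shape[1]))  # running sum of outer(k_t, v_t)
    z = np.zeros(k.shape[1])                # running sum of k_t (normalizer)
    out = np.empty_like(v)
    for t in range(q.shape[0]):
        s += np.outer(k[t], v[t])
        z += k[t]
        out[t] = (q[t] @ s) / (q[t] @ z + 1e-6)
    return out
```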

I have almost no understanding of diffusion models, so I should read [bao2023all] and related papers.
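
Before diving in, the one piece I do know: in DDPM-style diffusion the forward noising process has the closed form q(x_t | x_0) = N(sqrt(abar_t) * x_0, (1 - abar_t) * I), and training regresses the injected noise. A minimal sketch with the standard linear schedule (hyperparameters illustrative; [bao2023all] proposes the U-ViT backbone for the noise-prediction network, not this part):

```python
import numpy as np

T = 1000
betas = np.linspace(1e-4, 0.02, T)       # linear noise schedule (DDPM defaults)
alpha_bars = np.cumprod(1.0 - betas)     # abar_t = prod_{s<=t} (1 - beta_s)

def forward_diffuse(x0, t, rng):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(abar_t) * x0, (1 - abar_t) * I).

    A noise-prediction network eps_theta(x_t, t) would be trained to
    minimize ||eps - eps_theta(x_t, t)||^2 over random t and eps.
    """
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * eps
    return xt, eps

x0 = np.zeros((8, 8))                    # toy "image"
xt, eps = forward_diffuse(x0, t=500, rng=np.random.default_rng(0))
```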

I should also read [mikula2023magnushammer] and related papers.