Discussion about this post

User's avatar
Mike X Cohen, PhD's avatar

Least-squares *and* LLMs in the same post? Definitely something I want to read ☺️

Expand full comment
Neural Foundry's avatar

This series has been incredibly instructive. Aplying least squares to model GPT activations is such a clever approach to understanding LLM internals. The progression from theory to GPT2 analysis really demonstrates how foundational math translates into practcal ML applications. Can't wait to dive into the code on GitHub.

Expand full comment
3 more comments...

No posts

Ready for more?