August
Papers
- A Mathematical Framework for Transformer Circuits
- PMET: Precise Model Editing in a Transformer
- Code Llama: Open Foundation Models for Code
Links
- https://github.com/hackclub/putting-the-you-in-cpu A technical explainer of how your computer runs programs, from start to finish.
- https://bioconductor.org/books/3.13/OSCA.basic/index.html Basics of Single-Cell Analysis with Bioconductor
- https://www.brainpost.co/ Weekly summaries of the latest neuroscience publications
- https://www.lesswrong.com/posts/b5HNYh9ne5vEkX5ag/one-layer-transformers-aren-t-equivalent-to-a-set-of-skip One-layer transformers aren’t equivalent to a set of skip-trigrams
- https://distill.pub/2020/circuits/ Thread: Circuits
- https://pair.withgoogle.com/explorables/grokking/ Do Machine Learning Models Memorize or Generalize?
- https://www.smashthestack.org/main.html Smash the Stack