February
Papers
Links
- https://www.zeta-alpha.com/post/a-guide-to-neurips-2023-7-research-areas-and-10-spotlight-papers-to-read A Guide to NeurIPS 2023 — 7 Research Areas and 10 Spotlight Papers to Read
- https://en.wikipedia.org/wiki/List_of_regions_in_the_human_brain List of regions in the human brain
- https://postgrest.org/en/stable/ PostgREST - A standalone web server that turns a PostgreSQL database directly into a RESTful API
- https://karpathy.medium.com/virtual-reality-still-not-quite-there-again-5f51f2b43867 Virtual Reality: still not quite there, again.
- https://karpathy.github.io/2015/11/14/ai/ Short Story on AI: A Cognitive Discontinuity.
- https://fourmilab.ch/ Fourmilab
- https://www.zellic.io/blog/mpc-from-scratch/ MPC From Scratch: Everyone Can Do it!
- https://github.com/jzhang38/LongMamba LongMamba
- https://latecomermag.com/article/a-holistic-view-of-the-cell/ The Cell Is Not a Computer
- https://scrollprize.org/grandprize Vesuvius Challenge 2023 Grand Prize awarded: we can read the first scroll!
- https://www.nature.com/articles/d41586-024-00327-x It’s time to admit that genes are not the blueprint for life
- https://github.com/rlabbe/Kalman-and-Bayesian-Filters-in-Python Kalman and Bayesian Filters in Python
- https://edwardlib.org/tutorials/mixture-density-network Mixture Density Networks
- https://brilliant.org/wiki/gaussian-mixture-model/ Gaussian Mixture Model
- https://medium.com/saarthi-ai/xlnet-the-permutation-language-model-b30f5b4e3c1e Understand how the XLNet outperforms BERT in Language Modelling
- https://openai.com/research/video-generation-models-as-world-simulators Video generation models as world simulators
- https://en.wikipedia.org/wiki/Hyperparameter_optimization Hyperparameter optimization
- https://en.wikipedia.org/wiki/Artificial_intelligence Artificial intelligence
- https://largeworldmodel.github.io/ World Model on Million-Length Video and Language with RingAttention
- https://wow.groq.com/wp-content/uploads/2024/02/GroqISCAPaper2022_ASoftwareDefinedTensorStreamingMultiprocessorForLargeScaleMachineLearning.pdf A Software-defined Tensor Streaming Multiprocessor for Large-scale Machine Learning
- https://nicholas.carlini.com/writing/2024/my-benchmark-for-large-language-models.html My benchmark for large language models
- https://jackcook.com/2024/02/23/mamba.html Mamba: The Easy Way
- https://srush.github.io/annotated-mamba/hard.html Mamba: The Hard Way
- https://www.isattentionallyouneed.com/ Is Attention All You Need?
- https://www.reedbeta.com/blog/programmers-intro-to-unicode/ A Programmer’s Introduction to Unicode
- https://www.beren.io/2023-02-04-Integer-tokenization-is-insane/ Integer tokenization is insane
- https://www.lesswrong.com/posts/aPeJE8bSo6rAFoLqg/solidgoldmagikarp-plus-prompt-generation SolidGoldMagikarp (plus, prompt generation)
- https://distill.pub/2017/feature-visualization/ Feature Visualization
- https://www.leap-labs.com/ Leap Labs
- https://medium.com/@csi12345678949/paper-review-rare-tokens-degenerate-all-tokens-improving-neural-text-generation-via-adaptive-f6b6d80644f9 Paper review: Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings
- https://transformer-circuits.pub/2024/feb-update/index.html Circuits Updates - February 2024
- https://arbital.com/p/cev/ Coherent extrapolated volition (alignment target)
- https://www.bhoite.com/sculptures/ Free-formed electronic circuit sculptures
- https://wtfhappenedin1971.com/ WTF Happened In 1971?