See Exploration Logs

File

Paper Queue

Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
¹ provides a theoretical and empirical analysis of the use of Centralized Critics in CTDE.

² introduces a new mutual information framework for MARL. This leads to the development of an algorithm called Variational Maximum Mutual Information, Multi-Agent Actor Critic which allows agents to coordinate simultaneous actions without latency.

Branching Reinforcement Learning by Du, and Chen (Jun 15, 2022)
Vinyals et al. (2019) Grandmaster level in StarCraft II using multi-agent reinforcement learning
- Linked in Self Play
Wu et al. (2017) Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
- Linked in Trust Region Policies
Ecoclimates — Climate-Response Modeling of Vegetation by Palubicki et al. (2022)
Ma et al. (2024) Foundation Methods for Music — A survey
Human-level play in the game of Diplomacy
All these papers from Large Language Model
- TransferTransfo — A Transfer Learning Approach for Neural Network based Conversational Agents by Wolf, Sanh, Chaumond, and Delangue (Feb 4, 2019)
- ⭐ BERT — Pre-Training of Deep Bidirectional Transformer for Language Understanding by Devlin, Chang, Lee, and Toutanova (May 24, 2019)
- Towards a Human-like Open-Domain Chatbot by Adiwardana et. al (Feb 27, 2020)
- Dense Passage Retrieval for Open-Domain Question Answering by Karpukhin et. al (Sep 30, 2020)
- TOD-BERT — Pre-trained Natural Language Understanding for Task-Oriented Dialogue by Wu, Hoi, Socher, and Xiong (November 2020)
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks by Lewis et. al., (2020)
- ⭐ LaMDA- Language Models for Dialog Applications by Thoppilan et. al (Feb 10, 2022)
- Language-Agnostic BERT Sentence Embedding by Feng et. al (Mar 8, 2022)
- ⭐ Training Compute-Optimal Large Language Models by Hoffmann et. al (Mar 29, 2022)
- Generating Training Data with Language Models- Towards Zero-Shot Language Understanding by Meng, Huang, Zhang, Han (Oct 12, 2022)
All these papers from Prompt Engineering
- Commonsense Knowledge Mining from Pretrained Models by Feldman, Davison and Rush (2019)
- ⭐ Prefix Tuning — Optimizing Continuous Prompts for Generation by Li and Liang (Jan 1, 2021)
- GPT Understands Too by Liu et. al (Mar 18, 2021)
- Calibrate Before Use — Improving Few-Shot Performance of Language Models by Zhao et. al (Jun 10, 2021)
- KnowPrompt — Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction by Zhang et. al (Jan 23, 2022)
- P-Tuning v2 - Prompt Tuning can be comparable to Fine-tuning Universally Across Scales and Tasks by Liu et. al (Mar 20, 2022)
- Complexity-Based Prompting for Multi-Step Reasoning by Fu et. al (Jan 30, 2023)

Backlogs

Note, some entries in Trivia are also interesting.
Forms of Government
Beam Search
Theory of Computation
- Algorithmic Information Theory
Japanese Mythology
Graph Neural Network - GNNs
Combinatorial Optimization
Ballistics
HEMA
Extreme Performance Artists. Prompted by this
Dramaturgy
Von Neumann Morgenstern Utility Theorem
Petri Nets
Hopfield Networks
General Method of Moments / Simulated Method of Moments
AlphaFold
Socionics
Chess Openings
Sound Design
Johnson-Lindenstrauss Lemma
Neuro Evolution of Augmented Topologies and similar algorithms
- Evolutionary Acquisition of Neural Topologies
- Compositional Pattern Producing Network
- NEAT Particles
Lanczos Algorithm (cited in Lanczos Networks)
Some interesting things to explore further - PSLQ, Sinkhorn Limits, Kruithoff Limits
- https://en.wikipedia.org/wiki/Iterative_proportional_fitting
Combinatorial Maps
Surface Nets
Halton Sequences
Bayesian Policy Reuse.
Theory of Mind
Birch and Swinnerton-Dyver Conjecture
Vexillology and Heraldry
Sewing / Tailoring / Cosplay
Bergman Divergence and in general Information Geometry
Haruhi Theorem and Superpermutations
Cattell’s 16 primary personality factors
Generative Flow Networks
JEPA
Fuzzy Computation and Fuzzy Logic
Mining Large Datasets
Gwern
https://math.libretexts.org/Bookshelves/Combinatorics_and_Discrete_Mathematics/Combinatorics_(Morris)

Bookstops

The Psychology of Money
Rigid Body Simulation - Nonpenetration constraints
Graph Theoretic Approaches for Swarms - Resume Ch. 4
Code Complete by McConnell - Resume Ch. 10
Philosophy - read through A New History of Western Philosophy By Anthony Kenny
Factory Physics - Workforce Planning.
Virtues and their Vices by Kevin and Craig -Cardinal Virtues, Intellectual Virtues, Theological Virtues
Linear and Nonlinear Programming by Luenberger and Ye - Resume Ch. 5
Drawing on the Right Side of the Brain by Edwards - Resume Ch. 8
Ordinary Differential Equations by Arnold - Ch. 6 (but restart all of Part 1) with better math background

Future Readings

Alex Sludd - includes Silicon Photonics, running Neural Networks through light meshes.
Better Videogame Characters by Design by Katherine Isbister
Christopher Alexander principles of architecture. To explore further. In particular
- A Pattern Language
- The Nature of Order books 1 -4
The Advantage by Patrick Lencioni
One Page Design by Stone Librande
Unmasking the Social Engineer: The Human Element of Security
It’s Not All About “Me”: The Top Ten Techniques for Building Quick Rapport with Anyone
https://www.amazon.com/Business-Dynamics-Systems-Thinking-Modeling/dp/007238915X
GPU Gems 1, 2, 3 - Notes on Realtime Computer Graphics
Lord of Mysteries - https://lordofthemysteries.fandom.com/wiki/Demoness_Pathway. Interesting from a Worldbuilding Perspective
From The Depths - specifically focusing on digging through the design and interaction between systems.

Notemarks

Geometric Deep Learning - stopped at the intro portion of the paper. Need to review Group Theory first.
Forecasting — specifically classical techniques for Time Series Analysis.
Abnormal Psychology. Specifically Mental Illnesses.
Linear Models

Utilities

Beall’s List - provides a list of predatory journals and publishers.
Library Genesis - site for searching millions of books
Anna’s Archive - site for searching millions of books
phind - a search engine that makes use of an LLM under the hood.
Pi.ai - an online personal assistant chatbot as an alternative to ChatGPT.
[Chat Paper](https://chatpaper.com/) - AI powered tool for reviewing research papers

General Knowledge Repositories

Seita’s Place
Very Short Introductions - A book series.
Kevin’s Habits
The Stacks Project - an open source resource for Algebraic Geometry.
Nlab - wiki on mathematics, physics, and philosophy (from a pure math perspective)
IQuiLezles - math, art and computer graphics
Math3ma - math
For h in hexes - tabletop
HackerFactor - Security

What to Learn?

Universities provide curricula that can aid in the self learning process. The autodidact can either follow these curricula, figure out prerequisites needed for a field of study or simply explore what is out there
Another great source Open Syllabus

Curios

The Library of Babel Website

Lyu et al. (2023) On Centralized Critics in Multi-Agent Reinforcement Learning ↩
Kim, Jung, Cho, Sung (2020) A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning ↩

Table of Contents

Graph View

Backlinks

The Library

The Antilibrary

File

Paper Queue

Backlogs

Bookstops

Future Readings

Notemarks

Utilities

General Knowledge Repositories

What to Learn?

Curios

Table of Contents

Graph View

Backlinks

The Antilibrary

File §

Paper Queue §

Backlogs §

Bookstops §

Future Readings §

Notemarks §

Utilities §

General Knowledge Repositories §

What to Learn? §

Curios §

Footnotes §

File

Paper Queue

Backlogs

Bookstops

Future Readings

Notemarks

Utilities

General Knowledge Repositories

What to Learn?

Curios

Footnotes