- See Exploration Logs
File
- Demo and Writeup for the AI of a Tactical Top Down Shooter
- Final Fantasy VII - Game AI Writeup
- Predictive Aiming
- Introversion Games - Subversion
Paper Queue
-
1 provides a theoretical and empirical analysis of the use of Centralized Critics in CTDE.
- 2 introduces a new mutual information framework for MARL. This leads to the development of an algorithm called Variational Maximum Mutual Information, Multi-Agent Actor Critic which allows agents to coordinate simultaneous actions without latency.
-
Branching Reinforcement Learning by Du, and Chen (Jun 15, 2022)
-
Vinyals et al. (2019) Grandmaster level in StarCraft II using multi-agent reinforcement learning
- Linked in Self Play
-
Wu et al. (2017) Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
- Linked in Trust Region Policies
-
Ecoclimates — Climate-Response Modeling of Vegetation by Palubicki et al. (2022)
-
Ma et al. (2024) Foundation Methods for Music — A survey
-
All these papers from Large Language Model
- TransferTransfo — A Transfer Learning Approach for Neural Network based Conversational Agents by Wolf, Sanh, Chaumond, and Delangue (Feb 4, 2019)
- ⭐ BERT — Pre-Training of Deep Bidirectional Transformer for Language Understanding by Devlin, Chang, Lee, and Toutanova (May 24, 2019)
- Towards a Human-like Open-Domain Chatbot by Adiwardana et. al (Feb 27, 2020)
- ⭐ Language Models are Few-Shot Learners by Brown et. al, (Jul. 22, 2020)
- Dense Passage Retrieval for Open-Domain Question Answering by Karpukhin et. al (Sep 30, 2020)
- TOD-BERT — Pre-trained Natural Language Understanding for Task-Oriented Dialogue by Wu, Hoi, Socher, and Xiong (November 2020)
- ⭐Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks by Lewis et. al., (2020)
- ⭐ LaMDA- Language Models for Dialog Applications by Thoppilan et. al (Feb 10, 2022)
- Language-Agnostic BERT Sentence Embedding by Feng et. al (Mar 8, 2022)
- ⭐ Training Compute-Optimal Large Language Models by Hoffmann et. al (Mar 29, 2022)
- Generating Training Data with Language Models- Towards Zero-Shot Language Understanding by Meng, Huang, Zhang, Han (Oct 12, 2022)
- ⭐ LLaMA- Open and Efficient Foundation Language Models by Touvron et. al (Feb 27, 2023)
- ⭐ OpenAGI—When LLM Meets Domain Experts by Ge et. al (Apr 12, 2023)
-
All these papers from Prompt Engineering
- Commonsense Knowledge Mining from Pretrained Models by Feldman, Davison and Rush (2019)
- ⭐ Prefix Tuning — Optimizing Continuous Prompts for Generation by Li and Liang (Jan 1, 2021)
- GPT Understands Too by Liu et. al (Mar 18, 2021)
- Calibrate Before Use — Improving Few-Shot Performance of Language Models by Zhao et. al (Jun 10, 2021)
- ⭐Pre-train Prompt and Predict- A systematic survey of prompting methods in Natural Language Processing by Liu et. al (Jul 28, 2021) - A survey of different prompting techniques.
- KnowPrompt — Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction by Zhang et. al (Jan 23, 2022)
- P-Tuning v2 - Prompt Tuning can be comparable to Fine-tuning Universally Across Scales and Tasks by Liu et. al (Mar 20, 2022)
- ⭐ Chain-Of-Thought Prompting Elicits Reasoning in Large Language Models by Wei et. al (Jan 10, 2023)
- Complexity-Based Prompting for Multi-Step Reasoning by Fu et. al (Jan 30, 2023)
Backlogs
-
Note, some entries in Trivia are also interesting.
-
Forms of Government
-
- Algorithmic Information Theory
-
Japanese Mythology
-
Graph Neural Network - GNNs
-
Combinatorial Optimization
-
Ballistics
-
Extreme Performance Artists. Prompted by this
-
General Method of Moments / Simulated Method of Moments
-
Chess Openings
-
Neuro Evolution of Augmented Topologies and similar algorithms
-
Lanczos Algorithm (cited in Lanczos Networks)
-
Some interesting things to explore further - PSLQ, Sinkhorn Limits, Kruithoff Limits
-
Bayesian Policy Reuse.
-
Theory of Mind
-
Vexillology and Heraldry
-
Sewing / Tailoring / Cosplay
-
Bergman Divergence and in general Information Geometry
-
Cattell’s 16 primary personality factors
-
Fuzzy Computation and Fuzzy Logic
Bookstops
- Rigid Body Simulation - Nonpenetration constraints
- Graph Theoretic Approaches for Swarms - Resume Ch. 4
- Code Complete by McConnell - Resume Ch. 10
- Philosophy - read through A New History of Western Philosophy By Anthony Kenny
- Factory Physics - Workforce Planning.
- Virtues and their Vices by Kevin and Craig -Cardinal Virtues, Intellectual Virtues, Theological Virtues
- Linear and Nonlinear Programming by Luenberger and Ye - Resume Ch. 5
- Drawing on the Right Side of the Brain by Edwards - Resume Ch. 8
- Ordinary Differential Equations by Arnold - Ch. 6 (but restart all of Part 1) with better math background
Future Readings
-
Alex Sludd - includes Silicon Photonics, running Neural Networks through light meshes.
-
Better Videogame Characters by Design by Katherine Isbister
-
Christopher Alexander principles of architecture. To explore further. In particular
- A Pattern Language
- The Nature of Order books 1 -4
-
The Advantage by Patrick Lencioni
-
One Page Design by Stone Librande
-
Unmasking the Social Engineer: The Human Element of Security
-
It’s Not All About “Me”: The Top Ten Techniques for Building Quick Rapport with Anyone
-
https://www.amazon.com/Business-Dynamics-Systems-Thinking-Modeling/dp/007238915X
-
GPU Gems 1, 2, 3 - Notes on Realtime Computer Graphics
-
Lord of Mysteries - https://lordofthemysteries.fandom.com/wiki/Demoness_Pathway. Interesting from a Worldbuilding Perspective
-
From The Depths - specifically focusing on digging through the design and interaction between systems.
Notemarks
- Geometric Deep Learning - stopped at the intro portion of the paper. Need to review Group Theory first.
- Forecasting — specifically classical techniques for Time Series Analysis.
- Abnormal Psychology. Specifically Mental Illnesses.
- Linear Models
Utilities
-
Beall’s List - provides a list of predatory journals and publishers.
-
Library Genesis - site for searching millions of books
-
Anna’s Archive - site for searching millions of books
-
phind - a search engine that makes use of an LLM under the hood.
-
Pi.ai - an online personal assistant chatbot as an alternative to ChatGPT.
-
[Chat Paper](https://chatpaper.com/) - AI powered tool for reviewing research papers
General Knowledge Repositories
-
Very Short Introductions - A book series.
-
The Stacks Project - an open source resource for Algebraic Geometry.
-
Nlab - wiki on mathematics, physics, and philosophy (from a pure math perspective)
-
IQuiLezles - math, art and computer graphics
-
Math3ma - math
-
For h in hexes - tabletop
-
HackerFactor - Security
What to Learn?
- Universities provide curricula that can aid in the self learning process. The autodidact can either follow these curricula, figure out prerequisites needed for a field of study or simply explore what is out there
Curios
Footnotes
-
Lyu et al. (2023) On Centralized Critics in Multi-Agent Reinforcement Learning ↩
-
Kim, Jung, Cho, Sung (2020) A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning ↩