Deep Q-learning

Some tricks used include:

Emacs 29.4 (Org mode 9.6.15)