Minimal offline RL scaffold featuring a DQN baseline on Peg Solitaire (7x7 and 4x4 variants), deterministic seeding, YAML config, JSONL metrics, and plotting utilities. Designed for reproducible ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results