This is my attempt at Trexquant 25 Hangman challenge. Trexquant reserves all rights to the question and related code. The rest is distributed freely under GPLv3. Any suggestions or improvements are welcome!
gymnasium_env: Custom HangmanEnv implementation of OpenAI's gymnasium envbible.py: Interface for reading the datasettest_agent_accuracy.py: Tests the agent's accuracy on randomly sampled training data plus new wordsrecurrent_maskable: Custom combined stablebaselines3 implementation of Recurrent and Maskable PPO by wdlctctext_files/: Custom training datasetsconfig.yaml: Main config file to perform simulationscolab_script.sh: bash script to automate colab runlocal_run.sh: Sample the word list to test the agent on different word lengths*.py: Different agent instances
- Clone the repository
pip install -r requirements.txtpip install -e ./pip install -e recurrent_maskable/- Configure
config.yamlwith appropriate parameters - Choose your favourite agent and run
python agent.py. For example, to run RecurrentMaskablePPO, executepython combined_ppo.py - Run
python test_agent_accuracy.pyto locally check the accuracy of the agent after training