Commit 206e5cbf authored by Hudson Yeo

first commit

parent db92a2db
# DRL Risk
Repository for the individual project: Distributional Reinforcement Learning and the meaning of uncertainty in the return distribution.
The `code` folder contains several Jupyter notebooks with the full code for various aspects of the project. The table below describes the main files used in the report.

| File | Description |
| ----------- | ----------- |
| c51 | Main C51 file for the WCW environment |
| mdp | C51 agent for the MDP environment in Risk-aware action selection |
| safety_classifier_investigation | All code used for investigating the various safety classifiers (SCs) in the report, except QSC |
| DQN | QSC investigations |
| blackjack | SC investigations on Blackjack |
| Risky_MDP | Risky MDP environment with a C51 agent |
| polgrad_risky_mdp | Multi-Agent SC on the Risky MDP environment |
| polgrad_wcw | Multi-Agent SC on the WCW environment, including variants such as Inverted WCW and MWCW |
| IQN | IQN code, including a monotonically increasing quantile function |
| QRDQN | QRDQN code, used for graphs |