Faster Convergence 

Training an agent now still takes a long time. The particular [experiment](https://wandb.ai/costa-huang/gym-microrts/runs/2v658xqx) in #36 took 4d 9h 11m 14s to finish. 

Looking at the reward chart, it appears the agent could achieve 70% of the final performance in just 50M steps (or  about 10 hours into training)

![image](https://user-images.githubusercontent.com/5555347/151291819-8cd5111b-b003-4c07-bd20-cbbd5bd0cad3.png)

We should try to optimize based on the 10 hours time computational budget.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Faster Convergence #51

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Faster Convergence #51

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions