Skip to content

New observation #1

@montali

Description

@montali

Some key concepts:

  • Knowing where the station is, maybe a one-hot encoded array with compass-like directions -> can use this to actually avoid deadlocks when they are going in opposite directions
  • Heatmap to avoid others and go to stations: other trains are super hot, the destination is super cold. The temperature propagates in space and time: when our agent travels a path, it stays mildly hot for a while
  • Communication between trains
  • Deadlocks avoiding using a predictor
  • Newton's law: trains have a weight proportional to their distance to the station, and repell other trains when they're near; when a train is malfunctioning we'll have to avoid it
  • Normalized distance (like Manhattan) from the station
  • Punish deadlocks using rewards
  • When an agent gets to its destination, remove all its heat
  • Consider the flow in how the temperature propagates: no backward heatening

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions