fuscarili

fuscarili OP t1_j6zb7y9 wrote

This is the syllabus:

Reinforcement learning in non-sequential problems:

  • Non-contextual multi-armed bandits
  • Contextual multi-armed bandits

Reinforcement learning in sequential problems:

  • Dynamic programming. Bellman equation
  • Value-based methods
  • Policy-based methods
  • Actor-critic methods
  • Model-based methods

Would you say this all counts as basic material? I honestly have no clue.
