Harbor 🪼

❯

Machine Learning

❯

Reinforcement Learning

❯

Reinforcement Learning MOC

Reinforcement Learning MOC

May 20, 20261 min read

machine-learning
reinforcement-learning
moc
dashboard

Reinforcement Learning (RL) is a subfield of machine learning concerned with how intelligent agents ought to take actions in an environment to maximize the notion of cumulative reward.

Foundations

Reinforcement Learning - Core definitions, the agent-environment loop, and rewards.
Markov Decision Processes - The mathematical framework (States, Actions, Transitions).
Exploration vs Exploitation - The fundamental trade-off in learning.

Mathematical Tools

Bellman Equations - Recursive decomposition of value functions.
Dynamic Programming - Solving MDPs with known models (Policy/Value Iteration).

Model-Free Algorithms

Temporal Difference Learning - Learning from incomplete episodes.
Q-Learning - Off-policy value-based learning.
SARSA - On-policy value-based learning.

Deep Reinforcement Learning

Deep Reinforcement Learning - Scaling RL with Neural Networks.
Deep Q-Networks - Experience Replay and Target Networks.
Policy Gradient Methods - Directly optimizing the policy (PPO, TRPO).
Actor-Critic Methods - Hybrid approaches.

Advanced Topics

Multi-Agent Reinforcement Learning
Inverse Reinforcement Learning
Hierarchical RL

To Research / Inbox

Soft Actor-Critic (SAC)
Proximal Policy Optimization (PPO)
Curiosity-Driven Exploration

Foundations
Mathematical Tools
Model-Free Algorithms
Deep Reinforcement Learning
Advanced Topics
To Research / Inbox

Backlinks

Machine Learning MOC
Deep Reinforcement Learning
Reinforcement Learning
Dynamic Programming

Created with Quartz v4.5.2 © 2026