Brayden Zhang

    Folder: Notes/Reinforcement-Learning

    8 items under this folder.

    • Sep 01, 2025: HOME - Deep Reinforcement Learning (tags: robotics, tutorial)
    • Apr 20, 2025: Reinforcement Learning from Human Feedback
    • Mar 08, 2025: Policy Gradient
    • Mar 07, 2025: Actor-Critic Methods
    • Mar 07, 2025: Multi-Agent Reinforcement Learning
    • Mar 07, 2025: Proximal Policy Optimization (PPO)
    • Mar 07, 2025: Q-Learning
    • Mar 06, 2025: The Ingredients of RL

