Imitation Learning

Behavioral cloning, DAgger, and learning from demonstrations.

To be done soon.