Paper Note - Anthropic Constitutional AI
Constitutional AI: Harmlessness from AI Feedback
Machine Learning Engineer in TSMC. Focus on DL / RL.
Constitutional AI: Harmlessness from AI Feedback
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
What is Human intent & Human Preference in RLHF?