Roy Tsai

Machine Learning Engineer at TSMC. Focused on DL / RL.

Paper Note - Anthropic Constitutional AI

Constitutional AI: Harmlessness from AI Feedback

Paper Note - Anthropic RLHF

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Align Human Intent

What Are Human Intent and Human Preference in RLHF?
