Heath

Member ID: 161663

Integrity verification:

  • Name:

    Heath
  • Nationality:

    Australia
  • Gender:

    Male
  • Age:

    36
  • Education:

    University Degree
  • Work experience:

    No experience
  • Employment type:

    Full-time
  • Current location:

    Overseas
  • Preferred work location:

    All
  • Expected salary:

    Negotiable
  • Desired occupation:

  • Available start date:

    To be discussed
  • Registered:

    2025-02-12 09:59
  • Last login:

    2025-02-12 09:59
  • Visa type:

    No Visa
  • Visa expiry date:

    0000-00-00
  • Contact information:

    Visible to VIP members only

Introduction:

For Outlier, I worked on reinforcement learning from human feedback (RLHF). This involved crafting questions within set parameters to elicit incorrect responses from the AI model, identifying and correcting the errors, and rating and critiquing the model's instruction-following and truthfulness.