The Pond

SearchSearch

Search

  • About me
  • My research
  • Posts
  • Subscribe

Tag: human values

6 items with this tag.

  • 12/17/2022

    Positive Values Seem More Robust and Lasting than Prohibitions

      shard theoryhuman valuesAI

  • 11/29/2022

    Alignment Allows “Non-Robust” Decision-Influences and Doesn’t Require Robust Grading

      shard theoryhuman valuesAI

  • 9/9/2022

    Understanding and Avoiding Value Drift

      human valuesshard theoryrationalityAI

  • 9/4/2022

    The Shard Theory of Human Values

      understanding the worldshard theoryhuman valuesrationalityAI

  • 7/14/2022

    Humans Provide an Untapped Wealth of Evidence About Alignment

      shard theoryhuman valuesAI

  • 7/7/2022

    Human Values & Biases Are Inaccessible to the Genome

      understanding the worldshard theoryhuman values