The Pond

SearchSearch

Search

  • About me
  • My research
  • All posts
  • Open source
  • Subscribe

Tag: human values

6 items with this tag.

  • 12/17/2022

    Positive Values Seem More Robust and Lasting than Prohibitions

    • shard theory
    • human values
    • AI

  • 11/29/2022

    Alignment Allows “Non-Robust” Decision-Influences and Doesn’t Require Robust Grading

    • shard theory
    • human values
    • AI

  • 9/9/2022

    Understanding and Avoiding Value Drift

    • human values
    • shard theory
    • rationality
    • AI

  • 9/4/2022

    The Shard Theory of Human Values

    • understanding the world
    • shard theory
    • human values
    • rationality
    • AI

  • 7/14/2022

    Humans Provide an Untapped Wealth of Evidence About Alignment

    • shard theory
    • human values
    • AI

  • 7/7/2022

    Human Values & Biases Are Inaccessible to the Genome

    • understanding the world
    • shard theory
    • human values