The Pond

SearchSearch

Search

  • About me
  • My research
  • All posts
  • Open source
  • Subscribe

Tag: critique

10 items with this tag.

  • 1/20/2026

    No Instrumental Convergence without AI Psychology

    • instrumental convergence
    • critique
    • AI

  • 7/25/2025

    English Writes Numbers Backwards

    • understanding the world
    • critique

  • 6/29/2025

    Authors Have a Responsibility to Communicate Clearly

    • practical
    • critique

  • 1/15/2025

    Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses

    • critique
    • deepmind
    • AI

  • 3/5/2024

    Many Arguments for AI X-Risk Are Wrong

    • critique
    • AI

  • 2/10/2024

    Dreams of AI Alignment: The Danger of Suggestive Names

    • rationality
    • critique
    • AI

  • 1/19/2024

    Don’t Use the “Shoggoth” Meme to Portray LLMs

    • critique
    • AI

  • 12/2/2022

    Inner and Outer Alignment Decompose One Hard Problem Into Two Extremely Hard Problems

    • shard theory
    • critique
    • AI

  • 11/26/2022

    Don’t Align Agents to Evaluations of Plans

    • critique
    • AI

  • 11/18/2022

    Don’t Design Agents Which Exploit Adversarial Inputs

    • critique
    • AI