• panda_abyss@lemmy.ca
    link
    fedilink
    arrow-up
    6
    ·
    19 days ago

    I think it’s unintentional, but LLM arena style benchmarks really favour sycophantic models, and it makes for a stickier product when your users are becoming emotionally dependent on it.