AIs are more likely to mislead people if trained on human feedback

posted on Oct. 02, 2024 at 6:38 pm

Striving to come up with answers that please humans may make chatbots more likely to pull the wool over our eyes

JuSun/Getty Images

Giving AI chatbots human feedback on their responses seems to make them better at giving convincing, but wrong, answers.

The raw output of large language models (LLMs), which power chatbots like ChatGPT, often contains biased, harmful or irrelevant information, and their style of interaction can seem unnatural to humans. To get around this, developers typically have people evaluate a model’s responses and then fine-tune the model based on this feedback.

PennsylvaniaDigitalNews.com, October 2, 2024
