y0news
#socio-cognitive (1 article)
AI · Neutral · arXiv – CS AI · 15h ago · 5/10

Evaluating LLM Alignment With Human Trust Models

Researchers analyzed how the GPT-J-6B language model internally represents and reasons about trust by comparing its embeddings to established human trust models. The study found that the model's trust representation aligns most closely with the Castelfranchi socio-cognitive model, suggesting that LLMs encode social concepts such as trust in ways that resemble human theories of them.