Study: Some language reward models exhibit political bias

🃏Joker@sh.itjust.works · 15 days ago

Study: Some language reward models exhibit political bias

supersquirrel@sopuli.xyz · edit-2 15 days ago

“may also be biased, even when trained on statements known to be objectively truthful.”

I feel like computer science aggressively ignores the humanities/philosophy as a waste of time and then fundamentally undermines and hopelessly entraps itself in the wrong questions for doing so.