Hello! Thanks for sharing the code and data for your excellent work. Could you please help me find where the human preference data for the trolley problems is located? I was trying to calculate the LLM misalignment scores, but couldn't find the gold human data to compare them against. Thank you!