|
df = combine_rows(df, 'Race', ['Race- self report', 'race', 'Race - self report']) |
The "Race - self report" shouldn't get merged in. Peter created two columns, one for "Race - SIS" that came from the student information system. And the other he copied from the survey responses as "Race - self report". He combined those columns to create a column that is used as the 'source of truth' called 'Race'. If information exists in the student information system, we use that. Otherwise we use the self report. Sometimes kids get cheeky and report their race as 'other' and self describe as 'apache helicopter'. The variations of the race column are "race", "RACE", and "Race".
@mnrva-dev
merge-csv/merge-csv.py
Line 224 in 73cfb0a
The "Race - self report" shouldn't get merged in. Peter created two columns, one for "Race - SIS" that came from the student information system. And the other he copied from the survey responses as "Race - self report". He combined those columns to create a column that is used as the 'source of truth' called 'Race'. If information exists in the student information system, we use that. Otherwise we use the self report. Sometimes kids get cheeky and report their race as 'other' and self describe as 'apache helicopter'. The variations of the race column are "race", "RACE", and "Race".
@mnrva-dev