There is a column called email_please where people are free to provide their own email address.
Currently it's null for all users.
Additionally we have an id column which has a unique id provided by typeform.
I suggest:
- rename
email_please to email
- we use the
email column as the uid for purposes of anonymization
- in the ETL (data transformation step when the data is mungded we) we use the id value for the
email column when the email address hasn't been provided
- if an email has been provided then that is used
This setup allows us to capture cases where someone provides multiple survey answers and provides the same email address, and secondly also ensures we anonymize with respect to the email address, and as such never leak it.
There is a column called
email_pleasewhere people are free to provide their own email address.Currently it's
nullfor all users.Additionally we have an id column which has a unique id provided by typeform.
I suggest:
email_pleasetoemailemailcolumn as theuidfor purposes of anonymizationemailcolumn when the email address hasn't been providedThis setup allows us to capture cases where someone provides multiple survey answers and provides the same email address, and secondly also ensures we anonymize with respect to the email address, and as such never leak it.