Data masking with fake data
Developers often need "real" data to test features, debug errors, or work on a development / staging system.
Currently we rely on many handcrafted tools and scripts that export data from the live system and mask sensitive or customer-related fields.
It would be nice if data masking supported faking data, i.e. replacing existing values with fake / randomly generated ones, especially for values that fall under GDPR:
- first names
- last names
- integer / float values (e.g. income)
- dates (e.g. birthday)
- address data
This way we could build a data masking pipeline that produces a masked copy with usable data that still reflects the "real world" / production system.
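To make the request more concrete, here is a rough sketch of what such a faking step could look like. It is only an illustration in plain Python, not an existing feature: the field names, the small name and address pools, the `secret` parameter, and the functions `fake_value` and `mask_row` are all hypothetical. The sketch assumes each row arrives as a dictionary and that the fake value is derived from a hash of the original, so the same input always maps to the same fake output.

```python
import hashlib
import random
from datetime import date, timedelta

# Hypothetical sample pools; a real setup would use much larger lists.
FIRST_NAMES = ["Alex", "Sam", "Kim", "Robin", "Charlie"]
LAST_NAMES = ["Miller", "Schmidt", "Garcia", "Nowak", "Rossi"]
STREETS = ["Main Street", "Station Road", "Park Avenue", "Mill Lane"]
CITIES = ["Springfield", "Riverton", "Lakeside", "Hillview"]


def _rng_for(field, value, secret):
    """Build a random generator seeded from the field name, value and secret."""
    digest = hashlib.sha256(f"{secret}:{field}:{value}".encode("utf-8")).digest()
    return random.Random(int.from_bytes(digest[:8], "big"))


def fake_value(field, value, secret="change-me"):
    """Return a fake replacement for known sensitive fields (assumed names)."""
    rng = _rng_for(field, value, secret)
    if field == "first_name":
        return rng.choice(FIRST_NAMES)
    if field == "last_name":
        return rng.choice(LAST_NAMES)
    if field == "income":
        # Vary by up to +/-20 % so the value stays realistic.
        return round(value * rng.uniform(0.8, 1.2), 2)
    if field == "birthday":
        # Shift by up to one year in either direction.
        return value + timedelta(days=rng.randint(-365, 365))
    if field == "address":
        return f"{rng.randint(1, 199)} {rng.choice(STREETS)}, {rng.choice(CITIES)}"
    # Fields that are not listed above are passed through unchanged.
    return value


def mask_row(row, secret="change-me"):
    """Mask a single row given as a dict of column name to value."""
    return {field: fake_value(field, value, secret) for field, value in row.items()}


# Example row with made-up data.
row = {
    "id": 42,
    "first_name": "Erika",
    "last_name": "Mustermann",
    "income": 52000,
    "birthday": date(1985, 3, 14),
    "address": "Musterweg 1, Musterstadt",
}
print(mask_row(row))
```

Deriving the fake value from the original keeps references consistent across tables (the same last name is replaced by the same fake last name everywhere), which matters for debugging. On its own this would probably not be enough for GDPR purposes, since small pools and small shifts still leak information, so it should be read as a starting point only.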
Laurens Stötzel
shared this idea