Data masking with fake data
We often have the use case that developers need "real" data to test features, debug errors or use a development / staging system.
Currently we have lots of hand crafted tools and scripts which export data from the live system and mask sensitive or customer-related fields.
It would be nice if the data masking would support faking data and replace existing data with fake / random generated values especially for values that fall under GDPR:
- first names
- last names
- integer / float values (eg. income)
- dates (birthday)
- address data
This way we could create a data masking pipeline which produces a masked copy with usable data which reflects the "real world" / production system.
2
votes
![](https://secure.gravatar.com/avatar/36b69a6df8460a773b7703067cd399c0?size=40&default=https%3A%2F%2Fassets.uvcdn.com%2Fpkg%2Fadmin%2Ficons%2Fuser_70-6bcf9e08938533adb9bac95c3e487cb2a6d4a32f890ca6fdc82e3072e0ea0368.png)