How to run the PII scan and apply transformers automatically

Governor supports scanning columns for personally identifiable information (PII) before masking. You can run PII scanning and apply transformers from Governor by following these steps:

  1. Open the workflow you want to scan

  2. Open "Sensitive data scan" from the left menu

  3. Click the "Run scanning" button

  4. Visually verify the list of database columns considered sensitive, displayed after scanning is complete. You can manually override columns to declare them sensitive or non-sensitive by selecting them and clicking the update sensitivity button.

  5. Select all columns and click the "Apply suggested transformers" button; this will update your YAML configuration to include the suggested transformations.

  6. Check the "Configuration" tab to see which transformers were applied

Columns are scanned by matching their names to regular expressions (Column regex) and by matching their data entries with to regular expressions (Row regex). //or by machine learning scans of the entries (Row ML). Some entries are also tested for validity with respect to their check digits to avoid false positives for numbers which are known to satisfy Luhn’s Algorithm (Luhn).

The following table shows which data types can be detected using which methods.

Property Description

Email addresses

Column regex, Row regex

Phone number

Column regex, Row regex

Credit card number

Column regex, Row regex

Social Security Number/National Insurance number (SSN/NI)

Column regex, Row regex

IPv4 address

Column regex, Row regex

IPv6 address

Column regex, Row regex

International Bank Account Number (IBAN)

Column regex, Row regex

Bank Identifier Code (BIC/SWIFT)

Column regex, Row regex

MAC address

Column regex, Row regex

Passport number

Column regex, Row regex

License plate number

Column regex

Date of birth

Column regex

Company name

Column regex

Person name

Column regex

Postcode

Column regex

Salary

Column regex

Race

Column regex

Religion

Column regex

Sexual orientation

Column regex

Domain name

Column regex

Github token

Column regex

Geolocation

Column regex

Address

Column regex

Nationality

Column regex

Gender

Column regex

Password

Column regex

Fax number

Column regex

Tax Identification Number

Row regex, Luhn

Legal Entity Identifier

Row regex, Luhn