Inbuilt Types and Detectors

PII Types

Tokern can detect multiple types of PII. The built-in types are:

  • Phone
  • Email
  • Credit Card
  • Address
  • Person/Name
  • Birth Date
  • Gender
  • Nationality
  • SSN
  • User Name
  • Password
  • Zip Code
  • PO Box

ColumnNameRegexDetector

Tokern matches column names against the following regular expressions to detect sensitive data.

  • PERSON
^.*(firstname|fname|lastname|lname|fullname|fname|maidenname|_name|nickname|name_suffix|name).*$

  • Email
^.*(email|e-mail|mail).*$

  • BIRTH_DATE
^.*(date_of_birth|dateofbirth|dob|birthday|date_of_death|dateofdeath|birthdate).*$

  • GENDER
^.*(gender).*$

  • NATIONALITY
^.*(nationality).*$

  • ADDRESS
^.*(address|city|state|county|country|zone|borough).*$

  • ZIP_CODE
^.*(zipcode|zip_code|postal|postal_code|zip).*$

  • USER_NAME
^.*user(id|name|).*$

  • PASSWORD
^.*pass.*$

  • US Social Security Number
^.*(ssn|social_number|social_security|social_security_number|social_security_no).*$

  • PO_BOX
^.*(po_box|pobox).*$
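
The patterns above can be tried against column names with a short Python sketch. This is not Tokern's implementation: the dictionary name, the `detect_column_types` helper, the type keys, and the case-insensitive flag are all assumptions made for illustration.

```python
import re

# Patterns copied from the list above. The dictionary layout, the type keys
# and the IGNORECASE flag are assumptions made for this sketch.
COLUMN_NAME_PATTERNS = {
    "PERSON": r"^.*(firstname|fname|lastname|lname|fullname|fname|maidenname|_name|nickname|name_suffix|name).*$",
    "EMAIL": r"^.*(email|e-mail|mail).*$",
    "BIRTH_DATE": r"^.*(date_of_birth|dateofbirth|dob|birthday|date_of_death|dateofdeath|birthdate).*$",
    "GENDER": r"^.*(gender).*$",
    "NATIONALITY": r"^.*(nationality).*$",
    "ADDRESS": r"^.*(address|city|state|county|country|zone|borough).*$",
    "ZIP_CODE": r"^.*(zipcode|zip_code|postal|postal_code|zip).*$",
    "USER_NAME": r"^.*user(id|name|).*$",
    "PASSWORD": r"^.*pass.*$",
    "SSN": r"^.*(ssn|social_number|social_security|social_security_number|social_security_no).*$",
    "PO_BOX": r"^.*(po_box|pobox).*$",
}

# Compile once so repeated lookups do not re-parse the patterns.
COMPILED = {
    pii_type: re.compile(pattern, re.IGNORECASE)
    for pii_type, pattern in COLUMN_NAME_PATTERNS.items()
}


def detect_column_types(column_name):
    """Return every PII type whose pattern matches the column name (hypothetical helper)."""
    return [pii_type for pii_type, rx in COMPILED.items() if rx.match(column_name)]


if __name__ == "__main__":
    for name in ("first_name", "email_address", "username", "order_total"):
        print(name, detect_column_types(name))
```

Because the patterns are broad, one column name can match more than one type. In this sketch `email_address` matches both the Email and Address patterns, and `username` matches both Person and User Name, so a caller would need its own rule for picking one.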

DatumRegexDetector

Tokern uses the CommonRegex Improved package to scan a sample set of rows for sensitive data. This method recognizes the following data categories:

  • Phone
  • Email
  • Credit Card
  • Address
  • US Social Security Number
  • Zip Code
  • PO Box
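
The idea of scanning sampled values can be sketched with Python's standard `re` module. The two patterns below are simplified stand-ins written for this example; they are not the patterns shipped with CommonRegex Improved, and `VALUE_PATTERNS` and `scan_sample` are hypothetical names.

```python
import re

# Illustrative patterns only. They are NOT taken from CommonRegex Improved
# and cover just two of the categories listed above.
VALUE_PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def scan_sample(values):
    """Return the set of PII types found in a sample of column values (hypothetical helper)."""
    found = set()
    for value in values:
        text = str(value)  # sampled values may be numbers or None
        for pii_type, pattern in VALUE_PATTERNS.items():
            if pattern.search(text):
                found.add(pii_type)
    return found


if __name__ == "__main__":
    sample = ["alice@example.com", "n/a", "123-45-6789"]
    print(scan_sample(sample))
```

Scanning values complements the column-name check: a column with an uninformative name such as `field_7` would still be flagged if its sampled rows contain matching data.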