Projects

Data Lists

Data Lists are collections of values that can be used in anonymization and synthesis rules to replace or generate data. They provide a way to define custom sets of data that your transformations can use.

What are Data Lists?

Data Lists allow you to create named collections of values that can be referenced in your data transformation rules. These lists can contain simple text values or be populated from CSV files.

Example UI representation:
┌───────────────────────────────────────────────────────────────────────┐
│ Data Lists                                                            │
├───────────────────────────────────────────────────────────────────────┤
│ Search       Sort ▼                                                   │
├───────────────────────────────────────────────────────────────────────┤
│ Name         Values      Created by     Updated at     Actions        │
│ My Cities    NYC,LA,...  John Doe      2 hours ago                    │
│ Countries    CSV file    Jane Smith    1 day ago                      │
└───────────────────────────────────────────────────────────────────────┘

When to Use Data Lists?

Data Lists are particularly useful when:

  • You need to replace specific values with a predefined set of alternatives (e.g., replacing real city names with a list of fictional cities)
  • You want to generate synthetic data from a specific pool of values (e.g., generating realistic names from a list of common names)
  • You have domain-specific values that aren't covered by built-in generators (e.g., specific product codes, internal department names)
  • You need consistent data across multiple fields or tables

Creating a Data List

From Plain Text Values

  1. Click the Create button in the Data Lists view
  2. Enter a name and description for your data list
  3. Select "Plain text" as the source type
  4. Add comma-separated values in the text area
  5. Click Save
Example UI representation:
┌───────────────────────────────────────────────────────────────────────┐
│ New datalist                              [X]                         │
├───────────────────────────────────────────────────────────────────────┤
│ Name           │ My Company Departments                               │
│ Description    │ List of internal company departments                 │
│ Source type    │ [Plain text ●] [CSV ○]                               │
│ Values         │ HR,Finance,IT,Marketing,Sales                        │
│                │ Operations,Legal,R&D                                 │
├───────────────────────────────────────────────────────────────────────┤
│                          [Save] [Cancel]                              │
└───────────────────────────────────────────────────────────────────────┘

From CSV Files

  1. Click the Create button in the Data Lists view
  2. Enter a name and description for your data list
  3. Select "CSV" as the source type
  4. Configure CSV parsing options:
    • Check "Does the CSV contain headers?" if your file has a header row
    • Specify the separator character (default is comma)
  5. Upload your CSV file
  6. Click Save
Example UI representation:
┌───────────────────────────────────────────────────────────────────────┐
│ New datalist                              [X]                         │
├───────────────────────────────────────────────────────────────────────┤
│ Name           │ Customer Names                                       │
│ Description    │ Names for synthetic customer data generation         │
│ Source type    │ [Plain text ○] [CSV ●]                               │
│ ✓ Does the CSV contain headers?                                       │
│ Separator      │ ,                                                    │
│ Upload file    │ [Choose File]                                        │
├───────────────────────────────────────────────────────────────────────┤
│                          [Save] [Cancel]                              │
└───────────────────────────────────────────────────────────────────────┘

Using Data Lists in Rules

Data Lists can be used in both anonymization and synthesis rules.

Anonymization Example

Replace sensitive data with values from a Data List:

  1. In an anonymization rule, select the "List" function
  2. Choose your Data List from the dropdown
  3. Configure how values are inserted:
    • Sequential: Values are used in order
    • Random: Values are selected randomly
Example UI representation:
┌───────────────────────────────────────────────────────────────────────┐
│ List Function Configuration                                           │
├───────────────────────────────────────────────────────────────────────┤
│ Datalist      │ [Select datalist ▼]                                   │
│                 - My Cities                                           │
│                 - Countries                                           │
│ Insert values │ [Sequential ●] [Random ○]                             │
│ Pick value    │ [Column 1 ▼]                                          │
│ from          │                                                       │
└───────────────────────────────────────────────────────────────────────┘

Synthesis Example

Generate synthetic data using values from a Data List:

  1. In a synthesis rule, select the "List" function
  2. Choose your Data List from the dropdown
  3. Configure insertion method (Sequential or Random)

Benefits of Data Lists

  • Reusability: Create once, use in multiple rules
  • Consistency: Ensure the same values are used across transformations
  • Flexibility: Support both simple lists and complex CSV data
  • Organization: Keep your transformation data organized and named

Best Practices

  • Use descriptive names for your Data Lists
  • Keep lists updated when your data requirements change
  • When using CSV files, ensure they're properly formatted
  • Test your rules after making changes to Data Lists

On this page