Synthesized Platform

    • Get Started
      • Welcome to Synthesized
      • 5-Minute Quick Start
      • Connect Data Sources
      • How to Create a Workflow
      • JDBC URL Examples
      • Projects
      • Database Permissions
    • Guides
      • Masking Data
        • Masking Specific Column Types
        • Masking Transformers
      • Generating Data
        • Generation Overview
        • Generation Transformers
        • Generating XML Files
        • Generating CSV Files
      • Subsetting Data
        • Subsetting Overview
        • Data Filtering Techniques
      • Managing Data Properties
        • Schema Creation
        • Virtual Foreign Keys
        • Cyclical References
        • Changing Data Types
        • Triggers
        • Custom SQL Scripting
      • Advanced Workflows
        • Custom Transformations
        • TDK Agents / Workers
    • Core Concepts
      • What is the Synthesized Platform?
      • Architecture Overview
      • How Masking Works
      • How Generation Works
      • How Subsetting Works
      • PII Detection & Protection
    • Reference
      • Transformers
        • All Transformers
        • Masking Transformers
        • Generation Transformers
      • Configuration
        • Configuration Reference
        • Advanced Configuration
      • API Reference
        • CLI Reference
        • REST API Reference
        • API Access Keys
      • Data Types
        • Data Type Inventory
    • Deployment & Operations
      • Getting TDK into Production
        • Docker Compose
        • Kubernetes (Helm)
        • Offline Installation
      • Installation Methods
        • CLI Overview
        • CLI Installation
        • CLI with Docker
        • CLI with Kubernetes
        • AWS Marketplace
        • AWS EKS
        • AWS ECS Fargate
        • GCP Marketplace
      • Updating TDK
        • Update Docker Compose
        • Update Helm
        • Update Offline
      • Administration
        • User Management
        • Activation & Licensing
        • Roles and Permissions
    • Security & Compliance
      • Role-Based Access Control
      • Authentication
        • Single Sign-On (SSO)
        • LDAP/Active Directory
        • Kerberos
    • Integrations
      • Database Integrations
      • CI/CD Integrations
      • Cloud Integrations
      • Secret Management
        • AWS Secrets Manager, Azure Key Vault, HashiCorp Vault
      • Cloud Storage
        • AWS S3
        • CSV Files
      • External Tools
        • GitHub Actions
        • Testcontainers
        • AWS License Manager
    • Troubleshooting
      • Common Issues
        • Diagnostic Tools
      • Performance
        • Improving Performance
        • Database Locking Issues
      • Database-Specific
        • PostgreSQL Partitioned Tables
        • Oracle Sequences
        • MySQL TINYINT Issues
        • Zero Date Value Issues
        • SSL Connection Issues
      • Type Handling
        • Auto-generated Types
        • Custom Types
        • Geometric Types
    • Changelog
      • Governor
      • TDK
    • FAQ
    • Issues & Support
    • Privacy Policy
    • Data Processing Addendum
    • Terms of Service
Synthesized Platform v1.141
  • Home
    • default
  • Synthesized Platform
    • v1.141
    • v1.140
    • v1.139
  • Synthesized Platform
  • Guides
  • Subsetting Data
v1.141 v1.140 v1.139

Subsetting Data

Extract a smaller, referentially intact subset of your database for testing.

Overview

Subsetting reduces database size while maintaining referential integrity. Use subsetting to:

  • Create smaller test databases

  • Reduce cloud storage costs

  • Speed up test execution

  • Focus on specific data scenarios

Topics

  • Subsetting Basics

  • Data Filtering Strategies

See Also

  • How Subsetting Works

Generating CSV Files Subsetting Overview