Welcome to Synthesized

The Synthesized platform is your complete solution for database masking, synthetic data generation, and subsetting. Get privacy-compliant, realistic test data in minutes.

New to Synthesized? Start with our 5-Minute Quick Start to see the platform in action.

What is the Synthesized Platform?

The Synthesized platform provides a secure, privacy-preserving, tailored version of production data that can be used for many purposes:

  • Creating privacy-compliant replicas of production data for development, testing, and data engineering

  • Generating large amounts of data for performance testing and load testing

  • Subsetting production databases into smaller, manageable datasets while maintaining referential integrity

The platform gives you the ability to generate structured synthetic test data at the database level, replicating database structures and maintaining key features like referential integrity whilst also preserving data privacy.

Who It Helps

Target Roles

Software Engineers & Developers

Get realistic test data without waiting for sanitized production dumps

QA Engineers & Testers

Test at scale with production-like data that’s safe to use

DBAs

Quickly provision databases for development and testing environments

Data Engineers

Test data pipelines with realistic, compliant datasets

DevOps Teams

Automate test data creation in CI/CD workflows

Key Capabilities

Table 1. Core Features
Feature Description Key Benefits


Data Masking

Replace sensitive information with realistic but fake data

  • Data types and formats preserved

  • Referential integrity maintained

  • Statistical distributions kept

  • Business logic and constraints preserved


Data Generation

Create synthetic data from scratch that matches your database schema

  • Generate millions of rows quickly

  • Maintain foreign key relationships

  • Control data distributions and patterns

  • Support for complex data types


Data Subsetting

Extract smaller, representative samples from large databases

  • Referential integrity maintained automatically

  • Custom filtering criteria

  • Reduce database size by 90%+

  • Perfect for development environments

How It Works

Platform Workflow
Source DB → Connect → Analyze → Transform → Write → Destination DB

The Synthesized platform connects to your source and destination databases, analyzes the schema and data patterns, then applies transformations according to your workflow configuration:

Transformation Pipeline
Step Action Details

1. Connect

Connects to your source database and reads the schema

Secure connections via JDBC, supports all major databases

2. Analyze

Understands data types, constraints, and relationships

Automatic schema detection, relationship mapping

3. Transform

Applies masking, generation, or subsetting based on your workflow

Configurable transformers, maintains data integrity

4. Write

Outputs transformed data to your destination database

Batch processing, optimized performance

What’s Preserved

The platform maintains these critical elements from source to destination:

Schema

Table and column names

Data Types

All columns maintain their original data types

DDL

Constraints, procedures, views, sequences

Referential Integrity

Primary and Foreign keys

Key Cardinality

Foreign key distributions

Deployment Options

Deployment Type Use Case Get Started

Docker Compose

Quick local setup for development and testing

Quick Start Guide

Kubernetes (Helm)

Production-ready, scalable deployments

Helm Setup

CLI

Standalone command-line interface for automation

CLI Installation

Cloud Marketplaces

One-click deployments on AWS/GCP

AWSGCP

See Deployment & Operations for comprehensive deployment documentation.

What’s Next?

Get Started Now

Ready to see the platform in action?

Set up your environment and run your first transformation

Learn More

Explore the documentation:

  • Core Concepts - Understand the platform architecture and how it works

  • Guides - Detailed tutorials and best practices for common use cases

  • Reference - Complete API and transformer documentation

  • Deployment - Production deployment and operations guides

Need Help?

Find answers and solutions: