The Synthetic Data Vault (SDV) enables end users to easily generate Synthetic Data for different data modalities, including single table, multi-table and time series data. With this ecosystem, we are releasing several years of our work building, testing and evaluating algorithms and models geared towards synthetic data generation. On this site you will find a number of open-source libraries, tutorials and other useful resources. We are constantly improving algorithms, APIs, and benchmarking methods to give you access to the latest innovations in the field. Try it, test it and give us feedback!
Explore docs, papers, videos, tutorials. Join our community slack.
Learn a variety of statistical and neural models and use them to synthesize data, evaluate the quality of the synthetic data.
Learn about different concepts that underpin synthetic data generation, evaluation and usage through our tutorials.