Synthetic data generation for ML model development

I thought to check if anyone is using language models to create synthetic data for ML model development. Synthetic data for ML development must adhere to the actual data distribution.