arXiv Better Synthetic Data by Retrieving and Transforming Existing Datasets