This solution demonstrates a design pattern for implementing data preparation with a serverless AWS Glue ETL pipeline and Amazon SageMaker Data Wrangler in an end-to-end machine learning (ML) workflow.
This repository provides a Python library for building and testing AWS Glue Custom Blueprints locally. It also provides sample blueprints that address common ETL use cases. Crawling Amazon S3 locations: ...
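
To make the S3 crawling use case more concrete, here is a minimal sketch of how the parameters for a Glue crawler over S3 locations might be assembled. It does not use the repository's blueprint library; the helper name `build_s3_crawler_request` and all example values (crawler name, role ARN, database, bucket paths) are hypothetical. The dictionary keys follow the request shape of the AWS Glue `CreateCrawler` API.

```python
from typing import Any, Dict, List


def build_s3_crawler_request(
    name: str,
    role_arn: str,
    database: str,
    s3_paths: List[str],
) -> Dict[str, Any]:
    """Build keyword arguments for a Glue CreateCrawler request.

    Hypothetical helper for illustration only; it is not part of the
    repository's library. It performs no AWS calls.
    """
    if not s3_paths:
        raise ValueError("at least one S3 path is required")
    for path in s3_paths:
        # Reject anything that is not an S3 URI before building the request.
        if not path.startswith("s3://"):
            raise ValueError(f"not an S3 URI: {path}")
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        # One S3 target per location to crawl.
        "Targets": {"S3Targets": [{"Path": p} for p in s3_paths]},
    }


if __name__ == "__main__":
    # All values below are placeholders, not real resources.
    request = build_s3_crawler_request(
        name="example-crawler",
        role_arn="arn:aws:iam::123456789012:role/ExampleGlueRole",
        database="example_db",
        s3_paths=["s3://example-bucket/raw/", "s3://example-bucket/curated/"],
    )
    print(request)
```

With AWS credentials configured, the resulting dictionary could be passed to `boto3.client("glue").create_crawler(**request)`. Keeping the request construction in a pure function makes it possible to test locally without touching AWS.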