GCP & PySpark with ETL - Lead
This lead role focuses on designing and managing large-scale ETL pipelines using PySpark on Google Cloud Platform. You will work with BigQuery, Dataflow, Cloud Composer, and Cloud Storage to build reliable and scalable data solutions. The role involves optimizing Spark jobs, ensuring data quality, and implementing governance and security best practices. You will collaborate closely with engineers, analysts, and business stakeholders while leading troubleshooting and performance optimization efforts.