Job Description
Seamless.AI is seeking a Freelance Principal Data Engineer to design, develop, and maintain scalable ETL pipelines. The ideal candidate will have expertise in Python, Spark, AWS Glue, and other ETL technologies. The candidate will work autonomously and deliver high-quality solutions.Role involves:
- Designing, developing, and maintaining scalable ETL pipelines.
- Working with stakeholders to understand data requirements.
- Implementing data transformation logic using Python.
- Utilizing AWS Glue to create and manage ETL jobs.
- Optimizing ETL processes for large datasets.
- Applying data matching, deduplication, and aggregation techniques.
- Ensuring compliance with data governance, security, and privacy practices.
- Providing recommendations on emerging technologies.
Requirements:
- Strong proficiency in Python and experience with related libraries.
- Hands-on experience with AWS Glue or similar ETL tools.
- Solid understanding of data modeling and data warehousing principles.
- Expertise in working with large datasets and distributed computing frameworks.
- Strong proficiency in SQL.
- Familiarity with data matching, deduplication, and aggregation methodologies.
- Excellent communication and collaboration skills.
- Fluency in English and Spanish.
- 7+ years of experience as a Data Engineer.
- Professional experience with Spark and AWS pipeline development.
Role offers:
- Opportunity to work as an independent contractor.
- Flexible contract terms (6 or 12 months).
- Compensation based on project milestones or a fixed contractual rate.