Job Description
Seamless.AI is seeking a Freelance Principal Data Engineer to design, develop, and maintain scalable ETL pipelines. The candidate will work with stakeholders to understand data requirements and propose effective data acquisition and integration strategies. They will implement data transformation logic using Python and relevant frameworks, ensuring efficiency and reliability. The role involves utilizing AWS Glue or similar tools to create and manage ETL jobs, workflows, and data catalogs. The Freelance Principal Data Engineer will optimize ETL processes to improve performance and scalability, particularly for large datasets. They will apply data matching, deduplication, and aggregation techniques to enhance data accuracy and quality.
Responsibilities:
- Design, develop, and maintain scalable ETL pipelines.
- Work with stakeholders to understand data requirements.
- Implement data transformation logic using Python.
- Utilize AWS Glue to manage ETL jobs.
- Optimize ETL processes for large datasets.
- Apply data matching and aggregation techniques.
- Ensure compliance with data governance and security practices.
- Provide recommendations on emerging technologies.
Requirements:
- Strong proficiency in Python and related frameworks.
- Hands-on experience with AWS Glue or similar ETL tools.
- Solid understanding of data modeling and data warehousing principles.
- Expertise in working with large datasets and distributed computing frameworks.
- Strong proficiency in SQL.
- Familiarity with data matching and aggregation methodologies.
- Excellent communication and collaboration skills.
- Fluency in English and Spanish.
- 7+ years of experience as a Data Engineer.
Seamless.AI offers:
- A contract position with possible renewal.
- Compensation based on project milestones or a fixed contractual rate.