Job Description
PhonePe Group is seeking a Site Reliability Engineer (SRE) to focus on its on-premises Data Platform. The candidate will ensure the reliability, scalability, and performance of the Cloudera Data Platform (CDP) infrastructure. This role involves close collaboration with cross-functional teams to design, implement, and maintain robust systems that support data-driven initiatives. The ideal candidate will have strong troubleshooting skills and a proactive approach to automation and optimization.
The SRE will play a pivotal role in ensuring the smooth operation, performance, and security of a large, high-density Cloudera-based infrastructure.
Responsibilities Include:
- Implementing Cloudera Data Platform and integrating it with existing systems.
- Managing infrastructure to ensure optimal performance, high availability, and scalability.
- Implementing and enforcing security best practices to safeguard data integrity and confidentiality.
- Continuously optimizing the Cloudera infrastructure to enhance performance, efficiency, and cost-effectiveness.
- Planning and performance tuning of Hadoop clusters, and monitoring resource utilization trends.
- Implementing robust backup and disaster recovery strategies.
- Applying recommended patches and performing rolling upgrades of the platform.
- Creating comprehensive documentation for configurations, processes, and procedures.
- Collaborating effectively with cross-functional teams.
Requirements:
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 3-5 years of experience in the design, setup, and management of large-scale Hadoop clusters.
- Strong understanding of distributed computing principles and Hadoop ecosystem technologies.
- Experience with Kerberos and LDAP.
- Hands-on experience with configuration management tools.
- Strong scripting skills for automation and troubleshooting.
- Experience with monitoring and logging solutions.
- Knowledge of networking principles and protocols.
- Excellent communication, analytical, problem-solving, and troubleshooting skills.
What PhonePe Offers:
- Insurance Benefits
- Wellness Program
- Parental Support
- Mobility Benefits
- Retirement Benefits
- Other Benefits, such as Higher Education Assistance and Car Lease