Cloudera / Data Engineer
As a Cloudera / Data Engineer, you will be responsible for designing, building, and maintaining scalable data pipelines and platforms, with a strong focus on the Cloudera Hadoop ecosystem. You will work closely with data analysts, scientists, and business stakeholders to ensure data accessibility, quality, and security.
What will you do?
- Design, build, and manage the Cloudera Hadoop Distribution (CDH/CDP).
- Develop and maintain ETL pipelines using tools such as Apache NiFi, Hive, Spark, and Impala.
- Manage and optimize HDFS, YARN, Kafka, HBase, and Oozie workflows.
- Monitor and troubleshoot cluster performance and jobs with strong problem-solving and debugging skills.
- Collaborate with DevOps and Data Science teams to integrate data platforms into applications and analytics workflows.
- Ensure data governance, security, and compliance using tools like Apache Ranger, Atlas, and Kerberos.
- Mentor and guide a team of data engineers to deliver robust data solutions.
Qualifications
The ideal candidate should possess:
- 10+ years of experience in big data engineering, preferably with Cloudera.
- Strong programming skills in Python, Java, and Spark.
- Experience with Apache Spark, Hive, Impala, and Kafka.
- Familiarity with Linux/Unix and shell scripting.
- Degree in Computer Science, Information Technology, or a related field.
Preferred Skills:
- Cloudera Certified Professional (CCP) or Cloudera Data Platform certification.
- Experience/Knowledge on cloud platforms (AWS, Azure, or GCP) and hybrid deployments.
- Familiarity with CI/CD pipelines, Docker, or Kubernetes in a data context.
Additional Information
We are driven by our AEIOU beliefs - Adventure, Excellence, Integrity, Ownership, and Unity - and we seek individuals who embody these values in both their professional and personal lives. We are committed to our Impact: Valuing our clients, Growing our people, and Creating our future.
Together, we make the extraordinary happen.
Learn more about us at ncs.co and visit our LinkedIn career site.