Job Location | United Arab Emirates |
Education | Engineering Graduates/PG |
Salary | Not Mentioned |
Industry | IT - Software |
Functional Area | Not Mentioned |
Location: Dubai, UAENotice Period: Immediate to Max 30 DaysExperience: 5+ YearsKey Responsibilities:– Build and maintain scalable ETL pipelines using PySpark on Cloudera Data Platform (CDP)– Ingest data from RDBMS, APIs, and file systems into CDP– Cleanse and transform large datasets to support business needs– Optimize performance of PySpark jobs and Cloudera components– Automate workflows with Apache Oozie / Airflow– Collaborate with analysts, PMs, and engineering teams– Ensure data quality, validation, and thorough documentationMust-Have Skills:– Strong hands-on expertise in PySpark (RDDs, DataFrames, optimization)– Experience with Cloudera Manager, Hive, Impala, HDFS, HBase– Proficiency in SQL, ETL processes, and big data tech– Knowledge of orchestration tools like Oozie/Airflow– Solid scripting in LinuxLooking for someone analytical, detail-oriented, and collaborative.If you’re passionate about data and ready for your next challenge, we want to hear from you!
Keyskills :
SQL ETL processes and big data tech Linux Scripting
© 2023 HireeJobsGulf All Rights Reserved