Senior Data Engineer (GovTech and Public Sector)
Company Description:Are you an experienced Data Engineer ready to tackle complex, high-load, and data-intensive systems? We are looking for a Senior professional to join our team in Ukraine, Europe, working full-time on a project that will make a real impact in the public sector.At Sigma Software, we specialize in delivering innovative solutions for enterprise clients and public organizations. In this role, you will contribute to building an integrated platform that collects, processes, and visualizes critical indicators, enabling better decision-making and analytics.Why join us? You will work with a modern big data stack, have end-to-end involvement from ingestion to machine learning workflows, and be part of a professional team that values ownership, collaboration, and continuous improvement.PROJECT
You will be involved in developing an integrated platform that processes both batch and streaming data, ensures secure and governed data environments, and supports advanced analytics and machine learning workflows. The solution will leverage modern big data technologies to provide actionable insights for the public sector.Job Description:Design and implement data ingestion pipelines for batch and streaming dataConfigure and maintain data orchestration workflows (Airflow, NiFi) and CI/CD automation for data processesDesign and organize data layers within Data Lake architecture (HDFS, Iceberg, S3)Build and maintain secure and governed data environments using Apache Ranger, Atlas, and SDXDevelop SQL queries and optimize performance for analytical workloads in Hive/ImpalaCollaborate on data modeling for analytics and BI, ensuring clean schemas and dimensional modelsSupport machine learning workflows using Spark MLlib or Cloudera Machine Learning (CML)Qualifications:Proven experience in building and maintaining large-scale data pipelines (batch and streaming)Strong knowledge of data engineering fundamentals: ETL/ELT, data governance, data warehousing, Medallion architectureStrong SQL skills for Data Warehouse data servingMinimum 3 years of experience in Python or Scala for data processingHands-on experience with Apache Spark, Kafka, Airflow, and distributed systems optimizationExperience with Apache Ranger and Atlas for security and metadata managementUpper-Intermediate English proficiencyWILL BE A PLUSExperience with Cloudera Data Platform (CDP)Advanced SQL skills and Hive/Impala query optimizationBS in Computer Science or related fieldExposure to ML frameworks and predictive modelingAdditional Information:PERSONAL PROFILEOwnership mindset and proactive approachAbility to drive initiatives forward and suggest improvementsTeam player with shared responsibility for delivery speed, efficiency, and qualityExcellent written and verbal communication skills
Apply Now
You will be involved in developing an integrated platform that processes both batch and streaming data, ensures secure and governed data environments, and supports advanced analytics and machine learning workflows. The solution will leverage modern big data technologies to provide actionable insights for the public sector.Job Description:Design and implement data ingestion pipelines for batch and streaming dataConfigure and maintain data orchestration workflows (Airflow, NiFi) and CI/CD automation for data processesDesign and organize data layers within Data Lake architecture (HDFS, Iceberg, S3)Build and maintain secure and governed data environments using Apache Ranger, Atlas, and SDXDevelop SQL queries and optimize performance for analytical workloads in Hive/ImpalaCollaborate on data modeling for analytics and BI, ensuring clean schemas and dimensional modelsSupport machine learning workflows using Spark MLlib or Cloudera Machine Learning (CML)Qualifications:Proven experience in building and maintaining large-scale data pipelines (batch and streaming)Strong knowledge of data engineering fundamentals: ETL/ELT, data governance, data warehousing, Medallion architectureStrong SQL skills for Data Warehouse data servingMinimum 3 years of experience in Python or Scala for data processingHands-on experience with Apache Spark, Kafka, Airflow, and distributed systems optimizationExperience with Apache Ranger and Atlas for security and metadata managementUpper-Intermediate English proficiencyWILL BE A PLUSExperience with Cloudera Data Platform (CDP)Advanced SQL skills and Hive/Impala query optimizationBS in Computer Science or related fieldExposure to ML frameworks and predictive modelingAdditional Information:PERSONAL PROFILEOwnership mindset and proactive approachAbility to drive initiatives forward and suggest improvementsTeam player with shared responsibility for delivery speed, efficiency, and qualityExcellent written and verbal communication skills
Apply Now