Kafka and Data Lake Engineer

Remote Full-time
Responsibilities

Design data pipelines: Build robust, scalable, and secure data pipelines to ingest, process, and move data from various sources into the data lake using Kafka.
Administer Kafka clusters: Deploy, configure, and maintain Kafka clusters and related ecosystem tools, such as Kafka Connect and Schema Registry, ensuring high availability and performance.
Manage the data lake: Oversee the architecture and governance of the data lake, including managing data storage (e.g., in AWS S3 or ADLS), security, and metadata.
Develop data processing applications: Create producers and consumers to interact with Kafka topics using programming languages like Python, Java, or Scala.
Perform stream processing: Use tools like Kafka Streams, Apache Flink, or ksqlDB to perform real-time data transformations and analytics.
Ensure data quality and security: Implement data quality checks, manage data lineage, and enforce security controls such as encryption, access controls (ACLs), and compliance (e.g., GDPR).
Monitor and troubleshoot: Set up monitoring and alerting for Kafka and data lake infrastructure and respond to incidents to ensure operational reliability.
Collaborate with teams: Work closely with data scientists, analysts, and other engineering teams to understand data requirements and deliver reliable data solutions.

Essential skills and qualifications

Experience: Proven experience designing and managing data platforms with Apache Kafka and big data technologies.
Programming: Strong proficiency in languages like Python, Java, or Scala.
Big data technologies: Expertise in big data processing frameworks, such as Apache Spark and Apache Flink.
Cloud platforms: Hands-on experience with cloud environments (AWS, Azure, or GCP) and relevant services like S3, Glue, or Azure Data Lake Storage.
Data lake architecture: A solid understanding of data lake design principles, including storage formats (e.g., Delta Lake, Apache Iceberg), data modeling, and governance.
Databases: Experience with various database systems, including both SQL and NoSQL.
Infrastructure management: Familiarity with infrastructure-as-code tools like Terraform or Ansible and containerization with Docker and Kubernetes.

Professionals in this field can advance from entry-level data engineering positions to senior roles, and then to Big Data Architect or Solutions Architect positions, where they oversee large-scale data infrastructure.
