Kafka and Data Lake Engineer

Remote Full-time
Responsibilities

Design data pipelines: Build robust, scalable, and secure data pipelines to ingest, process, and move data from various sources into the data lake using Kafka.
Administer Kafka clusters: Deploy, configure, and maintain Kafka clusters and related ecosystem tools, such as Kafka Connect and Schema Registry, ensuring high availability and performance.
Manage the data lake: Oversee the architecture and governance of the data lake, including managing data storage (e.g., in AWS S3 or ADLS), security, and metadata.
Develop data processing applications: Create producers and consumers to interact with Kafka topics using programming languages like Python, Java, or Scala.
Perform stream processing: Use tools like Kafka Streams, Apache Flink, or ksqlDB to perform real-time data transformations and analytics.
Ensure data quality and security: Implement data quality checks, manage data lineage, and enforce security controls such as encryption, access controls (ACLs), and compliance (e.g., GDPR).
Monitor and troubleshoot: Set up monitoring and alerting for Kafka and data lake infrastructure and respond to incidents to ensure operational reliability.
Collaborate with teams: Work closely with data scientists, analysts, and other engineering teams to understand data requirements and deliver reliable data solutions.

Essential skills and qualifications

Experience: Proven experience designing and managing data platforms with Apache Kafka and big data technologies.
Programming: Strong proficiency in languages like Python, Java, or Scala.
Big data technologies: Expertise in big data processing frameworks, such as Apache Spark and Apache Flink.
Cloud platforms: Hands-on experience with cloud environments (AWS, Azure, or GCP) and relevant services like S3, Glue, or Azure Data Lake Storage.
Data lake architecture: A solid understanding of data lake design principles, including storage formats (e.g., Delta Lake, Apache Iceberg), data modeling, and governance.
Databases: Experience with various database systems, including both SQL and NoSQL.
Infrastructure management: Familiarity with infrastructure-as-code tools like Terraform or Ansible and containerization with Docker and Kubernetes.

Professionals in this field can advance from entry-level data engineering positions to senior roles, and then to a Big Data Architect or Solutions Architect, where they oversee large-scale data infrastructure.
