Data Scientist (Big Data Engineer) 3

Remote Full-time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Rapisource LLC, is seeking the following. Apply via Dice today! Solicitation Reference Number: 2026C0014 DIrect Client: Texas Department of Family and Protective Services Working Title: Data Scientist (Big Data Engineer) 3 Work Location: Austin, Tx - Telework JD: The Worker is responsible for developing, maintaining, and optimizing big data solutions using the Databricks Unified Analytics Platform. This role supports data engineering, machine learning, and analytics initiatives within this organization that relies on large-scale data processing. Duties include: • Designing and developing scalable data pipelines • Implementing ETL/ELT workflows • Optimizing Spark jobs • Integrating with Azure Data Factory • Automating deployments • Collaborating with cross-functional teams • Ensuring data quality, governance, and security. CANDIDATE SKILLS AND QUALIFICATIONS Minimum Requirements: Candidates that do not meet or exceed the minimum stated requirements (skills/experience) will be displayed to customers but may not be chosen for this opportunity. Years Required/Preferred Experience 4 Required Implement ETL/ELT workflows for both structured and unstructured data 4 Required Automate deployments using CI/CD tools 4 Required Collaborate with cross-functional teams including data scientists, analysts, and stakeholders 4 Required Design and maintain data models, schemas, and database structures to support analytical and operational use cases 4 Required Evaluate and implement appropriate data storage solutions, including data lakes (Azure Data Lake Storage) and data warehouses 4 Required Implement data validation and quality checks to ensure accuracy and consistency 4 Required Contribute to data governance initiatives, including metadata management, data lineage, and data cataloging 4 Required Implement data security measures, including encryption, access controls, and auditing; ensure compliance with regulations and best practices 4 Required Proficiency in Python and R programming languages 4 Required Strong SQL querying and data manipulation skills 4 Required Experience with Azure cloud platform 4 Required Experience with DevOps, CI/CD pipelines, and version control systems 4 Required Working in agile, multicultural environments 4 Required Strong troubleshooting and debugging capabilities 3 Required Design and develop scalable data pipelines using Apache Spark on Databricks 3 Required Optimize Spark jobs for performance and cost-efficiency 3 Required Integrate Databricks solutions with cloud services (Azure Data Factory) 3 Required Ensure data quality, governance, and security using Unity Catalog or Delta Lake 3 Required Deep understanding of Apache Spark architecture, RDDs, DataFrames, and Spark SQL 3 Required Hands-on experience with Databricks notebooks, clusters, jobs, and Delta Lake 1 Preferred Knowledge of ML libraries (MLflow, Scikit-learn, TensorFlow) 1 Preferred Databricks Certified Associate Developer for Apache Spark 1 Preferred Azure Data Engineer Associate Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Clinical Manager, Integrated Care (Oregon Licensed) – Remote

Remote Full-time

Experienced Remote Data Entry and Analysis Specialist – Entry-Level Opportunity for Career Growth and Development with blithequark

Remote Full-time

**Experienced Customer Associate - Entry Level Opportunity at blithequark**

Remote Full-time

Amazon Delivery Driver

Remote Full-time

Product Manager - Banking

Remote Full-time

Google Cloud Solution Architect

Remote Full-time

Epic Credentialed Trainer - Learning and Organizational Development Educator

Remote Full-time

Experienced Remote Data Entry Specialist for Efficient and Flexible Data Management Solutions

Remote Full-time

**Experienced Customer Service Representative – Grand Rapids**

Remote Full-time

Experienced Full Time Data Entry and Talent Relations Facilitator – Remote Work Opportunity with Competitive Compensation and Benefits at blithequark

Remote Full-time
← Back to Home