[Remote] Senior Data Engineer - Remote - Multiple Levels
Note: The job is a remote job and is open to candidates in USA. Guidehouse is a consulting firm seeking multiple Data Engineers to join their Technology AI & Data practice. The role involves supporting public sector and health sector clients by building modern data foundations to improve outcomes and enable better decision-making.ResponsibilitiesAssist in developing and maintaining data pipelines and ETL/ELT processes under the guidance of more senior engineersWrite Python and SQL to extract, transform, validate, and load data from common sourcesPerform data quality checks (validation, reconciliation, basic monitoring) and help troubleshoot data issuesDevelop dashboards and analytic products using data visualization tools (e.g., Power BI, Tableau)Support cloud-based data workloads (e.g., Azure/AWS/GCP basics) and learn platform-native services and patternsDocument pipeline steps and technical processes to support maintainability and knowledge transferParticipate in team delivery rhythms (standups, sprint ceremonies) and contribute to reviews with a learning mindsetDesign, build, test, and maintain scalable data pipelines (batch and/or streaming as applicable) with increasing independenceIntegrate data from multiple sources, resolve inconsistencies, and deliver curated datasets for analytics and operational useOwn data quality for assigned domains by implementing validation checks, reconciliation, and monitoring/alerting patternsBuild, maintain, and deploy data products for analytics and data science teams on cloud platforms (e.g. AWS, Azure, GCP)Optimize performance of pipelines and queries (tuning, partitioning patterns, efficient compute usage)Collaborate cross-functionally with analysts, data scientists, and stakeholders to translate requirements into technical designs and delivery plansProduce and maintain technical documentation for data flows, data models, and operational proceduresContribute to governance and compliance practices (access controls, lineage awareness, controlled data handling) within your scopeLead the design and build of scalable data pipeline architectures and tools, including patterns for reliability, security, and maintainabilityDrive ETL/ELT and data quality strategy (frameworks, standards, repeatable testing/monitoring approaches) and raise engineering maturity across the teamArchitect solutions in cloud data platforms (e.g., Azure + Databricks, Snowflake) and guide implementation tradeoffs (cost, performance, scalability, governance)Design data stores and interactions across storage types (relational, warehouse, lake/lakehouse, and NoSQL where needed) aligned to use casesEnable data science / ML readiness by delivering well-modeled, reliable, well-documented datasets and featuresLead requirements gathering and technical planning; translate ambiguous problem statements into actionable architectures, backlogs, and delivery incrementsChampion data quality and governance standards through the development of sophisticated data quality frameworks, dashboards, and feedback loops to ensure transparency in data completeness, consistency, and quality for partners and researchersOwn client and stakeholder engagement for your workstream, including organizing/leading meetings, producing clear written outputs, and tracking follow-throughMentor and review: provide strong code/design reviews, coach engineers, and help remove technical blockersSkillsBachelor's degree from an accredited college/universityBased on our contractual obligations, candidate must be located within the United States and US CitizenMust be able to OBTAIN and MAINTAIN a Federal or DoD 'PUBLIC TRUST'Strong communication skills and ability to work independently, strong collaboration habits, and comfort operating autonomously in a remote environmentMinimum 1+ years of relevant software engineering/data experience (for the Junior role); Minimum of 3+ years of relevant software engineering/data experience (for the Data Engineer); and 8+ years of relevant software engineering/data experience (for the Senior Data Engineer)Advanced SQL and Python skills and experience with relational databases and database designExperience working with data ingestion tools such as AWS Lambda, AWS Data Migration Service, SFTPExperience making dashboards and using data visualization tools (Tableau, Power BI)Experience in integrating data from disparate systems and technologies (IBM Mainframe, Structured, Semi-structured and unstructured sources.)Proficiency with one or more cloud-based solutions (e.g., AWS, Azure, GCP)Designing/deploying data solutions on cloud platforms (AWS, GCP, Azure)Hands-on experience with cloud services and REST API integrationsProficiency with modern data tools (e.g., Spark/Databricks, Airflow, dbt, Kafka) is a plusExperience working with distributed data processing tools such as PySpark, AWS GlueDatabricks and/or Snowflake Data Engineer Associate or Professional certificationProficiency with workflow management systems (Nextflow, Snakemake, Airflow)Experience with regulated environments (GxP, 21 CFR Part 11) and data governanceBenefitsMedical, Rx, Dental & Vision InsurancePersonal and Family Sick Time & Company Paid HolidaysPosition may be eligible for a discretionary variable incentive bonusParental Leave and Adoption Assistance401(k) Retirement PlanBasic Life & Supplemental LifeHealth Savings Account, Dental/Vision & Dependent Care Flexible Spending AccountsShort-Term & Long-Term DisabilityStudent Loan PayDownTuition Reimbursement, Personal Development & Learning OpportunitiesSkills Development & CertificationsEmployee Referral ProgramCorporate Sponsored Events & Community OutreachEmergency Back-Up Childcare ProgramMobility StipendCompany OverviewGuidehouse offers consulting services for public and commercial markets with expertise in management, technology, and risk consulting. It was founded in 2018, and is headquartered in Washington, District of Columbia, USA, with a workforce of 10001+ employees. Its website is https://guidehouse.com.
Apply Now
Apply Now