Data Engineer (SQL, Python) / Charlestown, MA (Remote), 6-Month Contract
Job title: Data Engineer (SQL, Python) / Charlestown, MA (Remote), 6-Month Contract at Suncap Technology
Company: Suncap Technology
Job description: The Data Engineer will support a variety of database/warehouse management, ETL scripting, and data validation tasks that include but are not limited to: querying databases, restructuring data, cleaning and validating data, performing manual ETL tasks, automating ETL tasks using tools and custom scripting, full pipeline management/monitoring, improving systems and processes, and documenting data systems. The qualified candidate will be highly detail-oriented and have a strong interest in and aptitude for data management and engineering. Some specific focus areas will be determined based on the candidate's skills and interests.
The successful candidate must be highly organized, motivated, and able to thrive in a fast-paced team environment, and must enjoy the challenge of a dynamic environment with evolving needs. It is extremely important that the candidate be able to keep careful track of multiple work streams.
PRINCIPAL DUTIES AND RESPONSIBILITIES:
Relevant activities include, but are not limited to the following:
Utilizing, improving, and constructing ETL tools and data warehousing solutions
Running current SQL, Python, and/or Tableau Prep ETL scripts
Using various monitoring and evaluation methods to validate that data flowing through these pipelines is accurate, and troubleshooting and addressing issues when they are discovered
Data warehouse maintenance and support
Improving and better integrating scripts (ETL and validation) and warehouse elements into various data pipelines to achieve greater efficiency, reliability, and functionality
Constructing new ETL tools and warehouse components as necessary, specifically including a dedicated-use pipeline for a new collaborative research project
Data Cleaning
Writing queries (SQL) and scripts (Python) to identify data quality problems
Investigating the root cause of data quality problems
Working with the relevant team members to determine appropriate data remediation and process improvement plans
SKILLS & COMPETENCIES REQUIRED:
Background
Technical
Procedural programming for data manipulation using Python
PHP, Java, or other languages are a plus
Knowledge of relational database platforms, data modeling, and warehousing
Comfortable extracting data from and loading data into sources ranging from an Enterprise Data Warehouse to an Excel or text file, using built-in tools or custom-written ETL scripts
Above-average SQL skills (e.g., familiar with subqueries, multiple joins, and grouping), specifically in MySQL; SQL Server experience is a plus
Comfortable with complex multi-stage, multi-technology ETL pipelines
Professional
Ability to interpret and follow through on data requirements with strong attention to detail
Strength in independently validating and debugging code and analyses, including consulting documentation, Stack Exchange, etc.
Demonstrates personal initiative and time management skills, as well as the ability to work effectively and kindly as part of a team
Excellent verbal and written communication skills
Familiar with agile software development methodologies
Interest in identifying process improvement opportunities is a plus
Required: Undergraduate degree in Health Informatics, Computer Science, Statistics, Mathematics, Engineering, or a related subject.
Preferred coursework would include most of the following:
Intermediate Databases and SQL
Intermediate Programming (Procedural and/or OO)
Data Structures and Algorithms
Data Quality Management
Data Flow and Automation
Agile Project Management
Expected salary:
Location: Charlestown, MA