Junior Data Engineer (Remote US or Canada)

Remote Full-time
About Sayari: Sayari is the transparency company providing the public and private sectors with immediate visibility into complex commercial relationships by delivering the largest commercially available collection of corporate and trade data as a dynamic model of global ownership and trade activity. Sayari’s solutions harness this model to enable risk resilience, complex investigations, and clear-eyed business decisions. Sayari is headquartered in Washington, D.C., and its solutions are used by thousands of frontline analysts in over 35 countries.
Our company culture is defined by a dedication to our mission of using open data to enhance visibility into global commercial and financial networks, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.
Job Description:Sayari’s flagship product, Sayari Graph, provides instant access to structured business information from billions of corporate, legal, and trade records. As a member of Sayari's data team you will work with the Product and Software Engineering teams to collect data from around the globe, maintain existing data pipelines, and develop new pipelines that power Sayari Graph. Job Responsibilities:
• Write and deploy crawling scripts to collect source data from the web
• Write and run data transformers in Scala Spark to standardize bulk data sets
• Write and run modules in Python to parse entity references and relationships from source data
• Diagnose and fix bugs reported by internal and external users
• Analyze and report on internal datasets to answer questions and inform feature work
• Work collaboratively on and across a team of engineers using agile principles
• Give and receive feedback through code reviews
Skills & Experience:
• Professional experience with Python and a JVM language (e.g., Scala, Java, Kotlin)
• 2+ years of experience designing and maintaining data pipelines
• Experience using Apache Spark and Apache Airflow
• Experience with SQL and NoSQL databases (e.g., columns stores, graph, etc.)
• Experience working on a cloud platform like GCP, AWS, or Azure
• Experience working collaboratively with Git
• Understanding of Docker/Kubernetes
• Interest in learning from and mentoring team members
• Experience supporting and working with cross-functional teams in a dynamic environment
• Passionate about open source development and innovative technology
• Experience working with BI tools like BigQuery and Superset is a plus
• Understanding of knowledge graphs is a plus
Benefits: · 100% fully paid medical, vision, and dental for employees and their dependents· Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days · Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions· A strong commitment to diversity, equity, and inclusion· Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave· A collaborative and positive culture - your team will be as smart and driven as you· Limitless growth and learning opportunities Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.

Apply Now

Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Amazon Catalog Management & Organic Strategist

Remote Full-time

Cybersecurity Analyst II - Cyber Threat Intel (Remote)

Remote Full-time

Per Diem Physical Therapist - Home Based - Dickson City region

Remote Full-time

Data Analyst III

Remote Full-time

**Experienced Full Stack Live Customer Service Specialist – Web & Cloud Application Support**

Remote Full-time

Gestionnaire de projet senior - Équipe finance

Remote Full-time

[Work From Home] Healthcare Business Development Executive

Remote Full-time

Sr/ Competitive Intelligence Analyst/ Platform Security /Remote/

Remote Full-time

Apply Now: Customer Service/Product Information Specialist

Remote Full-time

Experienced Data Entry Clerk and Part-Time Focus Group Panelist for Remote Work from Home Opportunities with Flexible Scheduling

Remote Full-time
← Back to Home