Lead Data Engineer Job at WorkHQ, Los Angeles, CA

Kzhudy80SFZoVlJCVzlHQVd1T2dUeUlOSkE9PQ==
  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have

  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Remote job, Shift work,

Similar Jobs

Actalent

Phlebotomist Job at Actalent

Job Title: PhlebotomistJob DescriptionWe are seeking a skilled phlebotomist who is proficient in blood collection techniques including venipuncture and capillary methods, catering to patients of all age groups from pediatric to geriatric. The ideal candidate should have... 

Serra Chevrolet

Receptionist Job at Serra Chevrolet

 ...dealer groups in the nation. We are proud to represent the world's best automotive brands through our locations across Central Alabama...  .... Our mission is to provide everyone with a better automotive buying and ownership experience, and we are always looking for the right... 

Sodexo

Patient Services Manager 2 Job at Sodexo

 ...Impact in a 4-Star Rated Facility! Sodexo is seeking a Patient Services Manager 2 for NYC Health + Hospitals/Coler an 815-bed skilled...  ...determined by a candidate's education level or years of relevant experience. Salary offers are based on a candidate's specific... 

Nastech Global

Java Developer Job at Nastech Global

 ...Implementing automated testing platforms and unit tests. Proficient understanding of code versioning tools, such as Git. Familiarity with build tools such as Ant, Maven, and Gradle. Familiarity with concepts of CI/CD, Kafka, MQ, Performance Improvement, Splunk, SQL.... 

Avid/Candlewood Coralville

Laundry Attendant Job at Avid/Candlewood Coralville

 ...that you find* Place all laundry in the washer and add the cleaning agents as directed* Take out the clothes and linens after washing...  ...lift 20 pounds on a consistent basis* Strong knowledge of commercial cleaning techniques and products* Impeccable work ethic and...