Data Pipeline Engineer (ML) Job at OSI Engineering, Washington DC

  • OSI Engineering
  • Washington DC

Job Description

Our client is scaling production ML systems and needs a hands-on engineer to help build, maintain, and run essential ML data pipelines. You’ll own high-throughput data ingestion and transformation workflows (including image- and array-type modalities), enforce rigorous data quality standards, and partner with research and platform teams to keep models fed with reliable, versioned datasets.

  • Design, build, and operate reliable ML data pipelines for batch and/or streaming use cases across cloud environments.
  • Develop robust ETL/ELT processes (ingest, validate, cleanse, transform, and publish) with clear SLAs and monitoring.
  • Implement data quality gates (schema checks, null/outlier handling, drift and bias signals) and data versioning for reproducibility.
  • Optimize pipelines for distributed computing and large modalities (e.g., images, multi-dimensional arrays).
  • Automate repetitive workflows with CI/CD and infrastructure-as-code; document, test, and harden for production.
  • Collaborate with ML, Data Science, and Platform teams to align datasets, features, and model training needs.

Minimum Qualifications:

5+ years building and operating data pipelines in production.

  • Cloud: Hands-on with AWS, Azure, or GCP services for storage, compute, orchestration, and security.
  • Programming: Strong proficiency in Python and common data/ML libraries (pandas, NumPy, etc.).
  • Distributed compute: Experience with at least one of Spark, Dask, or Ray.
  • Modalities: Experience handling image-type and array-type data at scale.
  • Automation: Proven ability to automate repetitive tasks (shell/Python scripting, CI/CD).
  • Data Quality: Implemented validation, cleansing, and transformation frameworks in production.
  • Data Versioning: Familiar with tools/practices such as DVC, LakeFS, or similar.
  • Languages: Fluent in English or Farsi.

Strongly Preferred:

  • SQL expertise (writing performant queries; optimizing on large datasets).
  • Data warehousing/lakehouse concepts and tools (e.g., Snowflake/BigQuery/Redshift; Delta/Lakehouse patterns).
  • Data virtualization/federation exposure (e.g., Presto/Trino) and semantic/metadata layers.
  • Orchestration (Airflow, Dagster, Prefect) and observability/monitoring for data pipelines.
  • MLOps practices (feature stores, experiment tracking, lineage, artifacts).
  • Containers & IaC (Docker; Terraform/CloudFormation) and CI/CD for data/ML workflows.
  • Testing for data/ETL (unit/integration tests, great_expectations or similar).

Soft Skills:

  • Executes independently and creatively; comfortable owning outcomes in ambiguous environments.
  • Proactive communicator who collaborates cross-functionally with DS/ML/Platform stakeholders.

Location: Seattle, WA

Duration: 1+ year

Pay: $56/hr
