ML Data Pipeline Engineer

Prosigliere • Ananindeua, Pará, Brazil
Posted 19 days ago
Job description

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve the robustness of our video capture software, particularly around network-interruption handling and operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation.

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency.

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting

Perform ad-hoc analysis on 1,200+ workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with the annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfort working in Linux environments and deploying and monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset: designing validation rules, tracking metrics, investigating anomalies

SQL / database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor / IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices: containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages: Python (primary), JavaScript / TypeScript (NextJS UI)

Data: IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools: LabelStudio, Chart.js, Linux / bash, OCR libraries

Infrastructure: Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work: diagnosing why data doesn't match expectations

Balance pragmatism with quality: shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team
