Talent.com
ML Data Pipeline Engineer
ML Data Pipeline EngineerProsigliere • Pindamonhangaba, São Paulo, Brazil
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliere • Pindamonhangaba, São Paulo, Brazil
Há 18 dias
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve video capture software robustness, particularly handling network interruptions and operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting

Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfortable working in Linux environments and deploying / monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset : designing validation rules, tracking metrics, investigating anomalies

SQL / database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor / IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices : containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages : Python (primary), JavaScript / TypeScript (NextJS UI)

Data : IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries

Infrastructure : Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work : diagnosing why data doesn't match expectations

Balance pragmatism with quality : shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team

Criar um alerta de emprego para esta pesquisa

Data Engineer • Pindamonhangaba, São Paulo, Brazil

Vagas relacionadas
Data Engineer - Fluent English

Data Engineer - Fluent English

Artefact • Pindamonhangaba, São Paulo, Brazil
The current vacancy is for the Brazilian office and we work in a Free Office model.Who we are At Artefact LatAm, we believe in and live a culture based on empathy! A healthy work environment is a p...Mostre mais
Última atualização: 16 dias atrás • Promovida
Sr Python Data Engineer

Sr Python Data Engineer

Softensity Inc • Taubaté, São Paulo, Brazil
Senior Python Data Engineer About the Project Responsibilities Design, build, and maintain high-performance data processing pipelines using Python libraries (Pandas, Polars).Develop and expose...Mostre mais
Última atualização: 6 dias atrás • Promovida
AI Gateway Engineer

AI Gateway Engineer

AVM Consulting Inc • Taubaté, São Paulo, Brazil
AI Gateway Engineer We are seeking a skilled AI Gateway Engineer to join our team.The ideal candidate will have hands-on experience with AI or API gateways, a strong background in backend develo...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Data Engineer

Data Engineer

HeartCentrix Solutions • pindamonhangaba, Brasil
We are seeking a highly skilled.Python Data Engineer with an AI / ML focus.This role is ideal for someone who loves building scalable data pipelines, operationalizing machine learning workflows, and ...Mostre mais
Última atualização: 3 horas atrás • Promovida • Nova!
Databricks Data Engineer

Databricks Data Engineer

GlobalSource IT • Pindamonhangaba, São Paulo, Brazil
Databricks Data Engineer Fully Remote Contract We’re looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta La...Mostre mais
Última atualização: 2 dias atrás • Promovida
Senior Data Engineer

Senior Data Engineer

Eightpoint • Pindamonhangaba, São Paulo, Brazil
About Eightpoint Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving signifi...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Python / Data Engineer

Python / Data Engineer

Luxoft • Taubaté, São Paulo, Brazil
Join a team focused on backend and data engineering for headend applications at a leading video content provider.This role emphasizes Python-based services, AWS data workflows, and integration with...Mostre mais
Última atualização: 6 dias atrás • Promovida
Data Engineer

Data Engineer

Insight Global • Pindamonhangaba, São Paulo, Brazil
Position : Data Engineer Location : Remote in Brazil Duration : 3 year+ PJ Contract Monthly Salary Range : 3-5k USD / Month Requirements : 3+ years of experience in Data Engineering Hands on data enginee...Mostre mais
Última atualização: 18 dias atrás • Promovida
Senior Data Engineer

Senior Data Engineer

Encora Inc. • Pindamonhangaba, São Paulo, Brazil
Important Information Location : Brazil Job Mode : Full-time Work Mode : Work from home Job Summary We are looking for a Senior Data Engineer with deep expertise in building, optimizing, an...Mostre mais
Última atualização: 20 dias atrás • Promovida
Engenheiro De Dados

Engenheiro De Dados

AGC Vidros do Brasil • Guaratinguetá, Federative Republic Of Brazil, BR
Vaga : Engenheiro(a) de Dados Jr.A AGC Vidros do Brasil, referência global em tecnologia e inovação na fabricação de vidros planos, está em busca de um(a). Produção – Forno, apoiando o processo de tr...Mostre mais
Última atualização: 5 dias atrás • Promovida
Software Developer (Mobile / AI-Enabled Workflow)

Software Developer (Mobile / AI-Enabled Workflow)

Earned Media Productions • Taubaté, São Paulo, Brazil
Software Developer (Mobile / AI-Enabled Workflow) We are looking for a talented software developer with strong mobile app development experience to join our team. The ideal candidate will be comfo...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Azure Data Engineer

Azure Data Engineer

Tata Consultancy Services • Pindamonhangaba, São Paulo, Brazil
Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Lead Data Engineer

Lead Data Engineer

Elios Talent • Taubaté, São Paulo, Brasil
Lead Data EngineerKey Highlights.Lead the end-to-end design and build of a brand-new, greenfield analytics ecosystem.Architect data pipelines, orchestration, warehousing, and BI layers from the gro...Mostre mais
Última atualização: 1 dia atrás • Promovida
Engenheiro de dados

Engenheiro de dados

AGC Vidros do Brasil • Guaratinguetá, São Paulo, Brasil
Vaga : Engenheiro(a) de Dados Jr.A AGC Vidros do Brasil, referência global em tecnologia e inovação na fabricação de vidros planos, está em busca de um(a). Produção – Forno, apoiando o processo de tr...Mostre mais
Última atualização: 4 dias atrás • Promovida
Machine Learning Engineer

Machine Learning Engineer

dhauz • taubaté, Brasil
DHAUZ North America is a rapidly scaling AI start-up offering consulting and product development capabilities to high growth organizations. The offerings will range from providing strategic advice t...Mostre mais
Última atualização: 3 horas atrás • Promovida • Nova!
Data Lead Engineer – Snowflake

Data Lead Engineer – Snowflake

Ampstek • pindamonhangaba, estado de são paulo, Brasil
Data Lead Engineer – Snowflake.Technical Leadership : Provide technical direction and mentorship to a team of data engineers, ensuring best practices in coding, architecture, and data operations.End...Mostre mais
Última atualização: 15 horas atrás • Promovida • Nova!
Data Lead Engineer - Snowflake

Data Lead Engineer - Snowflake

Ampstek • Taubaté, São Paulo, Brazil
Data Lead Engineer – Snowflake Remote Contract Brazil / Mexico Responsibilities • Technical Leadership : Provide technical direction and mentorship to a team of data engineers, ensuring best pract...Mostre mais
Última atualização: 3 dias atrás • Promovida
Senior Data Engineer

Senior Data Engineer

Pride Global • Taubaté, São Paulo, Brazil
We're Hiring : Senior Data Engineer | Remote from Brazil | Fluent English required | Location : Remote – Brazil only Contact : Temporary Are you passionate about building scalable data platforms...Mostre mais
Última atualização: há mais de 30 dias • Promovida