Talent.com
As candidaturas não são mais aceitas
ML Data Pipeline Engineer

ML Data Pipeline Engineer

ProsigliereGuarujá, São Paulo, Brazil
Há 5 dias
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve video capture software robustness, particularly handling network interruptions and operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting

Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfortable working in Linux environments and deploying / monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset : designing validation rules, tracking metrics, investigating anomalies

SQL / database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor / IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices : containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages : Python (primary), JavaScript / TypeScript (NextJS UI)

Data : IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries

Infrastructure : Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work : diagnosing why data doesn't match expectations

Balance pragmatism with quality : shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team

Criar um alerta de emprego para esta pesquisa

Data Engineer • Guarujá, São Paulo, Brazil

Vagas relacionadas
Data Engineer (Lead) ID41785

Data Engineer (Lead) ID41785

AgileEngineSão Bernardo do Campo, SP, br
Quick Apply
Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: há mais de 30 dias
  • Promovida
Sr. Data Analytics Engineer

Sr. Data Analytics Engineer

CredixSão Paulo, São Paulo, Brazil
Credix is a FinTech company dedicated to growing businesses in Latin America.Building on our expertise, we now focus on providing a tailored Buy Now, Pay Later (BNPL) solution for B2B transactions ...Mostre maisÚltima atualização: 19 dias atrás
  • Promovida
Data Engineer - Fluent English

Data Engineer - Fluent English

ArtefactSão Paulo, BR
The current vacancy is for the Brazilian office and we work in a Free Office model.At Artefact LatAm, we believe in and live a culture based on empathy!. A healthy work environment is a place where ...Mostre maisÚltima atualização: 3 dias atrás
  • Promovida
Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

Amaris Consultingguarujá, Brazil
Amaris Consulting is an independent technology consulting firm with a global footprint, bringing together diverse talents from various backgrounds to deliver innovative solutions to clients worldwi...Mostre maisÚltima atualização: 2 dias atrás
  • Promovida
Azure Data Engineer

Azure Data Engineer

Tata Consultancy Servicesmauá, Brazil
Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre maisÚltima atualização: 2 dias atrás
  • Promovida
Ml Data Pipeline Engineer

Ml Data Pipeline Engineer

ProsigliereGuarujá, São Paulo, Brasil
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 2 dias atrás
  • Promovida
Aws Developer (Data Lake)

Aws Developer (Data Lake)

MetaOsasco, São Paulo, Brasil
We are seeking a Senior AWS Developer to support the design and implementation of a large-scale data lake and analytics platform within a leading financial institution. The ideal candidate will demo...Mostre maisÚltima atualização: 3 dias atrás
  • Promovida
Data Engineer

Data Engineer

Tata Consultancy ServicesSão Paulo, São Paulo (microrregião), Brasil
Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a...Mostre maisÚltima atualização: há mais de 30 dias
  • Promovida
Data Engineer

Data Engineer

able.digitalSão Paulo, São Paulo, Brazil
About the Role We are seeking an Intermediate Data Engineer to support our data infrastructure initiatives by connecting analytics systems, managing data pipelines, and enabling our teams with cl...Mostre maisÚltima atualização: 23 dias atrás
  • Promovida
Python Developer And Data Engineer | Remote Work | Sao Paulo, Brazil

Python Developer And Data Engineer | Remote Work | Sao Paulo, Brazil

BairesdevSão Paulo, Brasil
WinDifferent specializes in helping businesses achieve rapid and sustainable growth through our powerful proprietary marketing system. Our data-driven solutions generate positive engagement that lea...Mostre maisÚltima atualização: há mais de 30 dias
  • Promovida
Data Engineer

Data Engineer

TotalperformSão Paulo, Brasil
We are seeking a skilled and motivated Data Engineer to join our Business Technology team.You'll help drive our business data analytics strategy by building reliable, scalable data pipelines and en...Mostre maisÚltima atualização: há mais de 30 dias
  • Promovida
Senior Data Engineer

Senior Data Engineer

Encora Inc.São Paulo, BR
We are looking for a Senior Data Engineer with deep expertise in building, optimizing, and managing data pipelines and modern cloud-based data platforms. This role involves designing scalable, secur...Mostre maisÚltima atualização: há mais de 30 dias
  • Promovida
Analista de Pesquisa e Desenvolvimento (Resinas a base de Epóxi e Poliuretano)

Analista de Pesquisa e Desenvolvimento (Resinas a base de Epóxi e Poliuretano)

MC-Bauchemie BrasilSítio Alecrim, São Paulo, Brazil
Buscamos Analista de P&D para atuar no Laboratório de Pesquisa e Desenvolvimento, no segmento de Resinas, Epóxi e Poliuretano , no modelo de trabalho presencial na unidade de cidade Vargem ...Mostre maisÚltima atualização: 3 dias atrás
  • Promovida
Cloud Engineer with Data Platforms Experience

Cloud Engineer with Data Platforms Experience

TurnKey Tech Staffingsuzano, Brazil
For more than 30 years, Carnegie has been a leader and innovator in higher education marketing and enrollment strategy, offering groundbreaking services in the areas of Research, Strategy, Digital ...Mostre maisÚltima atualização: 2 dias atrás
  • Promovida
Staff ML Engineer

Staff ML Engineer

TurnKey Tech StaffingPraia Grande, São Paulo, Brazil
About the Product Niche is the leader in school search.Our mission is to make researching and enrolling in schools easy, transparent, and free. With in-depth profiles on every school and college in ...Mostre maisÚltima atualização: 3 dias atrás
  • Promovida
Data Engineer - Remote, Latin America

Data Engineer - Remote, Latin America

Bluelight ConsultingSão Paulo, Brasil
Bluelight Consulting is a leading software consultancy dedicated to designing and developing innovative technology that enhances users\' lives. With a steadfast commitment to delivering exceptional ...Mostre maisÚltima atualização: há mais de 30 dias
  • Promovida
ML Data Pipeline Engineer

ML Data Pipeline Engineer

ProsigliereRibeirão Pires, São Paulo, Brazil
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 5 dias atrás
  • Promovida
Senior Data Engineer

Senior Data Engineer

Pride GlobalSão Paulo, BR
We're Hiring : Senior Data Engineer | Remote from Brazil | Fluent English required.Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work wit...Mostre maisÚltima atualização: há mais de 30 dias