ML Data Pipeline Engineer

Prosigliere
Jaguariúna, São Paulo, Brazil
Posted 4 days ago
Job description

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve video capture software robustness, particularly handling network interruptions and operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
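
As a rough illustration of the kinds of checks listed above, here is a minimal Python sketch. It is not taken from the company's QC engine: the function names, the thresholds (`tolerance`, `min_std`), and the assumption that timestamps arrive as a sorted list of numbers are all hypothetical.

```python
from statistics import median, pstdev


def find_gaps(timestamps, tolerance=1.5):
    """Return (start, end) pairs where consecutive samples are farther apart
    than `tolerance` times the median sampling interval (hypothetical rule)."""
    if len(timestamps) < 3:
        return []
    deltas = [b - a for a, b in zip(timestamps, timestamps[1:])]
    nominal = median(deltas)
    return [
        (a, b)
        for a, b, d in zip(timestamps, timestamps[1:], deltas)
        if d > tolerance * nominal
    ]


def is_flatline(values, min_std=1e-3):
    """True when the signal barely varies, which may point to a stuck sensor."""
    return len(values) > 1 and pstdev(values) < min_std


def annotation_within_recording(start, end, timestamps):
    """True when an annotated interval lies inside the recorded time span."""
    return bool(timestamps) and timestamps[0] <= start < end <= timestamps[-1]


# Example: timestamps in milliseconds with one dropout between 30 and 500.
ts_ms = [0, 10, 20, 30, 500, 510]
gaps = find_gaps(ts_ms)
```

A real engine would likely add resampling, per-axis checks, and clock-offset estimation between the IMU and video streams; this sketch only shows the shape of simple rule-based validation.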

Analysis & Troubleshooting

Perform ad-hoc analysis on 1,200+ workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfortable working in Linux environments and deploying/monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset: designing validation rules, tracking metrics, investigating anomalies

SQL/database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor/IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices: containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages: Python (primary), JavaScript/TypeScript (NextJS UI)

Data: IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools: LabelStudio, Chart.js, Linux/bash, OCR libraries

Infrastructure: Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work: diagnosing why data doesn't match expectations

Balance pragmatism with quality: shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team
