Talent.com
ML Data Pipeline Engineer
ML Data Pipeline EngineerProsigliere • Resende, Rio de Janeiro, Brazil
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliere • Resende, Rio de Janeiro, Brazil
Há 18 dias
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve video capture software robustness, particularly handling network interruptions and operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting

Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfortable working in Linux environments and deploying / monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset : designing validation rules, tracking metrics, investigating anomalies

SQL / database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor / IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices : containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages : Python (primary), JavaScript / TypeScript (NextJS UI)

Data : IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries

Infrastructure : Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work : diagnosing why data doesn't match expectations

Balance pragmatism with quality : shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team

Criar um alerta de emprego para esta pesquisa

Data Engineer • Resende, Rio de Janeiro, Brazil

Vagas relacionadas
Data Engineer

Data Engineer

HeartCentrix Solutions • Volta Redonda, Rio de Janeiro, Brazil
We are seeking a highly skilled Python Data Engineer with an AI / ML focus to join our client’s growing data & analytics team in Brazil. This role is ideal for someone who loves building scalable da...Mostre mais
Última atualização: 6 horas atrás • Promovida • Nova!
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliere • Volta Redonda, Rio de Janeiro, Brazil
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre mais
Última atualização: 18 dias atrás • Promovida
Databricks Data Engineer...

Databricks Data Engineer...

GlobalSource IT • Resende, Rio de Janeiro, BR
Databricks Data Engineer Fully Remote Contract We’re looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta La...Mostre mais
Última atualização: 2 horas atrás • Promovida • Nova!
Databricks Data Engineer

Databricks Data Engineer

Globalsource It • Volta Redonda, Rio de Janeiro, Brasil
Databricks Data EngineerFully Remote Contract We're looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta Lake...Mostre mais
Última atualização: 1 dia atrás • Promovida
Senior Data Engineer

Senior Data Engineer

Eightpoint • Resende, Rio de Janeiro, Brazil
About Eightpoint Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving signifi...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Master Data Manager

Master Data Manager

Pride Global • volta redonda, Brasil
Vaga : MDM Tester – Híbrido em São Paulo.Testar e validar dados mestres, assegurando qualidade, consistência e precisão entre diferentes sistemas. .Investigar gaps de conhecimento e perfilar fontes d...Mostre mais
Última atualização: 7 dias atrás • Promovida
Data Engineer

Data Engineer

Insight Global • Resende, Rio de Janeiro, Brazil
Position : Data Engineer Location : Remote in Brazil Duration : 3 year+ PJ Contract Monthly Salary Range : 3-5k USD / Month Requirements : 3+ years of experience in Data Engineering Hands on data enginee...Mostre mais
Última atualização: 18 dias atrás • Promovida
Machine Learning Engineer

Machine Learning Engineer

dhauz • resende, Brasil
DHAUZ North America is a rapidly scaling AI start-up offering consulting and product development capabilities to high growth organizations. The offerings will range from providing strategic advice t...Mostre mais
Última atualização: 4 horas atrás • Promovida • Nova!
Data Scientist - Fluent English

Data Scientist - Fluent English

Artefact • Resende, Rio de Janeiro, Brazil
The current vacancy is for the Brazilian office and we work in a Free Office model.Who we are At Artefact LatAm, we believe in and live a culture based on empathy! A healthy work environment is a p...Mostre mais
Última atualização: 17 dias atrás • Promovida
Data Lead Engineer – Snowflake

Data Lead Engineer – Snowflake

Ampstek • resende, estado do rio de janeiro, Brasil
Data Lead Engineer – Snowflake.Technical Leadership : Provide technical direction and mentorship to a team of data engineers, ensuring best practices in coding, architecture, and data operations.End...Mostre mais
Última atualização: 15 horas atrás • Promovida • Nova!
Senior Data Engineer

Senior Data Engineer

Encora Inc. • Rio Claro, Rio de Janeiro, Brasil
Work Mode : Work from homeJob Summary.We are looking for a Senior Data Engineer with deep expertise in building, optimizing, and managing data pipelines and modern cloud-based data platforms.This rol...Mostre mais
Última atualização: 17 dias atrás • Promovida
Azure Data Engineer

Azure Data Engineer

Tata Consultancy Services • Rio Claro, Rio de Janeiro, Brasil
Come to one of the biggest IT Services companies in the world!!.Here you can transform your career!.Here at TCS we believe that people make the difference, that's why we live a culture of unlimited...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Sr Python Data Engineer

Sr Python Data Engineer

Softensity Inc • Resende, Rio de Janeiro, Brazil
Senior Python Data Engineer About the Project Responsibilities Design, build, and maintain high-performance data processing pipelines using Python libraries (Pandas, Polars).Develop and expose...Mostre mais
Última atualização: 7 dias atrás • Promovida
Lead Data Engineer

Lead Data Engineer

Elios Talent • Resende, Rio de Janeiro, Brasil
Lead Data EngineerKey Highlights.Lead the end-to-end design and build of a brand-new, greenfield analytics ecosystem.Architect data pipelines, orchestration, warehousing, and BI layers from the gro...Mostre mais
Última atualização: 1 dia atrás • Promovida
AWS Data Engineer

AWS Data Engineer

Tata Consultancy Services • Resende, Rio de Janeiro, Brazil
Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre mais
Última atualização: 10 dias atrás • Promovida
AI Gateway Engineer

AI Gateway Engineer

AVM Consulting Inc • Resende, Rio de Janeiro, Brazil
AI Gateway Engineer We are seeking a skilled AI Gateway Engineer to join our team.The ideal candidate will have hands-on experience with AI or API gateways, a strong background in backend develo...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Databricks Data Engineer

Databricks Data Engineer

GlobalSource IT • Volta Redonda, Rio de Janeiro, Brazil
Databricks Data Engineer Fully Remote Contract We’re looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta La...Mostre mais
Última atualização: 2 dias atrás • Promovida
Python / Data Engineer

Python / Data Engineer

Luxoft • Resende, Brasil
Join a team focused on backend and data engineering for headend applications at a leading video content provider.This role emphasizes Python-based services, AWS data workflows, and integration with...Mostre mais
Última atualização: 5 dias atrás • Promovida