Talent.com
ML Data Pipeline Engineer

Prosigliere • Ponta Grossa, Paraná, Brazil
18 days ago
Job description

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve the robustness of our video capture software, particularly its handling of network interruptions and its operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting

Perform ad-hoc analysis on more than 1,200 workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with the annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with the LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfortable working in Linux environments and deploying and monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset: designing validation rules, tracking metrics, investigating anomalies

SQL / database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor / IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices: containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages: Python (primary), JavaScript / TypeScript (NextJS UI)

Data: IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools: LabelStudio, Chart.js, Linux / bash, OCR libraries

Infrastructure: Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work: diagnosing why data doesn't match expectations

Balance pragmatism with quality: shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team
