Talent.com
ML Data Pipeline Engineer
ML Data Pipeline EngineerProsigliere • Campina Grande do Sul, Paraná, Brazil
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliere • Campina Grande do Sul, Paraná, Brazil
Há 19 dias
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.

Improve video capture software robustness, particularly handling network interruptions and operational monitoring.

Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

Evolve our Python-based QC engine that validates data pre- and post-annotation

Implement checks for IMU-video time synchronization, sensor health, and measurement consistency

Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.

Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting

Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes

Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors

Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes

Tooling and Visualization

Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders

Create visualizations (Chart.js) for QC metrics and signal analysis

Integrate with LabelStudio annotation interface

What You Bring

Required

Strong Python programming skills, particularly for data processing pipelines

Experience with time-series data and digital signal processing

Comfortable working in Linux environments and deploying / monitoring remote services

Ability to debug complex multi-component systems (sensors, video, networks, sync)

Data quality mindset : designing validation rules, tracking metrics, investigating anomalies

SQL / database experience for managing pipeline metadata

Highly Valued

Video processing experience (RTSP streams, encoding, OCR)

Working with sensor / IoT data and handling connectivity challenges

NextJS or modern web frameworks for data tooling

DevOps practices : containerization, monitoring, logging, alerting

Experience with annotation pipelines and ML training data workflows

Background in biomechanics, sports science, or wearable sensors

Tech Stack

Languages : Python (primary), JavaScript / TypeScript (NextJS UI)

Data : IMU sensor streams, video (RTSP), time-series analysis, DSP

Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries

Infrastructure : Remote deployment, monitoring systems

You'll Thrive Here If You

Enjoy detective work : diagnosing why data doesn't match expectations

Balance pragmatism with quality : shipping improvements while maintaining reliability

Communicate well across technical and non-technical stakeholders

Can work autonomously in a small, mission-driven team

Criar um alerta de emprego para esta pesquisa

Data Engineer • Campina Grande do Sul, Paraná, Brazil

Vagas relacionadas
Data Engineer - Fluent English

Data Engineer - Fluent English

Artefact • Curitiba, BR
The current vacancy is for the Brazilian office and we work in a Free Office model.At Artefact LatAm, we believe in and live a culture based on empathy!. A healthy work environment is a place where ...Mostre mais
Última atualização: 17 dias atrás • Promovida
AI Engineer (NLP)

AI Engineer (NLP)

Prosigliere • Paranaguá, Paraná, Brazil
We're looking for a Senior ML / AI Engineer to own and evolve our LLM-powered user experience.You'll work directly with our technical co-founder to build, optimize, and monitor agent systems that par...Mostre mais
Última atualização: 17 dias atrás • Promovida
Data Engineer (Relocation to Portugal)

Data Engineer (Relocation to Portugal)

Affinity • Paranaguá, Paraná, Brazil
A Job? Or a Lifetime Experience? Start Yours Here! Our mission is to be a meaningful part of our people's careers.As we grow, so does our determination to offer the best experience to our employee...Mostre mais
Última atualização: há mais de 30 dias • Promovida
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliere • Curitiba, BR
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre mais
Última atualização: 19 dias atrás • Promovida
AI Gateway Engineer

AI Gateway Engineer

AVM Consulting Inc • Paranaguá, Paraná, Brazil
AI Gateway Engineer We are seeking a skilled AI Gateway Engineer to join our team.The ideal candidate will have hands-on experience with AI or API gateways, a strong background in backend develo...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Lead Data Pipeline Engineer

Lead Data Pipeline Engineer

Bebeebackend • Curitiba, Paraná, Brasil
About the RoleAt our company, we're building a cutting-edge data collection pipeline using Go and Typescript on AWS.We are seeking an experienced Senior Backend Developer to join our team on a 12-m...Mostre mais
Última atualização: 12 horas atrás • Promovida • Nova!
Databricks Data Engineer

Databricks Data Engineer

GlobalSource IT • Curitiba, BR
We’re looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta Lake. This role focuses on ingesting data from mult...Mostre mais
Última atualização: 4 dias atrás • Promovida
Data Lead Engineer – Snowflake

Data Lead Engineer – Snowflake

Ampstek • Curitiba, BR
Data Lead Engineer – Snowflake.Technical Leadership : Provide technical direction and mentorship to a team of data engineers, ensuring best practices in coding, architecture, and data operations.End...Mostre mais
Última atualização: 4 dias atrás • Promovida
Data Engineer

Data Engineer

Tata Consultancy Services • Curitiba, BR
Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Sr Python Data Engineer

Sr Python Data Engineer

Softensity Inc • Curitiba, BR
Design, build, and maintain high-performance data processing pipelines using Python libraries (Pandas, Polars).Develop and expose RESTful APIs using. Consume and process normalized Parquet files fro...Mostre mais
Última atualização: 8 dias atrás • Promovida
Python / Data Engineer

Python / Data Engineer

Luxoft • Curitiba, BR
Join a team focused on backend and data engineering for headend applications at a leading video content provider.This role emphasizes Python-based services, AWS data workflows, and integration with...Mostre mais
Última atualização: 7 dias atrás • Promovida
Senior Data Engineer

Senior Data Engineer

Pride Global • Curitiba, BR
We're Hiring : Senior Data Engineer | Remote from Brazil | Fluent English required.Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work wit...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Senior Data Engineer

Senior Data Engineer

Eightpoint • Campina Grande do Sul, Paraná, Brazil
About Eightpoint Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving signifi...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Data Engineer

Data Engineer

HeartCentrix Solutions • Curitiba, BR
We are seeking a highly skilled.Python Data Engineer with an AI / ML focus.This role is ideal for someone who loves building scalable data pipelines, operationalizing machine learning workflows, and ...Mostre mais
Última atualização: 1 dia atrás • Promovida
GCP Data Engineer

GCP Data Engineer

Tata Consultancy Services • Curitiba, BR
Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Azure Data Engineer

Azure Data Engineer

Tata Consultancy Services • Curitiba, BR
Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Data Engineer

Data Engineer

Insight Global • Curitiba, BR
Monthly Salary Range : 3-5k USD / Month.Hands on data engineering experience with Microsoft Fabric (highly preferred) or Azure Synapse / Databricks are also acceptable. Advanced Data Manipulation experie...Mostre mais
Última atualização: há mais de 30 dias • Promovida
AWS Data Engineer

AWS Data Engineer

Tata Consultancy Services • Curitiba, BR
Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre mais
Última atualização: 11 dias atrás • Promovida