Talent.com
ML Data Pipeline Engineer
ML Data Pipeline EngineerProsigliere • viana, estado do espírito santo, Brasil
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliere • viana, estado do espírito santo, Brasil
Há 18 horas
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

  • Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
  • Improve video capture software robustness, particularly handling network interruptions and operational monitoring.
  • Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

  • Evolve our Python-based QC engine that validates data pre- and post-annotation
  • Implement checks for IMU-video time synchronization, sensor health, and measurement consistency
  • Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
  • Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
  • Analysis & Troubleshooting

  • Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes
  • Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
  • Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes
  • Tooling and Visualization

  • Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
  • Create visualizations (Chart.js) for QC metrics and signal analysis
  • Integrate with LabelStudio annotation interface
  • What You Bring

    Required

  • Strong Python programming skills, particularly for data processing pipelines
  • Experience with time-series data and digital signal processing
  • Comfortable working in Linux environments and deploying / monitoring remote services
  • Ability to debug complex multi-component systems (sensors, video, networks, sync)
  • Data quality mindset : designing validation rules, tracking metrics, investigating anomalies
  • SQL / database experience for managing pipeline metadata
  • Highly Valued

  • Video processing experience (RTSP streams, encoding, OCR)
  • Working with sensor / IoT data and handling connectivity challenges
  • NextJS or modern web frameworks for data tooling
  • DevOps practices : containerization, monitoring, logging, alerting
  • Experience with annotation pipelines and ML training data workflows
  • Background in biomechanics, sports science, or wearable sensors
  • Tech Stack

  • Languages : Python (primary), JavaScript / TypeScript (NextJS UI)
  • Data : IMU sensor streams, video (RTSP), time-series analysis, DSP
  • Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries
  • Infrastructure : Remote deployment, monitoring systems
  • You'll Thrive Here If You

  • Enjoy detective work : diagnosing why data doesn't match expectations
  • Balance pragmatism with quality : shipping improvements while maintaining reliability
  • Communicate well across technical and non-technical stakeholders
  • Can work autonomously in a small, mission-driven team
  • Criar um alerta de emprego para esta pesquisa

    Data Engineer • viana, estado do espírito santo, Brasil

    Vagas relacionadas
    Sr Python Data Engineer

    Sr Python Data Engineer

    Softensity Inc • Vila Velha, Espírito Santo, Brazil
    Senior Python Data Engineer About the Project Responsibilities Design, build, and maintain high-performance data processing pipelines using Python libraries (Pandas, Polars).Develop and expose...Mostre mais
    Última atualização: 7 dias atrás • Promovida
    Lead Data Engineer

    Lead Data Engineer

    Elios Talent • Vitoria, Espírito Santo, Brasil
    Lead Data EngineerKey Highlights.Lead the end-to-end design and build of a brand-new, greenfield analytics ecosystem.Architect data pipelines, orchestration, warehousing, and BI layers from the gro...Mostre mais
    Última atualização: 2 dias atrás • Promovida
    Databricks Data Engineer

    Databricks Data Engineer

    GlobalSource IT • Cariacica, Espírito Santo, Brazil
    Databricks Data Engineer Fully Remote Contract We’re looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta La...Mostre mais
    Última atualização: 3 dias atrás • Promovida
    Python Developer

    Python Developer

    NTT DATA, Inc. • guarapari, Brasil
    Estamos, segundo o Great Place to Work, dentre as melhores empresas de Tecnologia para se trabalhar no Brasil.Temos uma cultura de trabalho inclusiva e prezamos muito pela liberdade de melhorar pro...Mostre mais
    Última atualização: 3 dias atrás • Promovida
    Data Engineer

    Data Engineer

    Insight Global • Região V Praia do Canto-Um, Espírito Santo, Brazil
    Position : Data Engineer Location : Remote in Brazil Duration : 3 year+ PJ Contract Monthly Salary Range : 3-5k USD / Month Requirements : 3+ years of experience in Data Engineering Hands on data enginee...Mostre mais
    Última atualização: 19 dias atrás • Promovida
    ML Data Pipeline Engineer

    ML Data Pipeline Engineer

    Prosigliere • Cariacica, Espírito Santo, Brazil
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre mais
    Última atualização: 19 dias atrás • Promovida
    Data Engineer

    Data Engineer

    HeartCentrix Solutions • cariacica, Brasil
    We are seeking a highly skilled.Python Data Engineer with an AI / ML focus.This role is ideal for someone who loves building scalable data pipelines, operationalizing machine learning workflows, and ...Mostre mais
    Última atualização: 1 dia atrás • Promovida
    Data Engineer...

    Data Engineer...

    Insight Global • cariacica, estado do espírito santo, BR
    Position : Data Engineer Location : Remote in Brazil Duration : 3 year+ PJ Contract Monthly Salary Range : 3-5k USD / Month Requirements : - 3+ years of experience in Data Engineering - Hands on data...Mostre mais
    Última atualização: 10 horas atrás • Promovida • Nova!
    AML / CFT Compliance Expert

    AML / CFT Compliance Expert

    Bybit • serra, Brasil
    Develop, implement, and continuously enhance AML / CFT and sanctions policies in alignment with Brazilian regulatory requirements and global best practices. Act as a primary compliance interface with ...Mostre mais
    Última atualização: 1 dia atrás • Promovida
    Python / Data Engineer

    Python / Data Engineer

    Luxoft • Região V Praia do Canto-Um, Brasil
    Join a team focused on backend and data engineering for headend applications at a leading video content provider.This role emphasizes Python-based services, AWS data workflows, and integration with...Mostre mais
    Última atualização: 6 dias atrás • Promovida
    Data Lead Engineer – Snowflake

    Data Lead Engineer – Snowflake

    Ampstek • viana, estado do espírito santo, Brasil
    Data Lead Engineer – Snowflake.Technical Leadership : Provide technical direction and mentorship to a team of data engineers, ensuring best practices in coding, architecture, and data operations.End...Mostre mais
    Última atualização: 1 dia atrás • Promovida
    Azure Data Engineer

    Azure Data Engineer

    Tata Consultancy Services • Cariacica, Espírito Santo, Brasil
    Come to one of the biggest IT Services companies in the world!!.Here you can transform your career!.Here at TCS we believe that people make the difference, that's why we live a culture of unlimited...Mostre mais
    Última atualização: há mais de 30 dias • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Pride Global • Cariacica, Espírito Santo, Brazil
    We're Hiring : Senior Data Engineer | Remote from Brazil | Fluent English required | Location : Remote – Brazil only Contact : Temporary Are you passionate about building scalable data platforms...Mostre mais
    Última atualização: há mais de 30 dias • Promovida
    Master Data Manager

    Master Data Manager

    Pride Global • serra, Brasil
    Vaga : MDM Tester – Híbrido em São Paulo.Testar e validar dados mestres, assegurando qualidade, consistência e precisão entre diferentes sistemas. .Investigar gaps de conhecimento e perfilar fontes d...Mostre mais
    Última atualização: 8 dias atrás • Promovida
    Sr. SAP Developer – CPI (Cloud Platform Integration) – Advanced English

    Sr. SAP Developer – CPI (Cloud Platform Integration) – Advanced English

    HCLTech • serra, Brasil
    HCLTech is a global technology company, spread across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of tech...Mostre mais
    Última atualização: 1 dia atrás • Promovida
    Python / Fast API Development Lead (USD - Remote)

    Python / Fast API Development Lead (USD - Remote)

    Vintti • serra, Brasil
    Full availability (8 hrs / day) with at least 4 hours overlap with PST (8pm–12pm IST).Python, FastAPI, RESTful APIs, asynchronous programming, software engineering best practices, team leadership, au...Mostre mais
    Última atualização: 1 dia atrás • Promovida
    Data Lead Engineer - Snowflake

    Data Lead Engineer - Snowflake

    Ampstek • Vitória, Espírito Santo, Brazil
    Data Lead Engineer – Snowflake Remote Contract Brazil / Mexico Responsibilities • Technical Leadership : Provide technical direction and mentorship to a team of data engineers, ensuring best pract...Mostre mais
    Última atualização: 4 dias atrás • Promovida
    Databricks Data Engineer

    Databricks Data Engineer

    Globalsource It • Vila Velha, Espírito Santo, Brasil
    Databricks Data EngineerFully Remote Contract We're looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta Lake...Mostre mais
    Última atualização: 2 dias atrás • Promovida