Talent.com
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosiglieremogi mirim, Brasil
Há 1 dia
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

  • Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
  • Improve video capture software robustness, particularly handling network interruptions and operational monitoring.
  • Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

  • Evolve our Python-based QC engine that validates data pre- and post-annotation
  • Implement checks for IMU-video time synchronization, sensor health, and measurement consistency
  • Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
  • Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
  • Analysis & Troubleshooting

  • Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes
  • Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
  • Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes
  • Tooling and Visualization

  • Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
  • Create visualizations (Chart.js) for QC metrics and signal analysis
  • Integrate with LabelStudio annotation interface
  • What You Bring

    Required

  • Strong Python programming skills, particularly for data processing pipelines
  • Experience with time-series data and digital signal processing
  • Comfortable working in Linux environments and deploying / monitoring remote services
  • Ability to debug complex multi-component systems (sensors, video, networks, sync)
  • Data quality mindset : designing validation rules, tracking metrics, investigating anomalies
  • SQL / database experience for managing pipeline metadata
  • Highly Valued

  • Video processing experience (RTSP streams, encoding, OCR)
  • Working with sensor / IoT data and handling connectivity challenges
  • NextJS or modern web frameworks for data tooling
  • DevOps practices : containerization, monitoring, logging, alerting
  • Experience with annotation pipelines and ML training data workflows
  • Background in biomechanics, sports science, or wearable sensors
  • Tech Stack

  • Languages : Python (primary), JavaScript / TypeScript (NextJS UI)
  • Data : IMU sensor streams, video (RTSP), time-series analysis, DSP
  • Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries
  • Infrastructure : Remote deployment, monitoring systems
  • You'll Thrive Here If You

  • Enjoy detective work : diagnosing why data doesn't match expectations
  • Balance pragmatism with quality : shipping improvements while maintaining reliability
  • Communicate well across technical and non-technical stakeholders
  • Can work autonomously in a small, mission-driven team
  • Criar um alerta de emprego para esta pesquisa

    Data Engineer • mogi mirim, Brasil

    Vagas relacionadas
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    GeorgiaTEK Systems Inc.araras, Brasil
    We’re Hiring – Snowflake Lead Developer (Azure & ADF).We are seeking a highly skilled.The ideal candidate will have hands-on experience designing and developing scalable data architectures, buildin...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Platform Engineer

    Data Platform Engineer

    Avenue Codeararas, Brasil
    About the Role and Reponsabilities : .In this role you will have the chance to build large additions to our platform ecosystem, contributing to an infrastructure that centralizes our ETL and streamin...Mostre maisÚltima atualização: 15 dias atrás
    • Promovida
    • Nova!
    Aws Developer (Data Lake)

    Aws Developer (Data Lake)

    MetaAraras, São Paulo, Brasil
    We are seeking a Senior AWS Developer to support the design and implementation of a large-scale data lake and analytics platform within a leading financial institution. The ideal candidate will demo...Mostre maisÚltima atualização: 11 horas atrás
    • Promovida
    Data QA Engineer

    Data QA Engineer

    Microtalent is becoming INSPYR Global SolutionsJaguariúna, São Paulo, Brazil
    Employment type : Direct Hire – Full-time, with all benefits required by Brazil law Salary range : Competitive and negotiable based on experience Language : Bilingual (Advanced English – excellent ...Mostre maisÚltima atualização: 3 dias atrás
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Eightpointararas, Brasil
    Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving significant growth for pa...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    AWS Developer (Data Lake)

    AWS Developer (Data Lake)

    Metamogi guaçu, Brasil
    We are seeking a Senior AWS Developer to support the design and implementation of a large-scale data lake and analytics platform within a leading financial institution. The ideal candidate will demo...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

    Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

    Amaris Consultingmogi mirim, Brasil
    Amaris Consulting is an independent technology consulting firm with a global footprint, bringing together diverse talents from various backgrounds to deliver innovative solutions to clients worldwi...Mostre maisÚltima atualização: 22 dias atrás
    • Promovida
    ML Data Pipeline Engineer

    ML Data Pipeline Engineer

    ProsigliereAraras, São Paulo, Brazil
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    • Nova!
    Senior Data Engineer

    Senior Data Engineer

    BRQ Digital SolutionsJaguariúna, São Paulo, Brazil
    Sobre a BRQ Digital Há 31 anos no mercado, a BRQ Digital Solutions se consolidou como uma das maiores empresas de transformação digital do país. Com uma plataforma de serviços end to end, oferecemos...Mostre maisÚltima atualização: 12 horas atrás
    • Promovida
    Cloud Engineer with Data Platforms Experience

    Cloud Engineer with Data Platforms Experience

    TurnKey Tech Staffingararas, Brasil
    For more than 30 years, Carnegie has been a leader and innovator in higher education marketing and enrollment strategy, offering groundbreaking services in the areas of Research, Strategy, Digital ...Mostre maisÚltima atualização: 22 dias atrás
    • Promovida
    "#982228 Senior Data Engineer"

    "#982228 Senior Data Engineer"

    Dexianararas, Brasil
    A Dexian, lançada em 2023, tem presença global e traz consigo quase 30 anos de experiência através de suas companhias legadas, principalmente da combinação da DISYS e Signature Consultants.Iniciamo...Mostre maisÚltima atualização: 22 dias atrás
    • Promovida
    Azure Data Engineer

    Azure Data Engineer

    Tata Consultancy Servicesararas, Brasil
    Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre maisÚltima atualização: 15 dias atrás
    • Promovida
    GCP Data Engineer

    GCP Data Engineer

    Tata Consultancy Serviceslimeira, Brasil
    Venha para uma das maiores empresas de Serviços IT do mundo!! Aqui você pode transformar sua carreira!.Por que fazer parte da TCS? Aqui na TCS acreditamos que as pessoas fazem a diferença, por isso...Mostre maisÚltima atualização: 24 dias atrás
    • Promovida
    AI Gateway Engineer

    AI Gateway Engineer

    AVM Consulting Incpaulínia, Brasil
    The ideal candidate will have hands-on experience with AI or API gateways, a strong background in backend development, and expertise in deploying and optimizing AI / ML models in a production environ...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    Data Engineer

    Data Engineer

    Tata Consultancy ServicesPaulínia, São Paulo, Brazil
    Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cu...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Encora Inc.Paulínia, São Paulo, Brazil
    Important Information Location : Brazil Job Mode : Full-time Work Mode : Work from home Job Summary We are looking for a Senior Data Engineer with deep expertise in building, optimizing, and manag...Mostre maisÚltima atualização: 3 dias atrás
    • Promovida
    Sr. Data Engineer

    Sr. Data Engineer

    Teclaararas, Brasil
    Native / Bilingual English is required for this role (read / written / spoken).Please upload your CV Resume in English.Our partner is looking for a hands-on and entrepreneurial Data Engineer to build and...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Microtalent is becoming INSPYR Global Solutionsjaguariúna, Brasil
    Offer 100% remotly ONLY Brazil.The Senior Cloud Data Engineer leads the design, architecture, and implementation of secure, scalable data solutions on AWS, utilizing Snowflake, dbt, and modern auto...Mostre maisÚltima atualização: 24 dias atrás