Talent.com
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosigliereresende, Brazil
Há 1 dia
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

  • Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
  • Improve video capture software robustness, particularly handling network interruptions and operational monitoring.
  • Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

  • Evolve our Python-based QC engine that validates data pre- and post-annotation
  • Implement checks for IMU-video time synchronization, sensor health, and measurement consistency
  • Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
  • Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
  • Analysis & Troubleshooting

  • Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes
  • Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
  • Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes
  • Tooling and Visualization

  • Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
  • Create visualizations (Chart.js) for QC metrics and signal analysis
  • Integrate with LabelStudio annotation interface
  • What You Bring

    Required

  • Strong Python programming skills, particularly for data processing pipelines
  • Experience with time-series data and digital signal processing
  • Comfortable working in Linux environments and deploying / monitoring remote services
  • Ability to debug complex multi-component systems (sensors, video, networks, sync)
  • Data quality mindset : designing validation rules, tracking metrics, investigating anomalies
  • SQL / database experience for managing pipeline metadata
  • Highly Valued

  • Video processing experience (RTSP streams, encoding, OCR)
  • Working with sensor / IoT data and handling connectivity challenges
  • NextJS or modern web frameworks for data tooling
  • DevOps practices : containerization, monitoring, logging, alerting
  • Experience with annotation pipelines and ML training data workflows
  • Background in biomechanics, sports science, or wearable sensors
  • Tech Stack

  • Languages : Python (primary), JavaScript / TypeScript (NextJS UI)
  • Data : IMU sensor streams, video (RTSP), time-series analysis, DSP
  • Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries
  • Infrastructure : Remote deployment, monitoring systems
  • You'll Thrive Here If You

  • Enjoy detective work : diagnosing why data doesn't match expectations
  • Balance pragmatism with quality : shipping improvements while maintaining reliability
  • Communicate well across technical and non-technical stakeholders
  • Can work autonomously in a small, mission-driven team
  • Criar um alerta de emprego para esta pesquisa

    Data Engineer • resende, Brazil

    Vagas relacionadas
    • Promovida
    ML Data Pipeline Engineer

    ML Data Pipeline Engineer

    ProsigliereVolta Redonda, Rio de Janeiro, Brazil
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 4 dias atrás
    • Promovida
    Azure Data Engineer

    Azure Data Engineer

    Tata Consultancy Servicesresende, Brazil
    Come to one of the biggest IT Services companies in the world!! Here you can transform your career!.Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cul...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Cloud Engineer with Data Platforms Experience

    Cloud Engineer with Data Platforms Experience

    TurnKey Tech Staffingresende, Brazil
    For more than 30 years, Carnegie has been a leader and innovator in higher education marketing and enrollment strategy, offering groundbreaking services in the areas of Research, Strategy, Digital ...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Platform Engineer

    Data Platform Engineer

    Avenue Coderesende, Brazil
    About the Role and Reponsabilities : .In this role you will have the chance to build large additions to our platform ecosystem, contributing to an infrastructure that centralizes our ETL and streamin...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Encora Inc.Resende, Rio de Janeiro, Brasil
    Work Mode : Work from homeJob Summary.We are looking for a Senior Data Engineer with deep expertise in building, optimizing, and managing data pipelines and modern cloud-based data platforms.This rol...Mostre maisÚltima atualização: 2 dias atrás
    • Promovida
    Ml Data Pipeline Engineer

    Ml Data Pipeline Engineer

    ProsigliereResende, Rio de Janeiro, Brasil
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Engineer

    Data Engineer

    Tata Consultancy ServicesVolta Redonda, Rio de Janeiro, Brazil
    Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cu...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    Data Engineer - Fluent English

    Data Engineer - Fluent English

    Artefactvolta redonda, Brazil
    The current vacancy is for the Brazilian office and we work in a Free Office model.At Artefact LatAm, we believe in and live a culture based on empathy!. A healthy work environment is a place where ...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Scientist

    Data Scientist

    Progress Rail, A Caterpillar Companyvolta redonda, Brazil
    Progress Rail’s Uptime team is seeking a talented AI / ML to drive innovation and deliver impactful business solutions through advanced analytics, machine learning, and artificial intelligence.This r...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Scientist - Fluent English

    Data Scientist - Fluent English

    Artefactresende, Brazil
    The current vacancy is for the Brazilian office and we work in a Free Office model.At Artefact LatAm, we believe in and live a culture based on empathy!. A healthy work environment is a place where ...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Pride GlobalVolta Redonda, Rio de Janeiro, Brazil
    We're Hiring : Senior Data Engineer | Remote from Brazil | Fluent English required | Location : Remote – Brazil only Contact : Temporary Are you passionate about building scalable data platforms...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    AI Gateway Engineer

    AI Gateway Engineer

    AVM Consulting IncResende, Rio de Janeiro, Brazil
    AI Gateway Engineer We are seeking a skilled AI Gateway Engineer to join our team.The ideal candidate will have hands-on experience with AI or API gateways, a strong background in backend develo...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

    Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

    Amaris Consultingresende, Brazil
    Amaris Consulting is an independent technology consulting firm with a global footprint, bringing together diverse talents from various backgrounds to deliver innovative solutions to clients worldwi...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    AWS Developer (Data Lake)

    AWS Developer (Data Lake)

    MetaVolta Redonda, Rio de Janeiro, Brazil
    About the Role We are seeking a Senior AWS Developer to support the design and implementation of a large-scale data lake and analytics platform within a leading financial institution.The ideal cand...Mostre maisÚltima atualização: 5 dias atrás
    • Promovida
    AI Engineer (NLP)

    AI Engineer (NLP)

    ProsigliereResende, Rio de Janeiro, Brazil
    We're looking for a Senior ML / AI Engineer to own and evolve our LLM-powered user experience.You'll work directly with our technical co-founder to build, optimize, and monitor agent systems that par...Mostre maisÚltima atualização: 2 dias atrás
    • Promovida
    Sr. Data Engineer

    Sr. Data Engineer

    TeclaResende, Rio de Janeiro, Brazil
    Native / Bilingual English is required for this role (read / written / spoken) Please upload your CV Resume in English.Monthly salary : $4,000 - $5,000 USD Our partner is looking for a hands-on and ent...Mostre maisÚltima atualização: 4 dias atrás
    • Promovida
    • Nova!
    Data Modeler

    Data Modeler

    TurnKey Tech Staffingvolta redonda, Brasil
    For more than 30 years, Carnegie has been a leader and innovator in higher education marketing and enrollment strategy, offering groundbreaking services in the areas of Research, Strategy, Digital ...Mostre maisÚltima atualização: menos de 1 hora atrás
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Eightpointvolta redonda, Brazil
    Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving significant growth for pa...Mostre maisÚltima atualização: 1 dia atrás