Talent.com
ML Data Pipeline Engineer

ML Data Pipeline Engineer

Prosiglieregramado, Brasil
Há 3 dias
Descrição da vaga

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.

This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement

  • Maintain and enhance our multi-source data collection system : IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
  • Improve video capture software robustness, particularly handling network interruptions and operational monitoring.
  • Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation

  • Evolve our Python-based QC engine that validates data pre- and post-annotation
  • Implement checks for IMU-video time synchronization, sensor health, and measurement consistency
  • Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
  • Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
  • Analysis & Troubleshooting

  • Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes
  • Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
  • Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes
  • Tooling and Visualization

  • Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
  • Create visualizations (Chart.js) for QC metrics and signal analysis
  • Integrate with LabelStudio annotation interface
  • What You Bring

    Required

  • Strong Python programming skills, particularly for data processing pipelines
  • Experience with time-series data and digital signal processing
  • Comfortable working in Linux environments and deploying / monitoring remote services
  • Ability to debug complex multi-component systems (sensors, video, networks, sync)
  • Data quality mindset : designing validation rules, tracking metrics, investigating anomalies
  • SQL / database experience for managing pipeline metadata
  • Highly Valued

  • Video processing experience (RTSP streams, encoding, OCR)
  • Working with sensor / IoT data and handling connectivity challenges
  • NextJS or modern web frameworks for data tooling
  • DevOps practices : containerization, monitoring, logging, alerting
  • Experience with annotation pipelines and ML training data workflows
  • Background in biomechanics, sports science, or wearable sensors
  • Tech Stack

  • Languages : Python (primary), JavaScript / TypeScript (NextJS UI)
  • Data : IMU sensor streams, video (RTSP), time-series analysis, DSP
  • Tools : LabelStudio, Chart.js, Linux / bash, OCR libraries
  • Infrastructure : Remote deployment, monitoring systems
  • You'll Thrive Here If You

  • Enjoy detective work : diagnosing why data doesn't match expectations
  • Balance pragmatism with quality : shipping improvements while maintaining reliability
  • Communicate well across technical and non-technical stakeholders
  • Can work autonomously in a small, mission-driven team
  • Criar um alerta de emprego para esta pesquisa

    Data Engineer • gramado, Brasil

    Vagas relacionadas
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Encora Inc.Sapiranga, Rio Grande do Sul, Brazil
    Important Information Location : Brazil Job Mode : Full-time Work Mode : Work from home Job Summary We are looking for a Senior Data Engineer with deep expertise in building, optimizing, and manag...Mostre maisÚltima atualização: 5 dias atrás
    • Promovida
    Sr. Data Engineer

    Sr. Data Engineer

    Teclanovo hamburgo, Brazil
    Native / Bilingual English is required for this role (read / written / spoken).Please upload your CV Resume in English.Our partner is looking for a hands-on and entrepreneurial Data Engineer to build and...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Aws Developer (Data Lake)

    Aws Developer (Data Lake)

    MetaIvoti, Rio Grande do Sul, Brasil
    We are seeking a Senior AWS Developer to support the design and implementation of a large-scale data lake and analytics platform within a leading financial institution. The ideal candidate will demo...Mostre maisÚltima atualização: 2 dias atrás
    • Promovida
    AWS Developer (Data Lake)

    AWS Developer (Data Lake)

    Metaflores da cunha, Brazil
    We are seeking a Senior AWS Developer to support the design and implementation of a large-scale data lake and analytics platform within a leading financial institution. The ideal candidate will demo...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Scientist

    Data Scientist

    Progress Rail, A Caterpillar Companyflores da cunha, Brazil
    Progress Rail’s Uptime team is seeking a talented AI / ML to drive innovation and deliver impactful business solutions through advanced analytics, machine learning, and artificial intelligence.This r...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    • Nova!
    ML Data Pipeline Engineer...

    ML Data Pipeline Engineer...

    ProsigliereFlores da Cunha, Rio Grande do Sul, BR
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 8 horas atrás
    • Promovida
    Data Engineer

    Data Engineer

    Tata Consultancy ServicesMontenegro, Rio Grande do Sul, Brazil
    Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a cu...Mostre maisÚltima atualização: há mais de 30 dias
    • Promovida
    Data Engineer - Fluent English

    Data Engineer - Fluent English

    ArtefactSapucaia do Sul, Rio Grande do Sul, Brazil
    The current vacancy is for the Brazilian office and we work in a Free Office model.Who we are At Artefact LatAm, we believe in and live a culture based on empathy! A healthy work environment is a p...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

    Senior Infrastructure Linux Engineer (L2 | Production | Market Data | Remote | Brazil)

    Amaris Consultingcaxias do sul, Brazil
    Amaris Consulting is an independent technology consulting firm with a global footprint, bringing together diverse talents from various backgrounds to deliver innovative solutions to clients worldwi...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data Engineer_ Snowflake

    Data Engineer_ Snowflake

    Criticalriver Inc.Gramado, Rio Grande do Sul, Brasil
    Job Description : Senior Snowflake Data Engineer (Migration & Modelling).Location : Brazil / Costa Rica (Remote).Job Type : Contract / Project-Based (Staff Augmentation). Note : Need to support PST timi...Mostre maisÚltima atualização: 14 dias atrás
    • Promovida
    AI Gateway Engineer

    AI Gateway Engineer

    AVM Consulting Incestância velha, Brazil
    The ideal candidate will have hands-on experience with AI or API gateways, a strong background in backend development, and expertise in deploying and optimizing AI / ML models in a production environ...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Senior Data Engineer

    Senior Data Engineer

    Pride Globalivoti, Brazil
    We're Hiring : Senior Data Engineer | Remote from Brazil | Fluent English required.Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work wit...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Data QA Engineer

    Data QA Engineer

    Microtalent is becoming INSPYR Global SolutionsCaxias do Sul, Espírito Santo, Brazil
    Employment type : Direct Hire – Full-time, with all benefits required by Brazil law Salary range : Competitive and negotiable based on experience Language : Bilingual (Advanced English – excellent ...Mostre maisÚltima atualização: 5 dias atrás
    • Promovida
    Data Engineer | Azure Data Platform (Remote)

    Data Engineer | Azure Data Platform (Remote)

    Neo BI SolutionGramado, Rio Grande do Sul, Brazil
    We’re expanding our Data Platform Operations team and looking for an experienced Data Engineer with strong Azure administration and software engineering skills. This role combines operational...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    AI Engineer (NLP)

    AI Engineer (NLP)

    ProsigliereEstância Velha, Rio Grande do Sul, Brazil
    We're looking for a Senior ML / AI Engineer to own and evolve our LLM-powered user experience.You'll work directly with our technical co-founder to build, optimize, and monitor agent systems that par...Mostre maisÚltima atualização: 2 dias atrás
    • Promovida
    Cloud Engineer with Data Platforms Experience

    Cloud Engineer with Data Platforms Experience

    TurnKey Tech Staffingflores da cunha, Brazil
    For more than 30 years, Carnegie has been a leader and innovator in higher education marketing and enrollment strategy, offering groundbreaking services in the areas of Research, Strategy, Digital ...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    Ml Data Pipeline Engineer

    Ml Data Pipeline Engineer

    ProsigliereSão Leopoldo, Rio Grande do Sul, Brasil
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 1 dia atrás
    • Promovida
    ML Data Pipeline Engineer

    ML Data Pipeline Engineer

    ProsigliereGramado, Rio Grande do Sul, Brazil
    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure.You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prep...Mostre maisÚltima atualização: 3 dias atrás