Data Scientist (Remote)
Our client is looking for a Data Scientist to support the development and implementation of advanced data and AI strategies. The ideal professional is passionate about data-driven innovation and eager to work with cutting-edge technologies in Machine Learning and Generative AI.
Key Responsibilities
Develop and implement data strategies to support business objectives and innovation.
Collaborate with business and technology teams to identify opportunities for data-driven improvements.
Design, train, and validate Generative AI and Machine Learning models.
Monitor and optimize NLP and Generative AI model performance to ensure accuracy and relevance.
Evaluate and fine-tune pre-trained models to address specific business challenges.
Ensure data privacy, security, and bias mitigation in AI models.
Build prediction and recommendation models, providing actionable insights.
Collaborate with data engineers to design efficient data pipelines.
Maintain data inventories and dictionaries, ensuring data quality and consistency.
Drive innovation by proposing improvements and new business solutions using Generative AI.
Stay current with advances in AI, NLP, and emerging frameworks.
Promote a data-driven culture within cross-functional teams.
Requirements
Bachelor’s degree in a technology-related field.
Solid experience with Machine Learning algorithms , from design to deployment and automation.
Proven background developing Data Science solutions : optimization, classification, prediction, statistical analysis, and NLP.
Hands-on experience with LLMs , RAG , and Generative AI (e.g., Amazon Bedrock).
Strong proficiency in Python , statistics , and AI / ML frameworks .
Advanced knowledge of relational (SQL Server, PostgreSQL) and non-relational databases .
Experience with Databricks (Spark optimization, Delta Lake, MLflow).
Familiarity with AWS services such as S3, Athena, Glue, SageMaker, and QuickSight.
Experience implementing MLOps best practices.
Advanced English (verbal and written).
Nice to Have
Expertise in Big Data and advanced use of Databricks to accelerate AI and analytics initiatives.
Research or projects published in NLP or Generative AI fields.
Data Scientist • Votorantim, São Paulo, Brazil