About the Role
We are seeking an Intermediate Data Engineer to support our data infrastructure initiatives by connecting analytics systems, managing data pipelines, and enabling our teams with clean, accessible data. This role will focus on integrating key data sources, developing efficient pipelines, and ensuring seamless data flow across platforms like Google Analytics , BigQuery , and Databricks .
Key Responsibilities
- Data Integration :
- Integrate Google Analytics with BigQuery to centralize and structure web analytics data.
- Establish and optimize the connection between BigQuery and cloud Databricks environments for downstream analytics and modeling.
Pipeline Development :
Design, build, and maintain data pipelines within Databricks to support the sandbox team and analytical workloads.Ensure pipelines are reliable, scalable, and adhere to best practices for data quality and performance.Collaboration & Support :
Partner with data analysts, engineers, and product teams to understand data needs and translate them into efficient engineering solutions.Assist in implementing version control, CI / CD processes, and monitoring for data workflows.Requirements
2–4 years of experience as a Data Engineer or in a similar data infrastructure role.Strong proficiency in SQL and experience with BigQuery .Hands-on experience working with Databricks (or similar cloud-based data platforms).Understanding of data pipeline orchestration , ETL processes , and data modeling concepts.Familiarity with Google Analytics data export and schema.Experience with Python , PySpark , or other data engineering languages is a plus.Nice to Have
Exposure to cloud environments (GCP, AWS, or Azure).Experience with Airflow , DBT , or other orchestration tools.Knowledge of data governance and data security best practices.Why Join Us
Be part of a growing data ecosystem that directly drives business insights.Collaborate with cross-functional teams across engineering, analytics, and product.Opportunity to work on modern cloud infrastructure and cutting-edge data tools.