Overview
We are seeking a skilled Data Engineer to design, build, and maintain scalable data pipelines and data platforms that power analytics, machine learning, and business applications. This role will focus on building reliable and efficient data infrastructure using Snowflake and PostgreSQL, ensuring high data quality, performance, and availability.
The ideal candidate has strong experience with data modeling, ETL/ELT pipelines, and modern cloud data platforms, and enjoys working closely with data scientists, analysts, and product teams to deliver high-quality data solutions.
Key Responsibilities
Data Platform & Architecture
- Design and maintain scalable data architectures using Snowflake and PostgreSQL.
- Build and manage data warehouses and data marts to support analytics and reporting needs.
- Optimize database performance, storage, and query efficiency.
Data Pipelines & Integration
- Develop and maintain robust ETL/ELT pipelines to ingest, transform, and deliver data from multiple sources.
- Implement reliable data ingestion pipelines from operational systems, APIs, and external datasets.
- Ensure pipelines are scalable, fault-tolerant, and easy to monitor.
Data Modeling
- Design and implement data models and schemas to support analytical workloads.
- Implement best practices in data normalization, denormalization, and dimensional modeling.
- Ensure data consistency, integrity, and governance across systems.
Performance & Optimization
- Optimize Snowflake workloads, warehouse usage, and query performance.
- Tune PostgreSQL databases for performance, reliability, and scalability.
- Implement data partitioning, indexing, and performance optimization strategies.
Data Quality & Governance
- Implement data validation, monitoring, and quality checks across pipelines.
- Maintain data lineage and ensure data reliability for downstream consumers.
- Support governance initiatives including access control and data security.
Collaboration
- Partner with data scientists, ML engineers, and product teams to support data-driven initiatives.
- Provide reliable datasets for analytics, reporting, and machine learning workflows.
Required Qualifications
- Strong experience with Snowflake data warehouse architecture and performance optimization.
- Hands-on experience with PostgreSQL database design and administration.
- Experience building ETL/ELT pipelines and large-scale data processing workflows.
- Strong proficiency in SQL and Python.
- Experience working with cloud data platforms and modern data engineering practices.
Preferred Qualifications
- Experience with data orchestration tools
- Familiarity with data streaming technologies (Kafka etc.).
- Experience supporting machine learning or AI data pipelines.
- Knowledge of data governance, lineage, and cataloging tools.
- Experience with cloud platforms such as AWS, Azure, or GCP.