Data Engineer (Female Candidates Only)

Posted on March 27, 2025

Apply Now

Job Description

  • Job Summary:
  • We are seeking a talented Female Data Engineer to design, build, and maintain scalable data pipelines and infrastructure. The ideal candidate will have expertise in ETL processes, cloud platforms, and big data technologies to support analytics and business intelligence teams.
  • Key Responsibilities:
  • Develop, maintain, and optimize ETL pipelines for data ingestion and transformation.
  • Design and manage data warehouses, lakes, and databases for efficient storage and retrieval.
  • Work with structured and unstructured data to ensure accuracy and integrity.
  • Implement data security, compliance, and governance best practices.
  • Collaborate with data analysts, scientists, and software engineers to support business needs.
  • Monitor data pipeline performance and troubleshoot issues as needed.
  • Automate workflows using Python, SQL, or Apache Airflow.
  • Required Skills & Qualifications:
  • 7+ years of experience as a Data Engineer, Big Data Engineer, or similar role.
  • Proficiency in SQL, Python, and Scala/Java for data processing.
  • Experience with cloud platforms (AWS, GCP, Azure) and big data tools (Hadoop, Spark, Kafka).
  • Strong knowledge of data modeling, ETL frameworks, and database management (PostgreSQL, Redshift, Snowflake, BigQuery, etc.).
  • Hands-on experience with data pipeline orchestration tools (Apache Airflow, Prefect, Dagster).
  • Understanding of CI/CD, containerization (Docker, Kubernetes), and DevOps practices for data engineering.
  • Excellent problem-solving skills and the ability to work in fast-paced environments.
  • Preferred Qualifications (Nice to Have):
  • Experience with machine learning pipelines and AI-driven data processing.
  • Familiarity with data governance frameworks and compliance (GDPR, HIPAA).
  • Certifications in AWS Data Engineering, Google Data Engineer, or similar.

Required Skills

sql python and scala/java cloud platforms hadoop spark kafka postgresql redshift snowflake bigquery