AWS AI Data Engineer
Posted on March 12, 2025
Job Description
- We are hiring an AWS AI Data Engineer for one of our MNC clients.
- Job Title: AWS AI Data Engineer
- Experience: 6+ Years
- Job Type: 6-Month Contract
- Location: OUS, with a minimum of 6 hours of overlap with US working hours
- Project Overview:
- The role is part of Project Acuity under the PASD Data Platform workstream, which includes building a centralized web application for internal PASD users across the Recruitment Business. The platform aims to support marketing and operational use cases, enhancing future reporting capabilities and engagement with external stakeholders.
- Role Scope / Deliverables:
- We are seeking an experienced AWS AI Data Engineer to develop, manage, and optimize data architectures supporting AI and Machine Learning (ML) workflows. The candidate will work with large-scale datasets, build scalable data pipelines, and integrate ML frameworks with AWS infrastructure.
- Must-Have Skills:
- Proficiency in programming languages such as Python, Scala, or similar.
- Strong understanding of ML frameworks like TensorFlow and PyTorch.
- Experience in data classification and identification of PII data entities.
- Knowledge of retrieval-augmented generation (RAG) and agent-based workflows.
- Experience improving LLM outputs using index and vector stores.
- Proficiency in AWS services such as SageMaker, Comprehend, and Entity Resolution.
- Ability to manage and deploy ML models at scale using AWS infrastructure.
- Strong problem-solving and analytical skills.
- Experience with AWS ETL services such as AWS Glue, Lambda, and Data Pipeline.
- Strong knowledge of core AWS services, including IAM, VPC, EC2, S3, RDS, Lambda, CloudWatch, and CloudTrail.
- Nice-to-Have Skills:
- Experience with data privacy and compliance related to PII data.
- Familiarity with advanced data indexing techniques and vector databases.
- Experience improving AI/ML outputs using advanced technologies.
Required Skills
- Strong understanding of ML frameworks like TensorFlow and PyTorch.
- AWS ETL services such as AWS Glue, Lambda, and Data Pipeline.
- Strong knowledge of core AWS services, including IAM, VPC, EC2, S3, RDS, Lambda, CloudWatch, and CloudTrail.