Job Description of AWS DocumentDB Developer
5+ Years Relevant Experience
Must Have Skills
- Proficiency in programming languages such as Python, Scala, or similar
- Solid understanding of machine learning frameworks such as TensorFlow and PyTorch
- Strong experience in data classification, including the identification of PII data entities
- Knowledge and experience with retrieval-augmented generation (RAG) and agent-based workflows
- Deep understanding of how to re-rank and improve LLM outputs using Index and Vector stores
- Ability to leverage AWS services (e.g., SageMaker, Comprehend, Entity Resolution) to solve complex data and AI-related challenges
- Ability to manage and deploy machine learning models and frameworks at scale using AWS infrastructure
- Strong analytical and problem-solving skills, with the ability to innovate and develop new approaches to data engineering and AI/ML
- Experience with AWS ETL services (such as AWS Glue, Lambda, and Data Pipeline) to handle data processing and integration tasks effectively
- Experience in core AWS Services including AWS IAM, VPC, EC2, S3, RDS, Lambda, CloudWatch, CloudTrail
- Design, develop, and implement scalable and high-performance database solutions using Amazon DocumentDB to store and manage JSON-like data
- Expertise in data modelling with NoSQL databases such as DocumentDB
- Manage and optimize DocumentDB clusters, ensuring reliability, scalability, and performance in a production environment
- Develop and maintain database queries, schemas, and indices for efficient data retrieval
- Work with developers and architects to ensure seamless integration of DocumentDB with other AWS services and applications
- Create and manage backups, disaster recovery plans, and data security protocols for DocumentDB
- Monitor and troubleshoot database performance issues with DocumentDB, performing root cause analysis and resolution
- Experience with data privacy and compliance requirements, especially related to PII data
- Familiarity with advanced data indexing techniques, vector databases, and other technologies that improve the quality of AI/ML outputs
- Experience in the development of Event-Driven Distributed Systems in the Cloud using Serverless Architecture
- Ability to design and develop Data Pipelines in AWS Cloud using Glue
- Ability to design, orchestrate, and schedule jobs using Airflow
- Familiarity with using Large Language Models (LLMs) for data classification and identification of PII data entities
- Knowledge of AWS AI services such as AWS Entity Resolution and Amazon Comprehend
Project Overview
- It’s one of the workstreams of Project Acuity
- The PASD Data Platform includes a centralized web application for internal PASD users across the Recruitment Business to support marketing and operational use cases
- Building a database at the patient level will provide significant benefit to PASD’s future reporting capabilities and engagement of external stakeholders
Role Scope / Deliverables
- Highly skilled and motivated AWS DocumentDB Developer
- Extensive experience working with Amazon DocumentDB
- Solid skills in building, maintaining, and optimizing database solutions within the AWS ecosystem
- Knowledge and some experience with AWS ETL services (such as AWS Glue, Lambda, and Data Pipeline) for data processing and integration
Required Skills for AWS DocumentDB Developer Job
- AWS
- Python
- PySpark
- Spark
- CI/CD pipelines (GitHub Actions, Jenkins)
- Data Lakes
Our Hiring Process
- Screening (HR Round)
- Technical Round 1
- Technical Round 2
- Final HR Round