Cloud Operations Engineer (AWS)
Salary undisclosed
Apply on
Original
Simplified
Primary Responsibilities
- System Operations and Performance: Oversee daily operational tasks to ensure optimal system performance, uptime, and reliability. Develop and implement strategies to maintain business continuity and disaster recovery plans.
- System Health Monitoring: Perform daily system health checks, monitor system performance metrics, and proactively identify and report incidents before they escalate.
- Support for New Initiatives: Lead the support for new operations projects, including system integrations, acceptance testing, performance testing, and performance management.
- Technical Incident Resolution: Collaborate with internal and external stakeholders to troubleshoot and resolve technical incidents and service requests efficiently, ensuring compliance with SLA requirements.
- Onboarding and Data Support: Assist with onboarding new systems, applications, and stakeholders, focusing on data preparation, transfers, and provisioning to ensure a smooth process.
- Security and Platform Support: Provide timely support for application/platform security incidents by coordinating with internal teams and vendors to resolve issues swiftly.
- Asset Management: Track and maintain an accurate IT asset inventory to ensure all assets are accounted for and aligned with business requirements.
- Documentation and Compliance: Ensure all Standard Operating Procedures (SOPs) are documented, up-to-date, and in compliance with audit and regulatory requirements.
What I Am Looking For
- Educational Background: A Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related discipline.
- Relevant Experience: At least 3 years of experience in AWS Cloud, IT operations, and vendor management.
- Cloud Expertise: Hands-on experience with cloud-based services such as AWS, Azure, or Government Commercial Cloud (GCC). AWS Certification is a strong advantage.
- Security Knowledge: Proficiency in implementing security and access control measures, especially in controlling privileged access to test and production environments.
- Networking Proficiency: Familiarity with networking technologies, including WAN, LAN, firewalls, load balancers, VPNs, and DNS. Ability to troubleshoot connectivity and data transfer issues.
- Incident Management Skills: Proven experience in incident, problem, and change management processes. Ability to quickly diagnose issues and ensure timely resolutions.
- Data Analytics Systems: Experience working with data analytics systems is highly desirable and will be a major plus.
- Disaster Recovery Knowledge: Strong understanding of disaster recovery, system backups, and restore procedures.
- Technical Skills: Programming or scripting experience, with proficiency in Python, R, or Shell scripting preferred.
- Certifications: AWS SysOps Administrator certification is a plus but not mandatory.
- Soft Skills: A proactive, dedicated individual who excels at multitasking, working collaboratively with cross-functional teams, and is committed to driving operational excellence.
Click on Apply now to find out more about this opportunity and other available positions.
EA License: 22C1396
EA Personnel: R1551466
Similar Jobs