Epicareer Might not Working Properly
Learn More

Cloud Operations Engineer (AWS)

Salary undisclosed

Apply on


Original
Simplified

Primary Responsibilities

  • System Operations and Performance: Oversee daily operational tasks to ensure optimal system performance, uptime, and reliability. Develop and implement strategies to maintain business continuity and disaster recovery plans.
  • System Health Monitoring: Perform daily system health checks, monitor system performance metrics, and proactively identify and report incidents before they escalate.
  • Support for New Initiatives: Lead the support for new operations projects, including system integrations, acceptance testing, performance testing, and performance management.
  • Technical Incident Resolution: Collaborate with internal and external stakeholders to troubleshoot and resolve technical incidents and service requests efficiently, ensuring compliance with SLA requirements.
  • Onboarding and Data Support: Assist with onboarding new systems, applications, and stakeholders, focusing on data preparation, transfers, and provisioning to ensure a smooth process.
  • Security and Platform Support: Provide timely support for application/platform security incidents by coordinating with internal teams and vendors to resolve issues swiftly.
  • Asset Management: Track and maintain an accurate IT asset inventory to ensure all assets are accounted for and aligned with business requirements.
  • Documentation and Compliance: Ensure all Standard Operating Procedures (SOPs) are documented, up-to-date, and in compliance with audit and regulatory requirements.

What I Am Looking For

  • Educational Background: A Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related discipline.
  • Relevant Experience: At least 3 years of experience in AWS Cloud, IT operations, and vendor management.
  • Cloud Expertise: Hands-on experience with cloud-based services such as AWS, Azure, or Government Commercial Cloud (GCC). AWS Certification is a strong advantage.
  • Security Knowledge: Proficiency in implementing security and access control measures, especially in controlling privileged access to test and production environments.
  • Networking Proficiency: Familiarity with networking technologies, including WAN, LAN, firewalls, load balancers, VPNs, and DNS. Ability to troubleshoot connectivity and data transfer issues.
  • Incident Management Skills: Proven experience in incident, problem, and change management processes. Ability to quickly diagnose issues and ensure timely resolutions.
  • Data Analytics Systems: Experience working with data analytics systems is highly desirable and will be a major plus.
  • Disaster Recovery Knowledge: Strong understanding of disaster recovery, system backups, and restore procedures.
  • Technical Skills: Programming or scripting experience, with proficiency in Python, R, or Shell scripting preferred.
  • Certifications: AWS SysOps Administrator certification is a plus but not mandatory.
  • Soft Skills: A proactive, dedicated individual who excels at multitasking, working collaboratively with cross-functional teams, and is committed to driving operational excellence.

Click on Apply now to find out more about this opportunity and other available positions.

EA License: 22C1396

EA Personnel: R1551466