Epicareer Might not Working Properly
Learn More

Senior SRE/高级站点可靠性工程师 (Kubernetes)

Salary undisclosed

Apply on


Original
Simplified
My client is a forward-thinking technology company that supports a portfolio of high-performance, globally recognized platforms. As a Senior Site Reliability Engineer, your expertise in building and managing Kubernetes clusters will be crucial to ensuring that the systems are resilient, scalable, and secure.

  • Up to $11,000 X 12 months (can stretch a little) + RSUs
  • If you are overseas, this role is open to you relocating to Singapore

Job Responsibilities

Your role involves:

  • Dive deep into the architecture and mechanics of our distributed applications, ensuring product scalability, stability, and performance.
  • Build and manage Kubernetes clusters to orchestrate containerized applications across multiple nodes, ensuring seamless deployment and scaling.
  • Manage and maintain middleware, big-data applications, and services critical to the infrastructure’s operation.
  • Perform regular and ad-hoc deployments, server performance fine-tuning, and troubleshooting.
  • Design and implement automation workflows to enhance operational efficiency.
  • Oversee capacity and resource management to ensure optimal system performance.
  • Conduct full-chain stress testing to identify and eliminate application redundancies.

Job Requirements

As a successful candidate, you should have:

  • At least 4 years of experience in Site Reliability Engineering or equivalent roles.
  • Proven experience in building and managing Kubernetes clusters.
  • Ideally come from Technology firms or Technology consulting firms.
  • Experience in Linux operating systems (Ubuntu, CentOS, etc.).
  • In-depth knowledge of computer networks (TCP/IP, DNS, etc.) and operating systems.
  • Familiarity with automation tools (e.g., Ansible, Jenkins) and monitoring tools (e.g., Prometheus, Grafana) is highly desirable.

Why Join Them

  • Work on innovative projects that have a direct impact on a global scale.
  • Join a newly established SRE team with opportunities for leadership and influence.
  • Collaborate with top-tier engineers in a challenging and rewarding environment.
  • Competitive compensation package with relocation support for international candidates.
  • 可达 $11,000 X 12 个月(可以适当增加)+ RSUs (限制性股票单位)
  • 如果您在海外,这个职位开放给您搬迁到新加坡

我的客户是一家前瞻性的科技公司,支持一系列高性能、全球公认的平台。作为高级站点可靠性工程师,您在构建和管理Kubernetes集群方面的专业知识对于确保系统的弹性、可扩展性和安全性至关重要。

职位职责:

您的职责包括:

  • 深入了解我们分布式应用程序的架构和机制,确保产品的可扩展性、稳定性和性能。
  • 构建和管理Kubernetes集群,以编排跨多个节点的容器化应用程序,确保无缝部署和扩展。
  • 管理和维护中间件、大数据应用程序以及对基础设施运行至关重要的服务。
  • 执行定期和临时的部署、服务器性能调优和故障排除。
  • 设计并实施自动化工作流程,以提高运营效率。
  • 监督容量和资源管理,以确保系统性能的最优化。
  • 进行全链条压力测试,识别并消除应用程序冗余。

职位要求:

作为成功的候选人,您应具备:

  • 至少4年的站点可靠性工程或同等职位的经验。
  • 在构建和管理Kubernetes集群方面的实际经验。
  • 理想情况下,来自科技公司或科技咨询公司。
  • 有Linux操作系统(如Ubuntu、CentOS等)的使用经验。
  • 对计算机网络(如TCP/IP、DNS等)和操作系统有深入的了解。
  • 熟悉自动化工具(如Ansible、Jenkins)和监控工具(如Prometheus、Grafana)是非常理想的。

为什么加入他们:

  • 参与具有全球影响力的创新项目。
  • 加入一个新成立的SRE团队,拥有领导和影响的机会。
  • 与顶级工程师合作,在充满挑战和回报的环境中工作。
  • 提供具有竞争力的薪酬待遇,并为国际候选人提供搬迁支持

Do note that we will only be in touch if your application is shortlisted.

Robert Walters (Singapore) Pte Ltd

ROC No.: 199706961E | EA Licence No.: 03C5451

EA Registration No.: R1766249 Josh Lim