Site Reliability Engineer

  • Dublin, Ireland
  • Full-time

Company Description

Job Description

We have very exciting opportunities for people to shape and influence the new Watson Health division of IBM. We're bringing together the best of IBM's capabilities to re-imagine healthcare and wellness on a global scale. Alongside an Ecosystem of partners and clients we are developing solutions that fundamentally change the identification, treatment and prevention of illness. Our work will improve the health and well-being of people across the globe.
Site Reliability Engineers are hybrid systems and software engineers who are responsible and take ownership for scaling and automation of production systems. We need you for a green field project to deliver health care products into the Cloud. We need you to design, develop and deliver our new Cloud environments; development, test, production, sandbox, partners, etc.
You will have and full system view, you will be collaborating with stakeholders, you will be creating scripts, you will be triaging and troubleshooting issues. If you enjoy a varied role, have a natural interest in technology and Development Operations this role is for you.

• Participate in technical proof of concepts from conception to delivery.
• Be hungry to get involved in every part of our system — from the earliest stage of product architecture, design and development to deployment, troubleshooting, and performance analysis – to ensure a reliable quality product in production.
• Be able to collaborate and communicate clearly on status and progress.
• Design and build tools to manage a rapidly growing number of servers and services
• Do what must be done in order to keep critical systems operating
• Perform general OS, Web/Application server, database configuration, installs, automation,
• Participate in periodic on-call rotation
• Have Scripting experience in 1 or more of the following languages: Python, Ruby, Perl.
• In-depth understanding of web application models and key components, including the HTTP.
• Experience of OpenStack Heat, Urban Code Deploy, Chef, Jenkins, Java.
• Experience in a similar role or project.
• Proficiency in any of the following DevOps areas: ELK, Spluk, Collect D, Graphite.
• Proven and relatable troubleshooting and triage skills across systems.



See job description

Additional Information