IBM

Site Reliability and Automation Engineer

Posted on: 14 Apr 2021

Austin, TX

Job Description

Introduction
Software Developers at IBM are the backbone of our strategic initiatives to design, code, test, and provide industry-leading solutions that make the world run today: planes and trains take off on time, bank transactions complete in the blink of an eye, and the world remains safe because of the work our software developers do. Whether you are working on projects internally or for a client, software development is critical to the success of IBM and our clients worldwide. At IBM, you will use the latest software development tools, techniques, and approaches and work with leading minds in the industry to build solutions you can be proud of.

Your Role and Responsibilities

Are you passionate about technology? Do you love building new things? Do you want to develop the future of IBM's Cloud offerings? If you answered YES, then we have the right opportunity for you!

The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.

We are looking for a dynamic Site Reliability and Automation Engineer who is responsive to market needs to join our Cloud Operations Team and deliver value to our clients in a fast-changing cloud landscape. The Cloud team is dedicated to ensuring the IBM Cloud is at the forefront of cloud technology, from data center design to network architecture to storage and compute clusters to flexible infrastructure services. We are building and operating IBM's next-generation cloud platform to deliver performance and predictability for our customers' most demanding workloads, at global scale and with leading efficiency, resiliency, and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.

In this Site Reliability and Automation Engineer role, you will work closely with the Data Center, the entire Cloud development organization and IBM vendors to support, maintain and operationally improve the cloud infrastructure. Your focus will be the following key responsibilities:
* Automate health monitoring of the production and test systems

* Automate return to service procedures for Cloud Platform Components

* Support the compliance and security integrity of the environment through your work

* Partner with other teams, functional managers and program managers to deliver mission-critical services to the market

* Support development of new and existing capabilities for our compute, storage and network services

* Integrate automation with operational requirements

* Work with Engineering to:

  * Define operational requirements

  * Automate operational requirements

  * Participate in the full deployment pipeline

* Work with Support and Development to:

  * Identify and resolve issues

  * Discuss and plan integration requirements

Shift: 8am to 4pm

Required Technical and Professional Expertise

* Minimum of 5 years of hands-on experience in production administration of large system environments, including virtual platforms

* 5+ years of experience in data center infrastructure or relevant work experience, including large-scale infrastructure design, engineering, and support; IT change, incident, problem, and asset management; and infrastructure engineering, with a proven record of delivering high-quality, large-scale solutions. Experience designing architectures for scale and performance

* Must be proficient in writing, debugging, and maintaining scripts (Bash and Python)

* Ability to do low-level debugging and problem analysis by examining logs and running Unix commands

* 2-3 years of extensive experience with open-source products

* 3-5 years of experience with configuration management systems (Ansible / Chef)

* Hands-on knowledge of Splunk or ELK

* Working knowledge of network and storage technologies

* Working knowledge of ServiceNow, JIRA, Confluence, and GitHub

Preferred Technical and Professional Expertise

* 2+ years of experience with Kubernetes

* 4+ years of experience with GitHub, Perl and Python

* 5+ years of experience with configuration management systems (SaltStack/Ansible/Chef)

* 8+ years of experience in virtualization environments such as AWS/SoftLayer/Xen/VMware

IBM

Armonk, New York

International Business Machines Corporation operates as an integrated technology and services company worldwide. Its Cognitive Solutions segment offers a portfolio of enterprise artificial intelligence platforms, such as analytics and data management platforms, cloud data services, talent management, and industry solutions primarily under the Watson Platform, Watson Health, and Watson Internet of Things names. This segment also offers transaction processing software for use in banking, airlines, and retail industries.

The company’s Global Business Services segment offers business consulting services; delivers system integration, application management, maintenance, and support services for packaged software applications; and finance, procurement, talent and engagement, and industry-specific business process outsourcing services. Its Technology Services & Cloud Platforms segment provides project, managed, outsourcing, and cloud-delivered services for enterprise IT infrastructure environments; technical support, and software and solution support services; and integration software solutions. The company’s Systems segment offers servers for businesses, cloud service providers, and scientific computing organizations; data storage products and solutions; and z/OS, an enterprise operating system.

Its Global Financing segment provides lease, installment payment plans, and loan financing services; short-term working capital financing to suppliers, distributors, and resellers; and remanufacturing and remarketing services. International Business Machines Corporation serves financial services institutions, airlines, manufacturers, and consumer goods and retail companies. The company was formerly known as Computing-Tabulating-Recording Co. and changed its name to International Business Machines Corporation in 1924. The company was incorporated in 1911 and is headquartered in Armonk, New York.

  • Industry
    Information Technology
  • No. of Employees
    350,600
  • Jobs Posted
    4684
