We are the Azure Reliability team. We are a multidisciplinary engineering organization tasked with leading reliability holistically across the Azure platform - our goal is to Make Azure the World's Safest and Most Reliable Cloud.
For the most important Azure services and products, Azure Reliability adopts the Site Reliability Engineering (SRE) approach, where skilled teams of software engineers collaborate closely with product development teams to improve the availability, reliability, observability, and operability of our planet-scale distributed systems.
Azure SRE teams strive to improve reliability fundamentals via software engineering, preferring long-lasting platform improvements delivered as engineering projects over repetitive manual operations. We contribute to the product fundamentals and architecture, share knowledge, and code, and prefer reuse over re-invention, always looking for ways to make what we build useful to multiple teams and products.
We know that the SRE discipline is evolving; we learn from our peers in industry and aim to contribute to this evolution by innovating on SRE within our group and sharing those innovations in public.
Our people have a wide variety of professional experiences, and we are interested to meet both candidates with traditional engineering backgrounds and those without. Some of us are industry veterans, while others joined quite recently. Together we form a varied and talented team, and we want to continue building our diversity with our new hires. We strongly believe that diversity and an environment where everyone can feel safe to contribute their own insights is the key to making the best workplace possible. We know that the best workplace makes the best products and services: not only is it the smart thing to do, but it is also the right thing.
We are not looking for people who know it all, we are looking for people who want to learn it all.
We value the input of people who aren't afraid to learn all the time and embrace mistakes as they continuously improve both our services and themselves. If you are excited by this type of challenge and you love to work in groups of people who are similarly excited: come join us!
Billions of users across the world rely on our products, and to meet this demand we design and implement world-class distributed systems.
As a Software Engineer in one of our Azure SRE teams, you will be responsible for improving the reliability of key Azure products.
The Azure SRE key focus areas are:
* Defining our systems' reliability goals via Service Level Objectives (SLOs).
* Improving our systems' production posture via targeted observability and operability enhancements (telemetry, alerting, incident management, change management, safe production changes).
* Building reusable automation to empower multiple teams to achieve their reliability goals.
* Influencing the product architecture and roadmap to make sure the customer-experienced reliability is always a key consideration when evolving the product.
We are looking for engineers passionate about the above areas who are also interested in:
* Providing technical leadership for engineers across multiple teams within Azure.
* Mentoring engineers on SRE principles, practices, and tools.
* 7+ years of software development experience in online services.
* 7+ years of experience using programming languages such as C, Java, Python, Go, PowerShell or Bash.
* Experience working with large-scale distributed systems (e.g., cloud computing providers, SaaS services, etc., ideally with millions or billions of users) or similarly complex environments.
* Experience starting and coordinating complex projects across multiple teams.
* Experience setting technical direction for engineering teams.
Preferred Qualifications
* Sc., M.Sc. or Ph.D. in Computer Engineering, Computer Science or related fields.
* Experience working on large and unfamiliar codebases (millions of lines of code)
* Experience as a technical lead or engineering manager.
* .NET experience (only for some teams).
* Awareness of, and ability to reason about, modern distributed software design patterns and cloud systems architecture, including microservices, containers, load-balancing, queuing, caching.
AZCXP AzRelJobs
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Redmond, WA
Microsoft Corporation develops, licenses, and supports software, services, devices, and solutions worldwide. Its company’s Productivity and Business Processes segment offers Office 365 commercial products and services, such as Office, Exchange, SharePoint, Skype for Business, Microsoft Teams, and related Client Access Licenses (CALs); Office 365 consumer services, including Skype, Outlook.com, and OneDrive; LinkedIn online professional network; and Dynamics business solutions comprising financial management, enterprise resource planning, customer relationship management, supply chain management, and analytics applications for small and medium businesses, large organizations, and divisions of enterprises.
The company’s Intelligent Cloud segment licenses server products and cloud services, such as SQL Server, Windows Server, Visual Studio, System Center, and related CALs, as well as Azure, a cloud platform; and enterprise services, including premier support and Microsoft consulting services to assist customers in developing, deploying, and managing Microsoft server and desktop solutions, as well as provides training and certification to developers and IT professionals.
Its More Personal Computing segment offers Windows OEM, volume, and other non-volume licensing of the Windows operating system; patent licensing, Windows Internet of Things, and MSN display advertising; Surface, PC accessories, and other devices; Xbox hardware and software and services; and Bing and Bing Ads search advertising. It markets its products through original equipment manufacturers, distributors, and resellers; and online and Microsoft retail stores.
Microsoft Corporation has collaboration with E.ON, NIIT Technologies Ltd., CUNA Mutual Group, and Mastercard Incorporated; strategic alliance with Nielsen Holdings plc and PAREXEL International Corp.; and a strategic partnership with SK Telecom Co., Ltd. The company was founded in 1975 and is headquartered in Redmond, Washington.