Fleet Debugger
United States, Washington, Redmond
Overview

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses, including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform, globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide, and we are looking for a Fleet Debugger to help achieve that mission.

As Microsoft's cloud business continues to grow, the ability to deploy new offerings and hardware infrastructure on time, at scale, with quality and cost efficiency is critical. To support this, the Hardware, Infrastructure Management, and Fundamentals Engineering (HIFE) team defines and delivers operational success metrics for hardware manufacturing, enhancing planning, quality, delivery, scalability, and sustainability across Microsoft cloud hardware.

We are hiring a Fleet Debugger who brings customer-focused thinking, technical insight, and industry knowledge to envision and implement future solutions that manage and optimize cloud infrastructure.
Responsibilities

- Execute system-level, end-to-end debug solutions for at-scale datacenter systems.
- Lead collaboration projects with hardware, firmware, and software teams that drive root cause analysis.
- Be accountable for the successful execution of targeted system-level root cause analysis and defect reduction projects.
- Contribute technical recommendations on diagnostics or debug deployment technologies.
- Facilitate the resolution of complex technical and business challenges, drawing on team expertise and shared understanding.
- Develop scalable debug methodologies, test strategies, and routines for datacenter environments.
- Address issues related to mission-critical services and implement automation to improve debug processes.
- Communicate effectively with partners and stakeholders, using data to support planning and progress updates.