Principal Artificial Intelligence (AI) Software Engineer
Redmond, Washington
Employer: Microsoft
Industry: Software Engineering
Salary: $133600 per year
Job type: Full-Time
Microsoft Azure Artificial Intelligence High Performance Computing team is looking for a systems engineers to enable customers in deploying, monitoring, profiling, and debugging their applications on hyperscale cloud infrastructure. Azure is enabling the largest supercomputing deployments to tackle complex computational problems in public cloud, evident from the various High Performance Computing (HPC) Stock Keeping Units (SKU) that have already made the mark on Top500, MLPerf and Graph500 rankings.
We are looking for a Principal Artificial Intelligence (AI) Software Engineer who would also bring to the table establishing best practices drive architectural changes and influence roadmap of relevant software and hardware components. Your work will directly impact business goals of a wide range of users and facilitate the next wave of growth and innovation in Artificial Intelligence, and High Performance Computing (HPC) in the cloud in general. We are looking for a Principal Artificial Intelligence Software Engineer who is committed to quality, wants the customer to succeed and get things done. You will join a phenomenal team of dedicated engineers and researchers with deep experience in high performance computing, machine learning, deep learning, middleware, and software engineering.
At this supercomputing scale, we need specialized tools and techniques to maintain the reliability, runtime performance, health of the system and running jobs continuing to meet the Service Level Agreements of users. Your job would be to use the state-of-the-art tools and techniques, find operational gaps and instrument features to achieve the smooth operation of cloud-native supercomputers.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities:
Qualifications:
Required Qualifications:
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#azurecorejobs
We are looking for a Principal Artificial Intelligence (AI) Software Engineer who would also bring to the table establishing best practices drive architectural changes and influence roadmap of relevant software and hardware components. Your work will directly impact business goals of a wide range of users and facilitate the next wave of growth and innovation in Artificial Intelligence, and High Performance Computing (HPC) in the cloud in general. We are looking for a Principal Artificial Intelligence Software Engineer who is committed to quality, wants the customer to succeed and get things done. You will join a phenomenal team of dedicated engineers and researchers with deep experience in high performance computing, machine learning, deep learning, middleware, and software engineering.
At this supercomputing scale, we need specialized tools and techniques to maintain the reliability, runtime performance, health of the system and running jobs continuing to meet the Service Level Agreements of users. Your job would be to use the state-of-the-art tools and techniques, find operational gaps and instrument features to achieve the smooth operation of cloud-native supercomputers.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities:
- Design and code solutions that improve the management of remote systems.
- Leads by example within the team by producing extensible and maintainable.
- Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices.
- Holds accountability as a Designated Responsible Individual (DRI) and mentors other engineers across products/solutions, working on call to monitor system/product/service for degradation, downtime, or interruptions.
- Alerts stakeholders as to status and initiates actions to restore system/product/service for complex issues.
- Develops a playbook for the team to resolve issues.
- Coordinates people and resources to ensure DRI responsibilities are covered across teams.
- Keep infrastructure services running and deliver code updates on a regular cadence to improve performance and reliability.
- Maintains communication with key partners across the Microsoft ecosystem of engineers.
- Acts as a key contact for leadership to ensure alignment with partners' expectations. Considers partner teams across own organization and their end goals for products to drive and achieve desirable user experiences and fitting dynamic needs of partners/customers through product development.
- Dedicated to the mission to help ensure Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads.
- Help build a test-driven engineering culture to reduce regressions and bugs in production and will set a higher bar for infrastructure quality.
Qualifications:
Required Qualifications:
- Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python
- OR equivalent experience.
- 5+ years of experience in Developing and Running Artificial Intelligence (AI) or High Performance Computing (HPC) applications on clusters or related
- 5+ years of experience with AI software
- 2+ years of experience in tuning performance of AI or HPC applications
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Bachelor's Degree in Computer Science or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python
- OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python
- OR equivalent experience.
- Hands-on knowledge in Compute Unified Device Architecture (CUDA), C++ Heterogenous-Compute Interface for Portability) HIP, or related parallel programming domains
- Exposure to operational challenges of running Artificial Intelligence/High Performance Computing (HPC) (Availability, Fault Tolerance) and Mitigation Mechanisms
- Experience with running and troubleshooting Artificial Intelligence/Machine Learning workloads on clusters
- Exposure to Cloud Computing, Virtualization and Container Technologies
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#azurecorejobs
Created: 2024-08-22
Reference: 1687045
Country: United States
State: Washington
City: Redmond
Similar jobs:
-
Senior Software Engineer - CTJ - Poly
Microsoft in Redmond, Washington💸 $117200 per year -
Software Development Engineer III, AFT Inbound, Inbound
Amazon in Bellevue, Washington💸 $151300 per year -
Software Development Engineer, AWS Managed Services (AMS)
Amazon in Seattle, Washington💸 $129300 per year -
Software Engineer II
Microsoft in Redmond, Washington💸 $193200 per year -
Software Dev Engineer, Applied AI
Amazon in Bellevue, Washington💸 $129300 per year -
Software Engineer 2 - M365 Core
Microsoft in Redmond, Washington💸 $193200 per year -
Software Engineer II
Microsoft in Redmond, Washington💸 $193200 per year -
Software Development Engineer, AGI Data Services
Amazon in Bellevue, Washington💸 $129300 per year -
Senior Software Engineer
Microsoft in Redmond, Washington💸 $117200 per year -
Senior Software Engineer
Microsoft in Redmond, Washington💸 $117200 per year -
Software Dev Engineer, L5
Amazon in Seattle, Washington💸 $129300 per year -
Software Development Engineer, WorkSpaces Clients
Amazon in Bellevue, Washington💸 $129300 per year -
Principal, Software Engineering
Slalom Consulting in Seattle, Washington -
Senior Software Engineer
Microsoft in Redmond, Washington💸 $117200 per year -
Software Dev Engineer, Amazon Seller Fulfillment Tech
Amazon in Seattle, Washington💸 $129300 per year -
Principal Software Engineer
Microsoft in Redmond, Washington💸 $137600 per year -
Software Development Engineer , AWS, Training and Certifications
Amazon in Seattle, Washington💸 $129300 per year -
Software Development Engineer, Fintech
Amazon in Bellevue, Washington💸 $129300 per year -
Software Dev Engineer II, Advertising Data Management
Amazon in Seattle, Washington💸 $129300 per year -
Software Engineer II, Privacy and Identity Experiences
Amazon in Seattle, Washington💸 $129300 per year