Job Listings

11,836 jobs

Why AI Match requires a free account

AI Match goes far beyond keyword search — it reads your resume, learns your preferences, and ranks every job by how well it fits you. That requires a personal profile we can reference on every search.

Resume-based ranking

Paste your resume once and every listing is scored against your actual experience and skills — not just a keyword.

Salary & location filters

Set hard cutoffs for minimum pay and preferred location so only genuinely relevant roles surface in your feed.

Boost & block keywords

Promote jobs that mention your niche technologies and hide anything containing terms you want to avoid.

Role Details

Overview

The Microsoft Azure Host Networking and Hardware Acceleration team is responsible for building a high-performance and reliable network in the cloud. Our organization develops and maintains the network layers between physical infrastructure and virtual machines. Current and future development focuses on Linux platforms and hardware design. We are seeking a Senior Software Engineer to design and develop innovative hyperscale observability tools and platforms that help diagnose complex software-defined networking issues.In this role, you will build out a globally distributed OpenTelemetry data infrastructure that will process >= 1 billion events per minute for all Azure regions. This data pipeline will be used to all for near real-time attribution of packet loss in the Azure backend networks. Expertise should Include: understanding the fundamentals of networking and how packets traverse networks and knowledge in a distributed systems platform such as Kubernetes or Service Fabric. You will also build advanced tools for network debugging, mentor team members, interact with customers, and collaborate across organizations.This position requires curiosity about system internals, effective communication skills, and experience with low-level networking technologies. This opportunity provides exposure to large-scale distributed systems and cutting-edge networking solutions, enabling you to make a significant impact on Azure’s reliability and performance while advancing your technical knowledge. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond!

Responsibilities

  • Partners with appropriate stakeholders to determine user requirements for observability improvements.
  • Contribute code to existing automation and monitoring frameworks.
  • Build and deploy net new tools across production datacenters.
  • Take ownership and drive mission critical customer escalations.
  • Mentor and teach engineers across Azure to improve visibility, use of tools to diagnose, and scale learnings through improved documentation and training.
  • Embody our culture and values

Qualifications

Required Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Rust, GO, TypeScript
  • OR equivalent experience.
  • 1+ years experience working with systems observability.

Other Requirements

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Rust, GO, TypeScript
  • OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Rust, GO, TypeScript
  • OR equivalent experience.
  • 1+ years experience with systems programming, distributed system, CI/CD, data pipelining, shipping products or services.
  • 1+ years experience working with multiple partner teams and external vendors.

#azurecorejobs

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Senior Data Center Technician

Microsoft

Palmetto, Ga,Us, USA today
Role Details

Overview

As a Microsoft Senior Data Center Technician (SDCT) (Locations: Palmetto & Lithia Springs, GA), you will demonstrate expertise in standard processes and procedures for preparing, installing, performing diagnostics, troubleshooting, replacing, and/or decommissioning equipment, as well as a holistic understanding of the functions of, and interactions between, network and server equipment. You will provide input for suggested modifications to these processes and procedures as needed to improve service quality and efficiency while providing guidance to other technicians.

Microsoft’s Cloud Operations & Innovation (CO+I) is the engine that powers our cloud services. As a CO+I DCT, you will perform a key role in delivering the core infrastructure and foundational technologies for Microsoft's online services including Bing, Office 365, Xbox, OneDrive, and the Microsoft Azure platform. As a group, CO+I is focused on the personal and professional development for all employees and offers trainings and growth opportunities including Career Rotation Programs, Diversity & Inclusion trainings and events, and professional certifications.

Our infrastructure is comprised of a large global portfolio of more than 200 Data Centers in 32 countries and millions of servers. Our foundation is built upon and managed by a team of subject matter experts working to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide.

With environmental sustainability and optimization at the forefront of our data center design and operations, we continue to grow and evolve as we meet the ever-changing business demands that hold Microsoft as a world-class cloud provider.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

In this environment, you may be asked to go on business travel to support other metros periodically, 0-25% of the time and have the ability to work 12-hour shifts, including shift assignments during non-standard business hours that may include evening, nighttime, weekends and/or holidays.

Responsibilities

Datacenter Operations

  • Demonstrates expertise in and participates in providing guidance on standard processes and procedures for preparing, installing, performing diagnostics, troubleshooting, replacing, and/or decommissioning IT datacenter technology(ies) and equipment while proactively prioritizing security considerations. Maintains expertise with the functions of, and participates in providing guidance on the interactions between cabling infrastructure, network, server, and storage equipment.

  • Provides guidance to other technicians on changes to processes and procedures. Reviews Process Change Notifications (PCNs) and proactively shares relevant information to enable efficient workflow and evaluate impact on work execution to identify and mitigate impact changes to their area of work will have on others. Asks questions when they do not have required information, resolves others' questions in a timely manner, and may provide feedback on changes to processes to direct-line management. Coaches others on incorporating security requirements in feedback and process changes, ensuring compliance with industry standards and best practices.

  • Completes assigned tickets efficiently and in alignment with relevant Key Performance Indicators (KPIs) per task type independently. Provides guidance and/or assists less experienced technicians with complex tickets or tasks, and appropriately escalates challenging or complex tickets to internal business partners as needed. May coordinate and assign tasks to other technicians on temporary basis, (i.e. in absence of direct-line manager availability and/or ticket assignment automation) providing direction as needed, to ensure work is appropriately allocated to meet Key Performance Indicators (KPIs) per task type.

  • Complies with Data Center business unit policies, procedures, and deadlines with guidance from experienced technicians and/or direct-line management, maintaining audit readiness. Escalates issues related to compliance or operational assurance activities to direct-line management. Identifies potential vulnerabilities in operational security, escalates promptly, and implements best practices.

Datacenter Work Environment

  • Reinforces a positive and effective team environment by sharing information and best practices with other shifts/technician teams, assisting as needed in cross-discipline collaborations, staying apprised of the status of others' work to respond to questions regarding work-window adjustments, internal business partner inquires, and other team members, while partnering with other shifts of technician teams to complete smooth transition and effective handover of ticketed work. Provides, responds to, and encourages providing feedback regarding on ways to work more effectively or enhance efficiency within their team and adopts best practices shared within-and-across shifts or technician teams. Proactively shares security-related learnings, resources, and best practices across teams.

  • May conduct, assists in conducting, and/or participates in daily safety briefings. Completes required Environmental & Health Safety (EHS) training, provides other technicians with guidance to comply with safety procedures (e.g., equipment use, lifting, electrical hazards, ladder/rolling stair use), completes required Task Hazard Analysis (THAs), and uses appropriate equipment and Personal Protective Equipment (PPE) for assigned tasks. Adheres to and promotes a culture of safety, taking proactive action to alert others of safety concerns, near-misses, and/or incidents. Participates in the regular cadence of proactive safety observation reporting processes and systems

  • Completes required security and data management training while complying with security and data management procedures/policies with guidance from direct-line management. May escort third party vendors or IT support on-premises at data centers or network sites, enforcing adherence to security requirements for third-party access and operations. Appropriately takes action and reports physical security access concerns and/or incidents to direct-line management or via established reporting methods.

  • As indicated above, this role has a travel requirement of up to 25% which means you may be required to travel, from time to time, as part of this role.

  • As indicated above, have the ability to work 12-hour shifts, including shift assignments during non-standard business hours that may include evening, nighttime, weekends and/or holidays.

Managing Service

  • Completes required training aligned to job focus areas and workloads (i.e., Break fix, Deployment, Simple Change, Decommission, IT Critical Environment) in a timely manner per direct-line management assignment(s). Actively mentors and supports training of other technicians through on-the-job training (OJT) and by providing direct guidance on specific job focus areas and workloads (i.e., Break fix, Deployment, Simple Change, Decommission, IT Critical Environment). May complete additional or supplemental training to obtain or maintain relevant industry or technical certifications.

  • Maintains an awareness of Key Performance Indicators (KPIs) through reporting dashboards, systems, for personal monthly performance discussions as needed.

  • Escalates and/or seeks guidance from experienced technicians or direct-line management regarding client interactions.

  • Develops a comprehensive understanding of cross-functional Data Center processes to support partnerships with internal and external stakeholders.

Physical Requirements

  • Applies to but is not limited to US-based Data Center roles: Occasional climbing of ladders. Frequent climbing of stairs and/or ramps. Prolonged standing. Occasional lifting 50lbs / 22.5kg. Occasional push or pull 50-75 lbs / 22.5-34kg. with assistive device. Normal visual acuity (near, far and peripheral with correction), defined via standard medical terms and applicable criteria. Normal color vision for electrical work, defined via standard medical terms and applicable criteria.

Service Delivery

  • Provides guidance to less experienced technicians for, and develops/employs own effective execution order strategy(ies) for assigned tasks. Prepares, stages, sets up, and performs basic startups and shutdowns for hardware (e.g., racks, hard drives, switches) according to specific written instructions provided via checklists, guides, standard processes, emails, while providing guidance to less experienced technicians, with direction from management where applicable. May perform tasks in tandem with other technicians to comply with procedures and safety requirements. Ensures all tasks are executed and aligned with security principles, configurations, and controls.

  • Follows procedures to immediately communicate, report, and escalate data center technical, safety, or security related incidents to direct-line management. Participates in bridge calls to provide details on incident status and executes on-site follow-up actions as directed if necessary. May leverage learnings to contribute to the improvement of quality of service and support.

  • Applies advanced diagnostics and troubleshooting expertise and/or leverages standard procedures to quickly and efficiently identify the cause(s) of technical issues and replace faulty components in network, storage, or server equipment with zero-to-minimal customer and/or business disruption. Maintains advanced awareness of conditions, circumstances, and scenarios which may reflect significant custo

Role Details

Overview

The Xbox Video team is looking for a Senior Software Engineer who is passionate about building high-quality video experiences on cutting-edge hardware and software platforms. Our team is responsible for delivering the video technologies that power Xbox experiences used by missions around the world.

In this role, you'll work across the Windows platform and Xbox video stack, helping build and optimize the APIs and system-level components that interact directly with advanced graphics and video hardware. You'll contribute to technologies such as video encoding and decoding, color processing, motion estimation, and rate control to ensure smooth, high-performance playback across devices.

You'll be part of a collaborative and experienced engineering team that works closely with internal and external partners to solve complex technical challenges across the Xbox ecosystem. While prior experience with media or video technologies is a plus, you'll be successful in this role if you enjoy working close to the system - across operating systems, APIs, and platform-level code - to deliver impactful user experiences.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

  • Collaborates with appropriate stakeholders to determine user requirements for a scenario.
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform.
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
  • Embody our culture and values.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C or C++,
  • OR equivalent experience.

Other Requirements: Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • 3+ years of experience contributing to operating systems, application programming interface (API), or device drivers for consumer PCs or electronics.
  • 2+ years of experience developing software for the Windows platform, including work at the platform, API, or media systems level.
  • Interest and experience in all things video and video hardware.
  • Proficiency in design, coding, debugging, and problem solving skills.

W+DJOBS

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Role Details

Overview

Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified solutions. The Microsoft Security organization accelerates Microsoft’s mission and bold ambitions to ensure that our company and industry is securing digital technology platforms, devices, and clouds in our customers’ heterogeneous environments, as well as ensuring the security of our own internal estate. Our culture is centered on embracing a growth mindset, a theme of inspiring excellence, and encouraging teams and leaders to bring their best each day. In doing so, we create life-changing innovations that impact billions of lives around the world.

The Identity Security Breach Response Squad (IDSEC BRS) is where engineers and security researchers come to work on the most challenging identity security problems at Microsoft. The team operates at the front lines of incident response, partnering closely with investigators and engineers to understand how real attacks unfold and to turn those insights into scalable detection, investigation, and response capabilities. BRS is deeply hands on: team members work directly with large scale identity telemetry, trace complex attack paths, and help shape the tools and systems that responders rely on during high impact security events. If you’re motivated by solving ambiguous problems, learning from real adversary behavior, and seeing your work directly improve how a global platform defends itself, this team offers the opportunity to make a tangible difference at cloud scale.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

MSFTSecurity, #Cybersecurity #EntraID #IdentitySecurity #Identity&NetworkAccess

Responsibilities

  • Build and operate services that help security teams investigate and respond to sophisticated identity-related threats at global scale.
  • Create agentic systems that make security investigations faster, more reliable, and easier to execute under pressure.
  • Design systems that connect signals across large datasets to surface high-confidence findings and actionable remediation steps.
  • Apply AI thoughtfully to assist human decision-making in investigations (with clear safety boundaries, evaluation, and human-in-the-loop controls).
  • Lead architecture and technical direction across multiple workstreams; partner across engineering and security teams to deliver end-to-end outcomes.
  • Drive operational excellence for mission-critical services (reliability, observability, incident response, and safe rollouts).
  • Mentor engineers and model strong engineering fundamentals, security best practices, and inclusive collaboration.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Other Requirements: Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft background and Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • 5+ years of professional software engineering experience building and operating production services.
  • Background in distributed systems (reliability, performance, scalability, and operational excellence).
  • Experience designing secure systems (threat modeling, secure coding practices, and principled access control).
  • Demonstrated technical leadership: driving architecture decisions, aligning stakeholders, and delivering complex projects end-to-end.
  • Experience applying AI to improve workflows in high-consequence environments (with evaluation and safety boundaries).
  • Experience building security, investigation, eviction, detection, or forensics-adjacent tooling.
  • Experience building data-intensive platforms (pipelines, indexing/search, large-scale analytics, or workflow automation). -
  • Experience mentoring engineers and raising the quality bar through design/code reviews and operational practices.
  • Experience designing and shipping agentic AI systems with security-first guardrails.

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Role Details

Overview

CoreAI is at the forefront of Microsoft’s mission to redefine how software is built and experienced. We are responsible for building the foundational platforms, services, programming models, and developer experiences that power the next generation of applications using Generative AI. Our work enables developers and enterprises to harness the full potential of AI to create intelligent, adaptive, and transformative software.

The AI Core Infrastructure team, part of AI Platform team in CoreAI Organization is responsible for large-scale, highly reliable and efficient GPU management infrastructure and the inference and training platforms that power all of Microsoft’s AI workloads, such as M365 CoPilot, Github CoPilot, Microsoft CoPilot, AI Foundry’s Inference and Fine-Tuning offering of OAI and OSS models, and many more.

As a Principal Engineer on the team, you’ll shape the architecture and strategy on how customers monitor, troubleshoot, and scale their AI training workloads. You’ll work across ML infrastructure, distributed systems, and observability to power large-scale pre-training, post-training, and fine-tuning on some of the world’s largest AI supercomputers.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Responsibilities

As the Principal engineer on the team, your responsibilities include:

  • Set the roadmap and drive the execution of the training infrastructure built for AI workloads at a supercomputer scale.

  • Design, develop and ship the backend services that power the AI workloads.

  • Deliver deep insights that empower customers to troubleshoot and optimize their large-scale AI workloads

  • Collaborate closely with engineers, data scientists across Microsoft’s internal research teams building models to shape the infrastructure.

  • Leverage production telemetry to influence next-generation infrastructure design, boosting efficiency, reliability, and performance

  • Mentor and guide engineering teams, elevating technical excellence and championing a customer-focused approach to system design.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field and 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Python or equivalent experience.

Other Requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications in one of those areas:

  • Excellent analytical and problem-solving skills, with the ability to extract customer pain points, synthesize ambiguous requirements, and design clear, scalable solutions.

  • Expertise with distributed observability technologies (e.g., Prometheus, OpenTelemetry, Grafana) and 2+ years of experience designing or scaling telemetry pipelines for high-throughput production systems.

  • Advanced, hands-on experience with production ML systems, large-scale training infrastructure, NCCL, CUDA libraries and tools.

  • 6+ years of experience building or operating distributed systems, with a strong focus on reliability, scalability, and performance.
  • Understanding of Docker, Kubernetes, scalable architectures, and automation for production systems.
  • Passionate and self-motivated. Strong ability in self-learning, entering new domain, managing through uncertainty in an innovative team environment.

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

Software Engineering IC6 - The typical base pay range for this role across the U.S. is USD $163,000 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 - $331,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Role Details

Overview

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day and we need you as a Data Center Critical Environment Technician Manager.

Microsoft’s Cloud Operations & Innovation (CO+I) is the engine that powers our cloud services. As a CO+I CE Technician Manager, you will perform a key role in delivering the core infrastructure and foundational technologies for Microsoft's online services including Bing, Office 365, Xbox, OneDrive, and the Microsoft Azure platform. As a group, CO+I is focused on the personal and professional development for all employees and offers trainings and growth opportunities including Career Rotation Programs, Diversity & Inclusion trainings and events, and professional certifications.

Our infrastructure is comprised of a large global portfolio of more than 200 datacenters in 32 countries and millions of servers. Our foundation is built upon and managed by a team of subject matter experts working to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide.

With environmental sustainability and optimization at the forefront of our datacenter design and operations, we continue to grow and evolve as we meet the ever-changing business demands that hold Microsoft as a world-class cloud provider.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

Responsibilities:

People Management

  • Managers deliver success through empowerment and accountability by modeling, coaching, and caring.
  • Model - Live our culture; Embody our values; Practice our leadership principles.
  • Coach - Define team objectives and outcomes; Enable success across boundaries; Help the team adapt and learn.
  • Care - Attract and retain great people; Know each individual’s capabilities and aspirations; Invest in the growth of others.

Equipment and Systems Operations

  • Serve as an operations specialist one or more major area of operations (e.g., electrical, mechanical, controls, generators, and work on advanced tasks independently.
  • Oversee and coach team with the inspection of critical environment-related facility equipment (e.g., controls, heating, ventilation, and air conditioning [HVAC], mechanical systems), building, and grounds regularly for unsafe or abnormal conditions to develop and analyze trends.
  • Monitor performance of maintenance and operations utilizing telemetry, control systems, and other platforms and is able to identify all alarms.
  • Utilize internal computerized maintenance management system (CMMS) to track all equipment assets and to complete work order requests for maintenance work and generate reporting to identify outstanding and ongoing work orders.
  • Safely and quickly respond to and lead an onsite incident response team for all abnormal conditions that impact operations and coordinate with other critical facilities professionals to perform corrective repairs.
  • Enhances, develops new, or follows preexisting emergency operating procedures (EOPs), methods of procedure (MOPs), and standard operating procedures (SOPs) in relation to incidents.
  • Gathers necessary information and creates incident timelines/data, root-cause analyses, and/or action items following an abnormal condition.

Equipment and Systems Maintenance

  • Guide, oversee, and perform various types of maintenance (e.g., planned, predictive, corrective) and repairs following methods of procedure (MOPs), and standard operating procedures (SOPs) for one or more disciplines and one or more types of equipment (e.g., electrical, mechanical, cooling systems) and escalate when appropriate.
  • Serve as a subject matter expert for one type of equipment and oversee everyday tasks and troubleshooting within their area of expertise
  • Have a hands-on understanding of how equipment works within disciplines they have been trained and how to troubleshoot equipment, systems, subsystems, and components independently within their trained discipline(s).
  • Provide and/or assign team to provide necessary escort to third-party contractors, sub contractors, vendors, and service providers on site for all severity leveled procedures. Coordinate and schedule supplier/vendor on-site activities and recognizes circumstances when to stop supplier work to address potential and/or identified concerns.
  • Take part in getting third-party work underway (e.g., making sure systems are properly energized/deenergized), ensuring the work is started and completed in a safe manner in accordance with standard practices, procedures, federal/local legislation, and municipal codes.
  • Advises junior colleagues on inspection and supervision issues.
  • Provides consultation to lower-level colleagues in troubleshooting systems and problems

Critical Environment Culture

  • Understands, follows, ensures, and coaches team on safety and security requirements (e.g., job hazard assessments [JHAs], toolbox talks), and business processes and procedures to properly perform work in a safe, quality, and reliable manner in accordance with applicable federal, state, local, and Microsoft requirements.
  • Proactively ensures safety and security requirements are followed and met for the work of themselves and others.
  • Maintain safe working conditions and escalate immediately when unsafe working conditions are observed.
  • Assesses and identifies appropriate resources and equipment necessary to fully support environmental health and safety (EH&S) objectives.
  • Participates in required meetings, trainings, and necessary handoffs.

Other

  • Embody our culture and values

Qualifications

Required Qualifications:

  • High School Diploma, GED, or equivalent AND 3+ years mission critical services work/applied learning experience (e.g., high availability assembly/manufacturing/critical infrastructure environments such as data centers, oil and gas refineries, hospitals, pharmaceutical, manufacturing, or related fields)
  • OR equivalent experience.
  • Ability to work shifts, including shift assignments during non-standard business hours that may include evening, nighttime, weekends, and/or holidays

Background Check Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

While not required, we also look for the following Preferred Qualifications:

  • High School Diploma, GED, or equivalent AND 6+ years mission critical services experience (e.g., high-availability assembly/manufacturing/critical infrastructure environments such as data centers, oil and gas refineries, hospitals, pharmaceutical, manufacturing, or related fields)
  • OR Associate's Degree or technical trade certification (e.g., military, trade school), or higher-equivalent education AND 5+ years mission-critical services experience (e.g., high-availability assembly/manufacturing/critical infrastructure environments such as data centers, oil and gas refineries, hospitals, pharmaceutical, manufacturing, or related fields)
  • OR equivalent experience.
  • 1+ year(s) people management experience.
  • 1+ year(s) experience in a specialized area (e.g., mechanical field, electrical field, controls field) or related field.

Critical Environment Ops M3 - The typical base pay range for this role across the U.S. is USD $75,400 - $167,900 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $105,800 - $185,300 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

$100,600 - $199,000

Role Details

Overview

As a Research Engineer in the Power Platform Managed Transformation Team, you’ll play a pivotal role in delivering our mission: enabling organizations to adopt and scale AI agents with measurable business impact, unified governance, and actionable insights. You’ll collaborate with experts across governance, analytics, and platform services to drive innovation in Copilot and Copilot Studio scenarios, helping customers realize the full potential of AI-powered agents.

We are looking for an Research Engineer II to join our team! Join us in shaping the future of AI agents.

This opportunity will allow you to:

  • Accelerate your career growth in applied AI and agentic systems.
  • Develop deep business expertise in enterprise-scale AI adoption.
  • Hone your skills in designing, evaluating, and scaling intelligent agent experiences.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

The team consists of Applied Scientists and Engineers partnering regularly with Product Managers and Designers. Our team works in an agile start-up like environment where we expect each team member to think out of the box to contribute and collaborate towards the mission of the team.

As part of this team, you would get to work on novel experiences & cutting edge technologies across a variety of platforms to build the next generation application development experiences.

  • Conduct applied research to develop innovative AI and machine learning models that address real-world challenges.
  • Lead end-to-end lifecycle of machine learning models, from prototyping and implementation to evaluation, deployment, and monitoring.
  • Create and adapt novel training and fine-tuning algorithms for language models.
  • Bring research projects to successful completion yielding new algorithms, prototypes, theories, tools, methods, analyses, insights, or collections of data which solve one or more open research problems.
  • Operate and support 24X7 usage of AI Models by customers
  • Document and share best practices across the organization.
  • Supports mentorship by assisting with onboarding of research interns or other team members.
  • Stay up-to-date with the latest advancements in AI and machine learning, and apply new techniques to solve complex problems.
  • Embody our Culture and Values

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python  OR
  • Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python - OR
  • Equivalent experience.
  • 1+ years' hands on experience with AI/LLM (Large Language Modeling)

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications: -

  • Experience working on successful applied research projects in industry environments .
  • Experience programming with Python, Typescript, C# (any) is preferred.
  • An applicant with the following skills or relevant experience would be a plus:    -
  • Applications of Foundation Models, Domain Adaptation of Foundation Models (e.g. fine-tuning LLMs/SLMs)
  • Vision, Audio, and Multimodal Foundation Models
  • Agentic Systems
  • Prompt Tuning
  • Natural Language Processing
  • Human-AI Interaction.

#BICJOBS

Applied Sciences IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Role Details

Overview

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers to levels they cannot achieve anywhere else. This is a world of more possibilities, more innovation, more openness in a cloud-enabled world.  The Business & Industry Copilots group is a rapidly growing organization that is responsible for the Microsoft Dynamics 365 suite of products, Power Apps, Power Automate, Dataverse, AI Builder, Microsoft Industry Solution and more. Microsoft is considered one of the leaders in Software as a Service in the world of business applications and this organization is at the heart of how business applications are designed and delivered.

We are looking for a Senior Software Engineer to join our team!  
This is an exciting time to join our group and work on something highly strategic to Microsoft.  This team builds a suite of microservices to get near real-time insights over your data in Microsoft Copilot Studios. You will be a part of a team of engineers who thrive on solving complex problems at scale while doing it with impeccable quality.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

    • Collaborates with appropriate stakeholders to determine user requirements for a scenario.
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform.

  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).

  • Drives creation and conducting of experimentation to determine the effectiveness of changes, monitors developments for prototyping and testing products, and interprets results from experimentation.

  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.

  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.

  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.

  • Embody our culture and values

Qualifications

  • Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or ReactJS OR equivalent experience.

Other Requirements: Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or ReactJS OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or ReactJS OR equivalent experience.

  • Experience creating AI-centric experiences, or familiarity with building AI orchestration systems

  • Proven ability to utilize generative AI, conduct experiments, and evaluate AI effectiveness in product improvements.

  • Experience building scalable applications and services with Azure or other scalable cloud platforms with robust performance, resiliency, telemetry, and security.

#BICJOBS

#MCSJobs

#AgentFlows

#CopilotStudio

#AiAgents

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Role Details

Overview

Azure Specialized collaboratively work to bring the next generation of workloads to our Public Cloud platform. We work together across Microsoft to enable end to end new scenarios for Azure customers. Our team imagines and builds differentiating customer features and fundamental building blocks at the heart of the Azure platform working collaboratively with many industry partners.

We are a highly impactful team with robust growth opportunities. If you are interested in working on the latest areas that will help you develop skills in AI infrastructure, Cloud services, and Security, this is the team you are looking for! We are a small, agile and nimble team in Azure, focused on bringing the state of the art of mission-critical software into Microsoft.

As a Senior Site Reliability Engineeer in Azure Specialized, you will gain valuable experience in service architecture, datacenter networking, monitoring and security as well as working with partner teams. You have the opportunity to work on control and data plane enablement required by Azure Specialized workloads. A primary focus is designing, developing, deploying, and monitoring various product features and infrastructure. This will allow you to develop backend infrastructure supporting diverse services. The work for this position will cross many layers of Azure Services, presenting unique engineering challenges. This role also offers great opportunities to work with many partner teams and gain broad exposure to control plane and data plane technologies end-to-end.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

  • Acts as a Designated Responsible Individual (DRI) working on call to monitor service for degradation, downtime, or interruptions. Alerts stakeholders as to the status and gains approval to restore system/product/service for simple problems. Responds within Service Level Agreement (SLA) timeframe. Escalate issues to appropriate owners.
  • Contributes to efforts to collect, classify, and analyze data with little oversight on a range of metrics (e.g., health of the system, where bugs might be occurring). Contributes to the refinement of product features by escalating findings from analyses to inform decisions regarding the engineering of products.
  • Contributes to the development of automation within production and deployment of a complex product feature. Runs code in simulated, or other non-production environments to confirm functionality and error-free runtime for products with little to no oversight.
  • Contributes to efforts to ensure the correct processes are followed to achieve a high degree of security, privacy, safety, and accessibility. Checks for visible evidence to demonstrate compliance for product areas. Develops and holds an understanding of the implications of onboarding new technologies following expectations of compliance at Microsoft.
  • Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
  • Applies best practices to reliably build code that is based on well-established methods. Follows best practices for product development and scaling to customer requirements and applies best practices for meeting scaling needs and performance expectations.
  • Maintains communication with key partners across the Microsoft ecosystem of engineers. Considers partners across teams and their end goals for products to drive and achieve desirable user experiences and fitting the dynamic needs of partners/customers through product development.
  • Maintains operations of live service as issues arise on a rotational, on-call basis. Implements solutions and mitigations to more complex issues impacting performance or functionality of Live Site service and escalates as necessary. Reviews and writes issues postmortem and shares insights with the team.

Qualifications

Required Qualifications:

  • Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
  • OR equivalent experience.
  • 1+ years experience with support of physical infrastructure.
  • 1+ years experience with GPU and/or Infiniband support.

Other Requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • 7+ years technical experience in software engineering, network engineering,
  • OR systems administration
  • OR Bachelor's Degree in Computer Science, Information Technology,
  • OR related field AND 4+ years technical experience in software engineering, network engineering,
  • OR systems administration
  • OR Master's Degree in Computer Science, Information Technology,
  • OR related field AND 3+ years technical experience in software engineering, network engineering

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

azurecorejobs

Site Reliability Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Role Details

Overview

As a Senior Research Engineer at Microsoft, you will advance Microsoft’s mission to empower every person and every organization to achieve more. You will help build and integrate cutting-edge AI into Microsoft products and services within the Business & Industry Copilot (BIC) group, ensuring solutions are inclusive, ethical, and impactful. This role blends applied research, machine learning engineering, and product innovation. You will lead efforts to ship reliable, production-grade AI systems across the stack, from model development and fine-tuning to performance optimization and deployment.

Mission and Impact

We are in an era of unprecedented AI innovation. As Microsoft leads the way in foundation models, multimodal systems, and AI agents, our goal is to build an open architecture platform where users can interact with tailored AI agents that drive tangible, real-world outcomes. As a Senior Research Engineer, you will:

  • Bridge the gap between state-of-the-art research and customer-facing features
  • Drive systems-level innovation across models, infrastructure, and deployment
  • Champion responsible AI by embedding fairness, safety, privacy, and performance from the ground up

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

Bringing State-of-the-Art Research to Products

  • Design and implement AI systems using foundation models, prompt engineering, retrieval-augmented generation, multi-agent architectures, and classic ML
  • Fine-tune large language models on domain-specific data and evaluate via offline and online methods such as A/B testing, telemetry, and shadow deployments
  • Build and harden prototypes into production-ready services using robust software engineering and MLOps practices
  • Drive original research and thought leadership (whitepapers, internal notes, patents); convert insights into shipped capabilities
  • Research Translation: Continuously review emerging work; identify high-potential methods and adapt them to Microsoft problem spaces

End-to-End System Development

  • ML Design & Architecture: Own end-to-end pipeline from data prep, training, evaluation, deployment, and feedback loops
  • Identify and resolve model quality gaps, latency issues, and scale bottlenecks using PyTorch, or TensorFlow
  • Operate CI/CD and MLOps workflows including model versioning, retraining, evaluation, and monitoring
  • Integrate AI components into Microsoft products in close partnership with engineering and product teams

Data-Driven Innovation

  • Evaluation & Instrumentation: Build robust offline/online evals, experimentation frameworks, and telemetry for model/system performance.
  • Learning Loop Creation: Operationalize continuous learning from user feedback and system signals; close the loop from experimentation to deployment.
  • Experimentation & E2E Validation: Design controlled experiments, analyze results, and drive product/model decisions with data.
  • Develop proofs of concept that validate ideas quickly at realistic scales
  • Curate high-signal datasets, including synthetic and red-team corpora, and establish labeling protocols and data quality checks tied to evaluation KPIs

Cross-Functional Collaboration

  • Partner with software engineers, scientists, designers, and product managers to deliver high-impact AI features
  • Translate research breakthroughs into scalable applications aligned with product priorities
  • Communicate findings and decisions through internal forums, demos, and documentation

Responsible AI & Ethics

  • Identify and mitigate risks related to fairness, privacy, safety, security, hallucination, and data leakage
  • Uphold Microsoft’s Responsible AI principles throughout the lifecycle
  • Contribute to internal policies, auditing practices, and tools for responsible AI

Operating Altitudes

  • Paper level (ideas and math): Read, critique, and adapt the latest research; identify gaps; design methods with clear trade-offs and guarantees; communicate complex ideas clearly. 
    Example: “This objective is brittle under our data regime. Here is a tighter analysis and a revised loss we can test this sprint.”
  • Code level (implementation): Turn ideas into robust, tested, maintainable modules; integrate with CI/CD; profile and optimize for latency and throughput. 
    Example: “Refactored the prototype into a reusable PyTorch component, added unit tests and benchmarks, and cut P95 inference latency by 30%.”

Specialty Technical Areas

  • Large-scale training and fine-tuning of LLMs, vision-language, or multimodal models
  • Multi-agent systems, dialogue agents, and copilots
  • Optimization of inference speed, accuracy, reliability, and cost in production
  • Retrieval systems and hybrid architectures using RAG and vector databases
  • ML for real-world data constraints such as missing data, noisy labels, and class imbalance

Qualifications

Required Qualifications:

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)
  • OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
  • OR equivalent experience.

Other Requirements: Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.

Preferred Qualifications:

  • Master’s degree and 3 or more years in applied ML or AI research and product engineering,
  • OR PhD in a relevant field and 2 or more years with generative AI, LLMs, or related ML algorithms.
  • Experience across the product lifecycle from ideation to shipping.
  • Proficiency in Python and at least one deep learning framework such as PyTorch, JAX, or TensorFlow
  • Experience deploying Fine Tuned LLMs or multimodal models in live production environments
  • Experience shipping and maintaining production AI systems
  • Experience with Microsoft’s LLMOps stack: Azure AI Foundry, Azure Machine Learning, Semantic Kernel, Azure OpenAI Service, and Azure AI Search for vector/RAG.
  • Familiarity with responsible AI evaluation frameworks and bias mitigation methods.

BICJobs

CXAjobs

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.