Senior Principal Software Engineer, AI Infra Compute

Oracle

Austin, TX, United States

Posted March 24, 2026

How to Get Hired at Oracle

Uses Oracle Recruiting Cloud (Oracle HCM)
  • Create your Oracle Careers profile with fully completed structured fields — recruiters filter candidates by location, job family, and experience level before ever reading a resume, so incomplete profiles reduce your visibility
  • Embed Oracle-specific product terminology (OCI, Oracle Fusion, PL/SQL, Oracle Autonomous Database, NetSuite) throughout your resume to maximize keyword matching in Oracle Recruiting Cloud's parsing system
Read the full guide

Score Your Resume

Check how well your resume matches this Senior Principal Software Engineer, AI Infra Compute role. Free, no signup required.

Choose your resume or drop it here

PDF or DOCX, max 5 MB

Analyzing resume...

Comparing keywords...

Job Description

Our team is the GPU Availability and Monitoring team in the Compute Org. we are responsible for designing and developing architectural changes for GPU delivery, health monitoring, triage automation, and diagnostic services. These are essential for running distributed AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like RoCE and Infiniband.

We are looking for a highly skilled and motivated distributed systems engineer who can architect solutions to scale and optimize Monitoring and Repair solutions for AI infrastructure components like GPU control plane and GPU data plane that provide computing resources to customer AI workloads. You will provide technical leadership to the team and bring clarity to ambiguous problems and come up with innovative solutions. You will collaborate with cross-functional teams to enhance our AI infrastructure to deliver exceptional customer experience and peak performance. 

Finished reading? See how your resume stacks up against this role.

Score Your Resume
Apply on company website

Similar Jobs

Network Developer 3

Oracle

Seattle, WA, United States