Principal GPU Network Software Engineer - Communications Library (Santa Clara) Job at AMD, Santa Clara, CA

  • AMD
  • Santa Clara, CA

Job Description

Principal GPU Network Software Engineer - Communications Library

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

The Role

As a GPU Network Software Engineer in our Network Technology Solutions Group (NTSG), you will design, implement, and test features in communication libraries, middleware, and frameworks to provide best-in-class support for GPU applications running high-performance computing and machine learning workloads at scale. You will work with technical experts within AMD, our partners, and the open-source community to implement these features as part of AMD’s open-source ROCm stack for GPU computation.

The Person

You are accustomed to working in a dynamic, geographically distributed agile team, where partnership and collaboration are paramount. You possess excellent written and verbal communication skills, and strong attention to detail. You are results-oriented and accustomed to tight deadlines and changing priorities. Most importantly, you are constantly thinking of ways to improve performance of multi-node GPU applications.

Key Responsibilities

  • Design, implement, and test features to enhance GPU support in communication libraries, middleware and frameworks
  • Benchmark, profile and optimize code to maximize performance of multi-node GPU applications
  • Deliver high-quality code and documentation following best practices for open-source software development
  • Work with key technical experts at our customers, across AMD, and with our industry partners in the Ultra Ethernet Consortium and Ultra Accelerator Link Consortium to advance scale-out and scale-up software and hardware solutions

Preferred Experience

  • Strong background developing system software in C/C++
  • Experience with at least one of the following:
      • Implementing communication middleware like MPI/SHMEM
      • Implementing lower-level communication frameworks like UCX and libfabric, or development using RDMA APIs
      • Development and optimization of communication collective algorithms (e.g. AllReduce)
  • Familiarity with GPU programming in HIP or CUDA
  • In-depth knowledge of best practices in software development, including testing, profiling, debugging, documentation, version control, issue tracking, and planning
  • Proven track record contributing to open-source projects

Academic Credentials

  • B.Sc. or B.Eng. degree in Computer Science, Electrical Engineering, or equivalent

Advanced degrees such as M.Sc., M.Eng., or Ph.D. are preferred.

LOCATION:

Santa Clara, CA

Benefits offered are described in AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Semiconductor Manufacturing
