Senior Staff Engineer, AI Software

Samsung Semiconductor · San Francisco Bay Area

📍 San Jose, California, United States💰 $189,000via greenhousePosted 2026-06-10

CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to Samsung Semiconductor.

Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World’s Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you’ll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what’s possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We’re dedicated to empowering people to be their true selves. Together, we’re building a better tomorrow for our employees, customers, partners, and communities. Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you’ll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what’s possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We’re dedicated to empowering people to be their true selves. Together, we’re building a better tomorrow for our employees, customers, partners, and communities. Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you’ll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what’s possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We’re dedicated to empowering people to be their true selves. Together, we’re building a better tomorrow for our employees, customers, partners, and communities. The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads while minimizing energy consumption and maximizing performance. To achieve this goal, we collaborate closely with both hardware and software engineers to identify and address the unique challenges posed by AI/ML workloads and to explore new computing abstractions that can provide a better balance between the hardware and software components of our systems. Additionally, we continuously conduct research and development in emerging technologies and trends across memory, computing, interconnect, and AI/ML, ensuring that our platforms are always equipped to handle the most demanding workloads of the future. By working together as a dedicated and passionate team, we aim to revolutionize the way AI/ML applications are deployed and executed, ultimately contributing to the advancement of AGI in an affordable and sustainable manner. Join us in our passion to shape the future of computing! Location: Daily onsite presence at our San Jose, CA office / U.S. headquarters in alignment with our Flexible Work policy. What You’ll Do Lead the co-design of software and hardware solutions that optimize AI model inference performance, with a focus on overcoming memory bottlenecks. Analyze and optimize LLM and agentic AI workloads across the full software stack, identifying opportunities for hardware-aware acceleration. Profile and characterize model execution to expose memory wall limitations and guide architectural decisions for HBM and memory-centric compute. Collaborate with hardware teams to influence memory architecture, acceleration strategies, and compute placement based on real workload behavior. Develop, optimize, and benchmark inference and serving solutions using frameworks such as PyTorch and vLLM. Define best practices and provide technical mentorship across software–hardware co-design efforts. What You Bring Bachelor’s with 15+ years, or Master’s with 13+ years, or PhD's with 10+ years of industry experience. Strong experience writing high-performance AI framework software development for GPUs or other accelerators. Strong, end-to-end understanding of the AI infrastructure, AI software stack, from model definition through deployment and serving. Solid understanding of LLM model architectures and workflows, including modern transformer-based designs. Solid understanding of agentic AI architecture and workflows. Hands-on expertise with the PyTorch framework. Practical experience with vLLM for high-throughput model inference and serving. Solid understanding of the memory wall problem and its impact on AI system performance. Strong knowledge of memory architecture, including High Bandwidth Memory (HBM), and familiarity with memory-centric acceleration and compute approaches. Proficiency working in a Linux development environment. Solid command of development tooling, including agentic coding, GitHub and Jira. #LI-VL1 What We Offer The pay range below is for all roles at this level across all US locations and functions. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community. Enjoy Time Away Yo

More San Francisco Bay Area jobs

Facility Manager - Residential Valet Operations
Reimagined Parking
Valet Parking Attendant - Hyatt Regency ( Seasonal / Part Time )
Reimagined Parking
Valet Supervisor - Millennium Tower
Reimagined Parking
Hotel Parking Operations Manager- San Jose
Reimagined Parking
Administrative Clerk
TeleSolv Consulting
File Clerk
TeleSolv Consulting

San Francisco Bay Area jobs · Browse all locations