CareerRiver

Inference Optimization Intern – Performance Modeling

Institute of Foundation Models · San Francisco Bay Area

📍 Sunnyvale, CA · via Lever · Posted 2026-06-24
About the Institute of Foundation Models

The Institute of Foundation Models is dedicated to advancing the science and engineering of large-scale AI systems. Our researchers and engineers develop cutting-edge foundation models while pushing the limits of high-performance computing and efficient AI inference. By combining deep expertise in machine learning, systems engineering, and hardware optimization, we build scalable AI solutions that drive scientific discovery and real-world impact.

As part of the team, interns work alongside world-class researchers and performance engineers to optimize the execution of large-scale foundation models on next-generation NVIDIA GPU architectures. This internship provides hands-on experience in low-level GPU performance analysis, kernel optimization, and hardware-aware inference acceleration.
