Inference Optimization Intern – Performance Modeling

Institute of Foundation Models · San Francisco Bay Area

📍 Sunnyvale, CAvia leverPosted 2026-06-24

CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to Institute of Foundation Models.

About the Institute of Foundation Models The Institute of Foundation Models is dedicated to advancing the science and engineering of large-scale AI systems. Our researchers and engineers develop cutting-edge foundation models while pushing the limits of high-performance computing and efficient AI inference. By combining deep expertise in machine learning, systems engineering, and hardware optimization, we build scalable AI solutions that drive scientific discovery and real-world impact. As part of the team, interns work alongside world-class researchers and performance engineers to optimize the execution of large-scale foundation models on next-generation NVIDIA GPU architectures. This internship provides hands-on experience in low-level GPU performance analysis, kernel optimization, and hardware-aware inference acceleration.

More San Francisco Bay Area jobs

Facility Manager - Residential Valet Operations
Reimagined Parking
Valet Parking Attendant - Hyatt Regency ( Seasonal / Part Time )
Reimagined Parking
Valet Supervisor - Millennium Tower
Reimagined Parking
Hotel Parking Operations Manager- San Jose
Reimagined Parking
Administrative Clerk
TeleSolv Consulting
File Clerk
TeleSolv Consulting

San Francisco Bay Area jobs · Browse all locations