CareerRiver

Senior Engineer, Network Observability

CoreWeave Europe · Remote

📍 London, England / Remote - Irelandvia greenhousePosted 2026-06-26
Apply on company site ↗
CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to CoreWeave Europe.
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at  www.coreweave.com . We're proud to be a Living Wage accredited Employer. What You'll Do: CoreWeave’s Network Observability team designs, develops, and maintains the telemetry and monitoring systems that keep our global GPU cloud network operating reliably and at scale. We focus on building automated ingestion pipelines that provide real-time performance insights, empowering the broader engineering organization to detect anomalies proactively and resolve them before they ever impact our customers. About the role:   As a Senior Engineer for Network Observability, you will be a key player in optimizing and scaling our network metrics, analytics, and automated alerting systems. You will build and automate custom collectors, exporters, and dashboards using Python and Golang to ingest and unify telemetry across multi-vendor network platforms (including Arista EOS, NVIDIA Cumulus Linux, Nokia SR OS, and SR Linux). This position requires you to design telemetry solutions using gNMI, SNMP, and streaming analytics, integrate these systems into Kubernetes-native environments, and collaborate cross-functionally via RFCs and design discussions with SRE and security teams. You will also join a rotating on-call schedule to support production environments and mentor junior team members on observability best practices. Who You Are: 5+ years of experience working as a Network Engineer, SRE, Software Developer, or Systems Administrator in large-scale production environments with a focus on telemetry. Deep technical familiarity with Prometheus, Grafana, Alertmanager, gNMI, and SNMP, including experience writing or extending custom metric collectors and exporters. Proficient in Python, Go, and Bash scripting, alongside configuration management and templating tools (e.g., Ansible, Jinja2). Solid engineering knowledge of Linux systems and IP networking concepts (routing, switching, and protocol troubleshooting). Hands-on familiarity with diverse network operating systems, such as Arista EOS, NVIDIA Cumulus Linux, Nokia SR OS, or SR Linux. Strong experience containerizing solutions in Kubernetes and deploying container-based workloads efficiently. Passion for automation-first development to minimize manual tasks and eliminate operational error. Preferred: Bachelor’s degree in Computer Science, Engineering, or a related technical field. Experience with advanced metrics, data pipelines, event correlation, or distributed tracing tools (e.g., OpenTelemetry, Jaeger, Zipkin). Practical experience applying Machine Learning techniques or frameworks (e.g., TensorFlow, scikit-learn) for proactive network traffic anomaly detection. Industry network certifications (e.g., CCNA, CCNP, or vendor equivalents). Wondering if you're a good fit?   We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams even if you aren't a 100% skill or experience match. Here are a few qualities we've found compatible with our team. If some of this describes you, we'd love to talk. You love to build robust telemetry pipelines and automate repetitive configuration tasks to achieve near-zero human error. You're curious about multi-vendor network platform scaling and unifying complex metrics into a single, high-throughput streaming analytics layer. You're an expert in breaking down telemetry anomalies, engineering custom alerting rules, and collaborating cross-functionally to raise platform reliability standards. Why CoreWeave? At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at Your Core Act Like an Owner Empower Employees Deliver Best-in-Class Client Experiences Achieve More Together We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and enables the development of innovative solutions to complex problems. As we get set for takeoff, the organization's growth opportunities are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! The starting salary will be determined by job-related knowledge, skills, experience, and the market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility). To fulfill our obligation to protect client data, successful applicants offered employment with CoreWeave will be required to complete a basic criminal record check, conducted in compliance with GDPR. Employment offers are conditional upon receiving satisfactory check results What We Offer In addition to a competitive salary, we offer a variety of benefits to support your needs, including: Family-level Medical Insurance Family-level Dental Insurance  Generous Pension Contribution  Life Assurance at 4x Salary  Critical Illness Cover  Employee Assistance Programme  Tuition Reimbursement Work culture focused on innovative disruption Benefits may vary by locati

More Remote jobs

Remote jobs · Browse all locations