CareerRiver

Senior Infrastructure Engineer

BlackSky · Remote

📍 Herndon, VA; Remote💰 $135,000-$150,000via greenhousePosted 2026-06-25
Apply on company site ↗
CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to BlackSky.
Senior Infrastructure Engineer About Us: BlackSky is a real-time intelligence company. We own and operate the world's most advanced space-based intelligence platform and provide customers satellite imagery, automated analytics and high-frequency monitoring of strategic locations, economic assets and events from around the globe. BlackSky is trusted by the most demanding allied military and intelligence organizations and commercial companies to deliver foresight into critical matters that affect national security and the economy. BlackSky's data enables governments and businesses to see, understand and anticipate change as it happens, giving them the ultimate strategic advantage so they can act quickly. Our global team works with cutting-edge technology to make a difference around the world and prides itself on being people-first, customer-focused and fun. The BlackSky Platform team is building the premier global intelligence platform to deliver timely, relevant, and actionable information to customers. As a Senior Infrastructure Engineer, you will design, build and operate platforms that run our customer workloads across public cloud, private data centers, and air-gapped/disconnected environments.  This is a hands-on engineering role for someone who is equally comfortable writing Terraform against AWS, troubleshooting kubernetes deployments and packaging full application stacks for delivery and implementation in isolated network environments. As one of the team, you will be engaged in all aspects of our our multi-environment delivery pipelines from development through production and business continuity (BCP) environments.  You will directly be involved in the successful automated deployment of solutions that meet our customers’ business objectives. Your contribution matters! The ideal candidate brings deep Kubernetes and GitOps operational experience implementing in public, private and isolated environments.  This position reports to the Manager, Infrastructure and while we have offices in Herndon, VA and Seattle, WA, we are open to remote candidates in certain states. Responsibilities : Design and operate AWS infrastructure (VPC, subnets, NLB/ALB, IAM, EKS, EC2, S3, Route 53) and the hybrid connectivity that ties cloud to on-premises and private/air-gapped networks. Stand up and run production-grade Kubernetes clusters on EKS, Rancher (RKE2) and/or Red Hat OpenShift 4, including upgrades, capacity planning, networking, storage, and day-2 operations. Implement and own GitOps workflows with Argo CD — declarative cluster and application state, app-of-apps patterns, sync policies, drift detection, and progressive rollout strategies. Author, version, and maintain Helm charts for internal and third-party workloads, including values management, chart dependencies, and templating standards across environments. Build repeatable delivery into disconnected environments using Zarf (and equivalent packaging/mirroring tooling) — bundling images, charts, and manifests for air-gapped installs and reproducible deployments. Codify infrastructure and platform configuration as code (Terraform, Helm, Kustomize) with a clear build-once / promote-per-environment strategy. Build and harden CI/CD pipelines that move artifacts safely from dev through to restricted production and BCP targets. Integrate platform services — certificate management (cert-manager), secrets management, container registries, storage, and observability — as shared, reusable building blocks. Establish operational standards: monitoring, alerting, logging, runbooks, incident response, and capacity/cost management. Other responsibilities as assigned.  Required Qualifications: At least five years years in infrastructure, platform, DevOps, or SRE engineering, with at least 3 years running Kubernetes in production. Bachelor's degree in a relevant field of study or equivalent experience (four years). Strong hands-on AWS experience across networking, compute, storage, and IAM, including hybrid/on-prem connectivity patterns. Production experience operating Kubernetes in one or more enterprise distributions — Amazon EKS, Rancher/RKE2, or OpenShift 4. Demonstrated GitOps experience with Argo CD (or Flux) as the primary deployment mechanism. Proficiency authoring and maintaining Helm charts, and a solid grasp of Kubernetes primitives (workloads, networking, RBAC, storage, CRDs/operators). Experience with the Kubernetes Operator deployment model — deploying and managing workloads via operators and CRDs (OLM/OperatorHub). Strong infrastructure-as-code skills, ideally with Terraform. Comfort with Linux systems administration and scripting (Bash, plus Python or Go). Experience building on hardened, non-CVE / zero-known-vulnerability base images (e.g., Chainguard, Iron Bank, or distroless/minimal baselines) and supply-chain security practices. Production monitoring and observability with Prometheus and Grafana (exporters, PromQL, alerting, dashboards). Clear written and verbal communication, and the ability to work independently across the full lifecycle of a platform component. Preferred Qualifications :     Breadth across all three of EKS, Rancher/RKE2, and OpenShift 4, with the ability to move fluidly between them.     Experience running Kubernetes in edge / resource-constrained environments (e.g., k3s), including the operational tradeoffs of lightweight and disconnected deployments.     Direct experience packaging and deploying into air-gapped / disconnected environments using Zarf, image mirroring, and private registries.     Container and image scanning experience (Trivy, Grype, Clair, or equivalents) integrated into CI/CD and registry workflows.     Familiarity with secrets management (Vault, External Secrets Operator) and PKI/certificate automation.     Experience with persistent storage at scale (Ceph, EBS/EFS-backed storage classes).     Hands-on OpenTelemetry (OTEL) expe

More Remote jobs

Remote jobs · Browse all locations