Sr. Software Engineer - K8s - GPU Orchestration - REMOTE Job at Living Talent, San Jose, CA

V0hxQ0xtM3VxdUF1VWhzYm5PRWQwTDV3TFE9PQ==
  • Living Talent
  • San Jose, CA

Job Description

GPU Orchestration
  • Startup
  • Company size: 30
  • Remote within North America
  • Compensation: Base Salary 250k + Equity

Key Responsibilities

  • Lead Design, Architecture & Development of K8s-based cloud infrastructure.
  • Use K8s Controllers, Operators & CRs to Implement scalable, high-availability solutions.
  • Integrate Karpenter, and/or other advanced tools for infrastructure optimization.
  • Architect MLOps Middleware integration (dynamic workload migration, resource disaggregation).
  • Build monitoring, logging & alerting systems.
  • Drive infrastructure cost optimization through FinOps best practices in K8s deployments.
  • Promote K8s best practices & mentor software engineers.
  • Collaborate across teams to drive K8s adoption in multi-cloud and hybrid environments.
  • Open-Source Contributions in the Kubernetes community.

Qualifications

Kubernetes Expertise

  • Designing, deploying, and managing K8s clusters (AKS, EKS, GKE, OpenStack, etc.).
  • Hands-on experience with K8s core components (Karpenter, cluster autoscaler, CNI, CSI, CRI, CRD, operators).
  • 5+ years in Kubernetes infrastructure.
  • Contributing to open-source Kubernetes projects.
  • 10+ years: software engineering experience.
  • Go, Python, Bash, etc. (one or more).
  • Excellent communication skills for both technical and non-technical stakeholders.
  • Bachelor’s or Master’s degree in Computer Science or related field (preferred).

Preferred Experience

  • GPU scheduling, container orchestration, HPC (high-performance computing) workloads.
  • Multi-cloud & hybrid cloud deployments familiarity.
  • MLOps platforms experience (Kubeflow, TFX, etc.).
  • FinOps practices & cloud cost management experience/knowledge

Job Tags

Remote job,

Similar Jobs

Allied Universal

Security Site Supervisor - Warehouse Job at Allied Universal

Allied Universal, North America's leading security and facility services company, offers rewarding careers that provide you a sense of purpose. While working in a dynamic, welcoming, and collaborative workplace, you will be part of a team that contributes to a culture... 

Coalition Technologies

Work From Home - Office Assistant Job at Coalition Technologies

[Administrative Assistant / Remote] - Anywhere in U.S. / Competitive pay / Benefits - As an Office Assistant you'll: Answer phones and direct calls; Complete entry-level bookkeeping, including recording expenses, organizing receipts, and completing other transaction records... 

Luxe Media LLC / Felix

Volunteer Advisory Board Member Job at Luxe Media LLC / Felix

 ...to lead by example and create lasting impacts on our organization, its programs, and the community. This is an unpaid internship/volunteer opportunity.Job Description Help shape Felix Magazine by contributing to each issue. The Advisory Board solicits and writes articles... 

Arconic

Thermal and Combustion Process Engineer Job at Arconic

 ...Arconic is currently in search of Thermal & Combustion Process Engineer to join our Manufacturing Technology Engineering team based out of Alcoa, TN or Davenport, IA or Lancaster, PA. At Arconic, we are looking for people who share our values of integrity, inclusion... 

Hydrostor

Sourcing Manager Job at Hydrostor

 ...Title: Manager, Sourcing Location: Denver, CO (hybrid) or Remote in California or Texas Job Type: Full-Time Hybrid (3+ days/week...  ...technology is uniquely positioned to revolutionize the energy landscape. Join us as we lead the way in shaping a greener, more sustainable...