Software Engineer Intern - Kubernetes & Inferencing Infrastructure

Gruve

Other Engineering, Software Engineering
Texas, USA
Posted on Sep 12, 2025

About Gruve

Gruve is an innovative software services startup dedicated to transforming enterprises into AI powerhouses. We specialize in cybersecurity, customer experience, cloud infrastructure, and advanced technologies such as Large Language Models (LLMs). Our mission is to help our customers use their data to make more intelligent business decisions. As a well-funded early-stage startup, Gruve offers a dynamic environment with strong customer and partner networks.

Location: Houston, TX
Team: Inferencing & Infrastructure
Employment Type: Part-time Internship (20 hours/week)
Compensation: Monthly stipend of $2,500

About the Role

Gruve is seeking a part-time Software Engineer Intern with an interest in Kubernetes, container orchestration, and distributed systems to support the infrastructure powering our Inferencing Services platform. In this role, you will assist in designing and implementing a robust on-demand container model for AI workloads, help enable multi-tenancy with strong isolation and security, and contribute to performance and cost optimization efforts.

This internship is an excellent opportunity for students or early-career engineers who want hands-on experience at the intersection of infrastructure and AI application development, while building proficiency in Kubernetes, Python, and Go.

Key Responsibilities

  • Support the design, build, and maintenance of Kubernetes-based infrastructure for AI inferencing services.
  • Assist in developing multi-tenant microservices to support customer isolation and scalability.
  • Contribute to applying security best practices for inference workloads and model protection.
  • Help with optimization projects such as container startup times, memory footprint, and compute utilization.
  • Write and maintain code in Python and Go, and support automation with tools such as Terraform and Helm.

Basic Qualifications

  • Familiarity with Kubernetes concepts (operators, CRDs, Helm, networking).
  • Exposure to container orchestration for AI/ML workloads built on runtimes and frameworks such as TensorRT, ONNX Runtime, or PyTorch.
  • Proficiency in Python or Go for infrastructure or service development (academic or project experience acceptable).
  • Understanding of multi-tenant system design and workload isolation.
  • Interest in GPU/accelerator scheduling and performance optimization.

Preferred Qualifications

  • Experience deploying AI/ML inferencing in lab, project, or internship settings.
  • Exposure to edge inference architectures or low-latency workloads.
  • Familiarity with observability tools (Prometheus, Grafana, OpenTelemetry).
  • Knowledge of service mesh technologies (Istio, Linkerd, Cilium).
  • Contributions to open-source Kubernetes or cloud-native projects.

Opportunity for Growth

This internship has the potential to transition into a full-time position based on performance, business needs, and mutual interest.

Why Gruve

At Gruve, we foster a culture of innovation, collaboration, and continuous learning. We are committed to building a diverse and inclusive workplace where everyone can thrive and contribute their best work. If you’re passionate about technology and eager to make an impact, we’d love to hear from you.

Gruve is an equal opportunity employer. We welcome applicants from all backgrounds and thank all who apply; however, only those selected for an interview will be contacted.