Staff Software Development Engineer
Gruve
About Gruve
Gruve is an innovative software services startup dedicated to transforming enterprises to AI powerhouses. We specialize in cybersecurity, customer experience, cloud infrastructure, and advanced technologies such as Large Language Models (LLMs). Our mission is to assist our customers in their business strategies utilizing their data to make more intelligent decisions. As a well-funded early-stage startup, Gruve offers a dynamic environment with strong customer and partner networks.
About the Role
We're seeking an experienced Site Reliability Engineer with a strong security mindset to join our founding team. You'll be instrumental in building and scaling our infrastructure while ensuring the highest standards of security, reliability, and compliance. This role requires someone who can architect robust systems, automate everything, and think like both a builder and a defender.
Key Responsibilities
- Design, build, and maintain highly secure and reliable cloud infrastructure that supports our network security platform.
- Own the reliability and security posture of our systems from the ground up, implementing infrastructure as code, establishing monitoring and observability practices, and creating automation that enables our team to move fast without compromising security.
- Work closely with engineering, product, and security teams to embed reliability and security best practices throughout the development lifecycle. You'll influence architecture decisions, drive incident response processes, and ensure our systems meet stringent compliance requirements.
- Build and maintain CI/CD pipelines that include security scanning, automated testing, and compliance checks.
- You'll create the tools and frameworks that allow developers to ship quickly while maintaining our security standards.
- Establish comprehensive monitoring, alerting, and observability across our microservices architecture. You'll ensure we have deep visibility into system behavior and can detect and respond to issues before they impact customers.
Basic Qualifications
- 7+ years of experience in Site Reliability Engineering, DevOps, or similar roles with increasing responsibility
- Deep expertise with major cloud providers (AWS, GCP, or Azure) including security services, container services, IAM, networking, and compliance features
- Strong experience with Kubernetes in production environments, including security hardening, RBAC, network policies, and secrets management
- Proven track record with Infrastructure as Code tools (Terraform, Pulumi, CloudFormation, or similar)
- Extensive experience building and maintaining CI/CD pipelines with security and compliance integration
- Hands-on experience with microservices architectures and distributed systems
- Strong background in security hardening across the stack (OS, containers, networks, applications)
- Experience implementing and maintaining systems to meet compliance requirements (SOC 2, ISO 27001, HIPAA, or similar frameworks)
- Proficiency with monitoring and observability tools (Prometheus, Grafana, SigNoz, Wazuh, or similar)
- Expert-level scripting and automation skills (Python, Go, Bash, or similar)
Preferred Qualifications
- Strong understanding of test automation frameworks and how they integrate with CI/CD pipelines.
- Solid understanding of Git branching strategies, merge practices, and versioning.
- Knowledge of monitoring, logging, alerting, and observability practices.
- Good understanding of cloud cost optimization, right-sizing, and resource efficiency.
- Familiarity with DevSecOps concepts: IAM, RBAC, security scanning, and secret management.
Why Gruve
At Gruve, we foster a culture of innovation, collaboration, and continuous learning. We are committed to building a diverse and inclusive workplace where everyone can thrive and contribute their best work. If you’re passionate about technology and eager to make an impact, we’d love to hear from you.
Gruve is an equal opportunity employer. We welcome applicants from all backgrounds and thank all who apply; however, only those selected for an interview will be contacted.