Senior Software Engineer - AI/ML

Gruve

Software Engineering, Data Science

California, USA

USD 80-85 / hour

Posted on Mar 7, 2026

About Gruve

Gruve is an innovative software services startup dedicated to transforming enterprises into AI powerhouses. We specialize in cybersecurity, customer experience, cloud infrastructure, and advanced technologies such as Large Language Models (LLMs). Our mission is to help our customers use their data to make more intelligent business decisions. As a well-funded early-stage startup, Gruve offers a dynamic environment with strong customer and partner networks.

About the Role

We are seeking a highly skilled Senior Software Engineer - AI/ML to architect and deliver enterprise-grade AI solutions within a complex healthcare environment. This role focuses on designing, building, and deploying Large Language Model (LLM) and Retrieval-Augmented Generation (RAG) systems that integrate securely and seamlessly into clinical and operational workflows.

The ideal candidate brings deep expertise in transformer-based models, production-scale ML systems, and cloud-native architectures, with experience operating in regulated environments such as healthcare. This is a hands-on technical leadership role requiring ownership of the full AI lifecycle, from design through deployment and optimization.

Key Responsibilities

Architect and deliver scalable AI/ML solutions with emphasis on LLMs, RAG architectures, and deep learning systems.
Own the full AI lifecycle including data ingestion, document indexing, embedding generation, retrieval design, preprocessing, fine-tuning, evaluation, and production deployment.
Design and optimize RAG pipelines leveraging vector databases (FAISS, Pinecone, Milvus, Weaviate) and frameworks such as LangChain and LlamaIndex.
Implement advanced fine-tuning methodologies including LoRA and QLoRA for domain-specific transformer optimization.
Develop hybrid RAG + reasoning workflows for complex enterprise use cases.
Curate and manage structured and unstructured healthcare datasets; implement chunking, embedding, and retrieval strategies to enhance contextual accuracy.
Establish robust evaluation frameworks measuring retrieval accuracy, faithfulness, latency, hallucination rates, and response relevance.
Optimize model performance through embedding tuning, reranking strategies, inference optimization, and efficient compute utilization.
Build and maintain MLOps / LLMOps pipelines covering CI/CD, deployment automation, monitoring, drift detection, and continuous improvement.
Deploy AI services across AWS and Azure in secure cloud-native and hybrid architectures.
Develop APIs and microservices to integrate AI capabilities into enterprise healthcare systems.
Ensure data security, privacy, and regulatory compliance in line with HIPAA standards.
Collaborate with cross-functional stakeholders including clinical, product, engineering, and compliance teams.
Mentor engineers and establish best practices in AI architecture and production-grade ML systems.

Basic Qualifications

5–8+ years of experience in AI/ML engineering or related roles.
Strong foundation in machine learning, deep learning, and transformer architectures.
Hands-on experience with Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems.
Proficiency in Python and ML frameworks such as PyTorch, TensorFlow, and scikit-learn.
Experience working with vector databases (e.g., FAISS, Pinecone, Milvus, Weaviate).
Experience designing and deploying production-grade AI systems.
Familiarity with cloud platforms (AWS, Azure) and containerized deployment models.
Experience operating in regulated environments with healthcare compliance standards (HIPAA or similar).
Strong problem-solving skills and cross-functional communication abilities.

Preferred Qualifications

Experience designing hybrid Vector + Graph RAG architectures.
Hands-on experience with knowledge graph design, graph databases, and graph query languages (Neo4j, RDF/SPARQL, Cypher).
Expertise in advanced fine-tuning techniques such as LoRA and QLoRA.
Experience implementing LLM evaluation frameworks and hallucination detection systems.
Background in healthcare AI systems or clinical data integration.
Experience building scalable microservices architectures for AI platforms.
Prior experience mentoring engineers or leading AI architecture initiatives.

Salary Range & Employment Details

Hourly Rate: $80–85 per hour

This position is being hired for a customer of Gruve.

Candidates may engage in one of the following ways:

W-2 employee of Gruve, contracted to provide services to one of our clients
Corp-to-Corp contractor arrangement

This is an initial 3- to 6-month contract, with the opportunity for renewal or extension based on performance and client needs.

Gruve is unable to provide visa sponsorship for this role. Applicants must be authorized to work in the United States without the need for current or future sponsorship.

Why Gruve

At Gruve, we foster a culture of innovation, collaboration, and continuous learning. We are committed to building a diverse and inclusive workplace where everyone can thrive and contribute their best work. If you’re passionate about technology and eager to make an impact, we’d love to hear from you.

Gruve is an equal opportunity employer. We welcome applicants from all backgrounds and thank all who apply; however, only those selected for an interview will be contacted.
