Restaurant365 is seeking a Senior DevOps Engineer to lead the design, buildout, and operational ownership of our Azure cloud infrastructure supporting internally developed Claude AI applications. This role is the technical cornerstone of a new initiative enabling business users across the organization to build, host, and securely share AI-powered applications. You will own the full lifecycle from infrastructure provisioning and CI/CD pipeline design through change management, security governance, and production operations.
This role focuses on enterprise-facing applications and internal stakeholder enablement; it does not support customer-facing product infrastructure.
This role is a strong fit for someone who enjoys owning cloud platform architecture, strengthening security and governance, improving CI/CD and operational standards, and helping internal teams bring business applications into production in a scalable and supportable way. The role sits at the intersection of DevOps, internal platform engineering, and applied AI enablement.
- Architect, implement, and operate Azure infrastructure (App Services, Functions, Container Apps, networking, storage) to host Claude AI applications built by internal business users
- Design, maintain, and improve CI/CD pipelines (Azure DevOps / GitHub Actions) for automated build, test, and deployment of user-developed applications
- Establish and enforce a change management process including environment promotion gates, approval workflows, and rollback procedures
- Serve as the primary technical owner of the hosted application platform, ensuring reliability, scalability, and cost optimization
- Partner directly with internal application owners and business stakeholders to onboard new enterprise applications to the platform, advising on architecture, deployment, and support best practices
- Define and enforce security policies for application sharing, including authentication, authorization, and network access controls
- Build and maintain observability solutions (Azure Monitor, Application Insights, Log Analytics) to provide proactive alerting and performance visibility
- Author and maintain Infrastructure as Code (Terraform / Bicep) for all Azure resources, ensuring environments are reproducible and auditable
- Drive incident response processes and conduct post-incident reviews to improve platform resilience
- Mentor mid-level DevOps engineers and contribute to team standards, documentation, and runbooks
- Develop and enforce governance policies defining standards for hosted Claude AI applications, including acceptable use policies, approval workflows for new deployments, and application lifecycle management
- Own Azure cost management and FinOps practices, monitoring spend across hosted applications, identifying optimization opportunities, and providing regular cost reporting to leadership
- Lead capacity planning and scaling strategy as internal adoption of the Claude AI hosting platform grows, ensuring infrastructure stays ahead of demand
- Manage the vendor relationship with Anthropic, coordinating on API usage, rate limits, enterprise support, and roadmap alignment
- Design and deliver training and enablement sessions for business users on how to properly build, package, and submit Claude AI applications for secure hosting
- Collaborate closely with InfoSec and IT Security teams on Entra ID integration, Conditional Access policies, vulnerability remediation, and compliance-related infrastructure controls
- Ensure platform operations meet compliance and audit requirements (SOC 2, SOX, or other applicable frameworks), maintaining evidence of controls and supporting audit activities
- Participate in an on-call rotation providing after-hours production support for hosted applications, and define escalation procedures for the broader team
- 5+ years of hands-on DevOps / Cloud Infrastructure engineering experience
- Deep expertise in Microsoft Azure (App Services, AKS, Functions, Networking, Entra ID, Key Vault)
- Strong experience designing and operating CI/CD pipelines (Azure DevOps, GitHub Actions, or similar)
- Proficiency with Infrastructure as Code tools (Terraform, Bicep, or ARM templates)
- Solid understanding of identity and access management, OAuth 2.0 flows, and secrets management patterns
- Experience with container orchestration (Docker, Kubernetes / AKS)
- Demonstrated ability to implement change management frameworks in an enterprise environment
- Experience with cloud cost management, FinOps principles, or Azure Cost Management tooling
- Familiarity with compliance frameworks (SOC 2, SOX) and supporting audit processes in a cloud environment
- Excellent communication skills with the ability to translate technical concepts for non-technical business users
- Experience supporting AI/ML application workloads or LLM-powered applications
- Familiarity with the Anthropic API, Claude AI platform, or similar LLM service integrations
- Azure certifications (AZ-104, AZ-400, AZ-305)
- Experience with policy-as-code frameworks (Azure Policy, OPA/Rego)
- Background in restaurant technology, SaaS platforms, or multi-tenant application hosting
- Experience managing vendor relationships and coordinating with third-party platform providers
- Track record of building internal developer platforms or self-service infrastructure tooling
- Compensation for this position is 1,230,000 to 1,540,000 MXN annually (102,500 to 128,333 MXN monthly), depending on experience.
- Health Insurance
- Dental Insurance
- Vision Insurance
- Life Insurance
- Meal Allowance
- Monthly Internet & Electricity Stipend
- Mental Health Support Resources
- And more!

