We are expanding our team and looking for a skilled Cloud Systems Engineer who will be responsible for supporting our growing application infrastructure. The candidate will be responsible for maintaining strict production SLAs of a custom distributed application running on AWS cloud infrastructure, monitoring, alerting, incident management, SOP, capacity planning, change management, security, and overall platform stability and improvement. The ideal candidate will have a deep technical understanding of Linux operating systems and at least an intermediate understanding of Windows operating systems, strong experience with Amazon Web Services (AWS), advanced scripting skills, experience with variety of monitoring solutions, deep understanding of network communications and excellent troubleshooting skills.
Finally, we are seeking someone who wants to be a major contributor in a small, dynamic work environment, loves a challenge, and has a strong balance of technical and people skills. The Cloud Systems Engineer will be responsible for:
Maintain, monitor, and improve the performance and availability of the 24×7 production environment including networks, servers, databases, etc.
Participate in on-call rotation and provide hands-on support during emergencies, outages, and service transitions.
Define and monitor compliance to production environment SLAs.
Participate in creating long-term and short term strategies for scaling the production environment.
Adhere to a comprehensive incident management program including problem management.
Generate KPIs for service availability, uptime, and adherence to SOPs, and SLAs.
This position is open to candidates from the US, Canada and UK.

