EDGE CLOUD OPERATIONS ENGINEER
Fastly’s Edge Cloud Operations (ECO) team is focused on the performance, availability, and observability of the Edge Cloud platform. We have a high level of situational awareness, curiosity, and inquisitiveness. Our observability posture gives us a global view of the internet’s behavior allowing us to provide a best-in-class platform for our customers to deliver content in the most efficient, reliable, and secure vehicle possible. The team is guided tremendously by the notion that every packet is sacred.
As an ECO Engineer, you act as guardian and caretaker of the Edge Cloud platform. You will work as part of a globally distributed cross-functional team, caring for a globally distributed network and system that currently exceeds 15 Tbps of connected network capacity. Our job is to not only maintain the multitude of servers, switches, and bandwidth that make up over 40 Points of Presence (POP) globally, but also to work across various engineering teams to tune alerts, develop runbooks, support customer events, refine processes, and shepherd new features and infrastructure into the production environment.
RESPONSIBILITIES
As someone who is a great collaborator with a passion for continually growing your knowledge and capabilities, you come to us with a strong operations background in either network engineering, systems engineering, or both. Not limited to the following, you’ll be exposed to, and responsible for:
Critical alert response, tuning, management
Operational tool creation, routine task automation
Operational runbook development, execution, improvements
Server administration, troubleshooting
Traffic engineering, DNS management, DDOS mitigation
IP transit partner maintenance coordination
Escalation support for internal teams
New POP deployment, configuration, activation
End-to-end ownership and accountability for all Edge Cloud activities and incidents
REQUIREMENTS
Your experience and skillset includes many of the following:
Comprehensive knowledge of how CDNs work: TCP/IP, BGP anycast, DNS, HTTP, TLS, reverse proxies, etc
Experience in data center routing and switching systems
Hands-on experience operating a global network and/or a globally distributed Linux-based system
Capable of troubleshooting and diagnosing failures of hardware in a data center environment. Everything from SSD’s to AOC cables are fair game for ECO
Development or administrative experience in a Linux-based environment, and associated open-source tools (Chef, Ansible, Git, Awk, Sed, cURL, etc)
Solid understanding of how the internet works – from client to server, and everything in-between. HAProxy, nginx, and Varnish are not foreign to you
Cultivation of various monitoring platforms, such as Datadog, Nagios, Ganglia, Icinga, Pingdom, Catchpoint, Cedexis and others
Exposure to cloud environments and systems like AWS, GCE, Azure, Softlayer
Data analysis using tools and systems like Deepfield, MySQL, Google BigTable
An innate curiosity and inquisitiveness
Your qualifications include:
3+ years of direct involvement with network and/or systems engineering in a web-scale production environment
3+ years working with at least one of: Perl, Python, Go, Ruby
B. A. or B. S. in an engineering or computer-related field of study or equivalent on-the-job training
Fastly provides equal employment opportunity without regard to an applicant’s race, sex, pregnancy, sexual orientation, gender identity or expression, genetic information, national origin, age, physical or mental disability, medical condition, religion, marital status or veteran status.
Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on Fastly. Please inform us if you need assistance completing any forms or to otherwise participate in the application process.

