Stack Exchange is growing fast, and our infrastructure needs just keep getting bigger. We’re looking for a developer to join our Site Reliability Engineering team of outstanding sysadmins and developers working on sites that see 3000-4000 hits per second during peak times.
At Stack Exchange we’re passionate about our technology. We own and operate our own infrastructure, and take the time to do it right. We like to stay on the cutting edge of technology, so you will always being working with or working towards using the latest and greatest there is. We get all the hardware we need for redundancy and performance, and we create our own tools when needed (like our open source monitoring system).
Some projects that you will be working on:
Bosun and scollector – our open source monitoring system and data collection agent written in Go
Opserver – our open source dashboard and additional monitoring tool
Building systems to automate some of the SRE pain points around a mixed Windows and Linux environment including deployment, patching, inventory, and config management
Extend some of the open source projects we use including logstash and puppet
Technologies you’ll work with:
Windows 2012 and 2012 R2
Modern Linux distributions – we’re running CentOS 6 and 7
IIS, DFS, Multi-site AD, SQL Server 2012 and 2014
Go and C# as primary languages, plus some work in Powershell, Python, etc.
