SRE (Site Reliability Engineering) Deployment Engineer Mirantis
Mirantis has more experience delivering OpenStack cloud operating systems to more customers than any other company. Our corporate goal is to deliver OpenStack for mission critical applications and we are totally committed to keeping production open source clouds free of proprietary hooks or opaque packaging.
Passionate about working on mission-critical projects? Like to break things apart just to see if you can put them back together, only better?
The Mirantis Wrecking Crew, our Site Reliability Engineering team, does the performance, reliability and scalability engineering so that Mirantis can deliver these Mission Critical OpenStack clusters to our customers.
The Wrecking Crew's mission is to break OpenStack, break all of OpenStack. All the time. Why? Because this is the only way to deliver enterprise-grade OpenStack to the Enterprise for running large-scale mission-critical applications. The Wrecking Crew's goal is to provide everything necessary to run tight-SLA, large-scale applications on OpenStack with no user-visible failures. Our focus is performance and reliability for OpenStack at scales larger than 100 hypervisors.
Mirantis is building a rock-solid SRE team and needs you to drive the SRE culture within the company and push our reliability and performance changes back into the community. Our Site Reliability Engineering Team is what Lucius Fox is to Batman: without him, Batman is just a man in a cape.
The Wrecking Crew is looking for people willing to continuously build OpenStack Clusters, deploy applications and break them to explore and fix every reliability, performance and scaling issue.
If this excites you, join the Mirantis Wrecking Crew.
Deployment Engineer Responsibilities:
Rapidly deploy OpenStack and workloads onto the cluster in different configurations
Instrument OpenStack and the VMs used for the workloads
Run tests and fail parts of the cluster or workload VMs to collect data
Collect results, diagnose what happened, propose a fix, make changes to OpenStack or workloads, retest
Document and fix the issues and grow the documentation and operational runbooks for MOSX
Deployment Engineer Qualifications:
Strong Linux Administration skills
Strong programming skills in at least two of Ruby, Bash, and Python
Master of automation via Puppet, SaltStack or Chef
Solid experience working with PXE boot servers (Cobbler, Foreman, Razor)
Deep experience in deploying large scale OpenStack clouds
Experience with Server Setup (BIOS, RAID, PXE booting, IPMI, DHCP)
Experience with Enterprise Networking and troubleshooting problems related to OpenStack
Experience with Zabbix or Nagios
Operations Experience with Production Systems
Experience operating compliant production systems (PCI, FISMA, SOC 1 Type II, ISO 27001)
Passion for continuous delivery and true devops
Self-motivated problem solver mentality
Ability to thrive in a fast paced, dynamic environment with constantly changing requirements
Willingness to learn and be proactive
Ability to learn new skills quickly
Solid team player with excellent communication skills
What We Offer
Work in Silicon Valley with established leaders in their industry
Work with exceptionally passionate, talented and engaging colleagues
High-energy atmosphere of a young company
Competitive compensation package with strong benefits plan and stock options
Lots of freedom for creativity and personal growth
Location: Mountain View, CA