SRE - TechOps MediaMath
MediaMath is a confluence of media, technology, and massive amounts of data. There is a transformation of an industry underway, and MediaMath is at the cutting edge. Our engineers develop complex, innovative, and highly scalable technology that is changing how advertising and media are bought and sold. Their breakthroughs create new marketplaces, solve long-standing problems, and push new technology every day. It's an exciting company in a very exciting industry. Our platform handles billions of transactions every hour, and we hundreds of millions of internet and mobile users every day? and we're not done yet! The platform and tools we develop are built to scale because this revolution has just begun.
The TechOps team builds and manages MediaMath?s infrastructure and data centers across four continents. The TechOps team is responsible for monitoring, expanding, contracting, and providing connectivity for all the servers, operating systems, and software that gets deployed to them. They manage and monitor how the software will work in production and run in a real environment. Their ultimate goal is zero outages and painless, managed downtime as they scale and update our massive distributed systems.
As an SRE on the TechOps team, you will be front-and-center in the effort to keep our distributed services fast and reliable, 100% of the time. Our SREs are embedded on development teams, becoming experts on our systems while providing their TechOps expertise to the product release process. You will understand how our systems behave and will be responsible for ensuring they run stably and securely.
Manage the scalability, performance, and availability of MediaMath systems by solving for reliability against existing systems and services spanning the entire stack.
Develop tools and automation to minimize delivery time and increase developer productivity.
Advise in the design and development of new and evolving services, architecture, and performance standards.
Advise in capacity planning and service performance analysis and tuning.
Respond to and resolve emergent issues. Be on-call periodically as part of shared team.
WHAT IT WILL TAKE
5+ years of relevant work experience, including experience with high-volume, production distributed systems environment.
High-level shell fluency + one or more scripting languages (Python, Perl, or similar).
Experience managing and deploying full stack, distributed services.
Experience with system automation tools (Ansible, Chef, Puppet, Salt Stack, etc.).
Experience with monitoring, alerting, and pipeline analysis tools (Nagios, Sensu, Graphite, Riemann, Logstash, etc.).
Excellent analytical skills, coupled with a strong sense of ownership, urgency and drive.
Experience with queuing/data-pipelining solutions (Storm, Kafka, RabbitMQ, ZeroMQ, etc.).
Experience with SQL/NoSQL systems such as PostgresSQL, MySQL, Cassandra, CouchDB, or Redis.
WHAT MIGHT HELP
Exposure to AWS and OpenStack APIs.
You have a passion for distributed computing and/or large scale systems.
How do we reward our outstanding Math Men and Math Women? We start with company equity, comprehensive medical, dental, vision, short term and long term disability and life insurance, Open Paid Time Off, free on-site chair massages and our 401(k) and 401(k) matching. We then serve-up flexible spending accounts, bagel Fridays, free snacks and sodas, and the latest and greatest technology you need to do your job (including a $250 biennial allowance for the latest smartphone). And the cherry on top? Monthly MathMixers; including Potlucks, Trivia Nights, and pick-up basketball, to name a few.
In achieving their duties and responsibilities, MediaMath employees are expected to behave in keeping with the Math Values of SPACE: Scalable Innovation, Performance, Accountability, Collaboration, and Empowerment.
||New York, NY |