Software/Systems Engineer (SRE), Production Engineering Quantcast
THIS JOB HAS EXPIRED The production engineering team at Quantcast consists of three specific groups that make up data centers, infrastructure software, and application (or ?product?) operations. The teams work cross-functionally where possible to promote knowledge sharing and the elimination of any single point of failure.
These engineers (commonly referred to in industry as Site Reliability Engineers) truly are versatile, possessing a ?triple threat? toolbox consisting of expertise in software develompent, Linux systems/administration, and networking. Each person is unique in their personal and technical strengths and will likely bring valuable depth in one or two of these areas in particular.
A candidate for this group will be excited by the opportunity to join an elite team of systems and software engineers that are tasked with not only keeping up with our current scaling demands, but also laying the groundwork of various tools, automation, and processes from scratch along the way. We operate at a very large scale (think 20 petabytes of data processed and tens of billions of requests handled per day) but are in rapid growth mode and really just getting started. As the experts of our production systems and applications, this team provides guidance to all other software develompent teams within Quantcast for fault-tolerance, performance, and scalability.
We offer you a unique blend of experience: to work in our current truly large-scale environment, but also one that requires developing many new systems to take our products and platforms from medium-size to very large.
Technologies and problems:
Large-scale data transfer, datacenter replication, and continuous deployment architecture for high-traffic website
Tooling for 1000+ node commodity compute clusters, custom mapreduce framework, and open sourced distributed file system (QFS)
Monitoring/alerting software for a global WAN infrastructure that handles up to 500k transactions per second
?Cloud automation? and provisioning software
Open source implementations including Redis, puppet, Hadoop, Storm, QFS, OpenStack, etc.
Active participation in design and code reviews with product development teams
Operations, scaling, and troubleshooting for all mission-critical systems, applications, and data pipelines
Strong understanding of Linux administration and automation techniques
Excellent troubleshooting skills to manage live, large-scale systems
Fluency in one or more programming or scripting languages (e.g., Java/C++, Python/Perl/bash/Ruby)
Knowledge of networking protocols (e.g., TCP/IP, HTTP, DNS)
BS or MS in Computer Science, alternate technical major, or equivalent years of relevant experience
Comfortable participating in 24/7 on-call rotation
Nice to have:
Understanding of performance, scalability and reliability problems at large scale (thousands of processing and storage units) a plus
Understanding of map/reduce implementation a plus
Experience with system configuration-management systems (e.g., CFEngine, Puppet, Chef) a plus
Quantcast is an audience measurement and targeting company. The pioneer of direct audience measurement, we start with the industry?s most in-depth understanding of digital audiences to help marketers and publishers buy and sell the most effective targeted advertising and drive conversions through the full funnel. Our products let publishers match their audience to the exact consumers an advertiser wants to reach with impression level targeting. Ranked Fast Company?s #3 Most Innovative Company on the Web and the Overall Winner of AlwaysOn?s Global 250 Top Private Companies, Quantcast is used by the world?s leading advertisers, the top 10 media agencies and 100+ million web destinations. Launched in 2006, Quantcast is headquartered in San Francisco and backed by Founders Fund, Polaris Venture Partners, Revolution Ventures and Cisco Systems.
||400 Second Street |
San Francisco, CA 94107
THIS JOB HAS EXPIRED