Software Engineer, Site Reliability Turn
Turn delivers real-time insights that transform the way leading advertising agencies and marketers make decisions. Our cloud applications and Internet-scale architecture work together to provide a complete picture of customers, execute cross-channel campaigns, and connect with a worldwide ecosystem of over 100 partners. Turn is headquartered in Silicon Valley with offices in Amsterdam, Charlotte, Chicago, Dallas, London, Los Angeles, New York, Paris, and San Francisco. Company revenue has more than doubled every year of Turn?s existence. For more information, visit turn.com or follow @turnplatform.
Interested in what it?s like to work at Turn? Listen to what our employees have to say at: http://www.turn.com/working-at-turn/
Overview: As a leading cloud marketing platform, Turn's infrastructure handles trillions of online events a day, more than 900,000 events per second. Handling this volume requires innovative approaches when it comes to monitoring and running complex applications in our mission critical production environment. The Site Reliability Engineering team takes on that mission and works as an interface between our software engineers and our operations team to ensure our 10x year over year traffic growth is handled with class leading availability.
As a SW Engineer in the site reliability team, you will design and develop systems that will help us run our "software as a service" with enterprise quality. As part of this team, you will pioneer Turn's efforts in building custom solutions for monitoring our infrastructure. You will be part of a team making the architecture decisions to support the capture and aggregation thousands of business metrics every minute and be able to use them to make data driven decisions. Building a cross component understanding of our infrastructure will be the key to success in this role, and will make you an expert in the eyes of our engineering team. Your opinions will influence the design of our next generation systems.
A successful candidate in this role will enjoy problem solving at a scale that very few companies can offer. Fluency in software engineering, network technologies and data collection/analysis is necessary. If your idea of day to day excitement and challenge ranges from looking at stack traces to diagnosing problems involving petabytes of data across thousands of servers, then Turn Site Reliability Engineering team is the place for you!
Responsibilities:
Serve as a primary point of contact for the availability, scalability and performance of our mission critical applications
Gain deep knowledge of our complex applications to develop a bird's eye view.
Assist our software engineering team to ensure proper monitoring and metrics are being built into the applications
Develop custom infrastructure and system to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment
Participate in a 24x7 rotation for third-tier escalations
Work closely with development teams to ensure that platforms are designed with "operability" in mind
Minimum Qualifications:
Bachelor's or Master?s in Computer Science
0-5 years experience in a high scalability environment
Fluency in Java is required, experience in one or more languages a plus
Experience in one or more scripting language for system level automation
Comfortable with UNIX/Linux systems
Strong opinions or demonstrated experience on how to monitor thousands of data points in real time.
| Location: |
1600 Seaport Boulevard
Suite 300
Redwood City, CA 94063
United States
|