'Big Data' Data Scientist, Cloud Analytics Platform Greenplum
The analytics team consists of Data Scientists and Data Engineers, who will work together to develop analytical capabilities for generating business insights from big data. The Data Scientists will use these capabilities to provide answers to our customers' most urgent questions, exercising statistical methods and models against some of the world's largest data warehouses. The Data Engineers will provide the technologies required to produce these sophisticated analytics. They will develop a new layer of applications and tools on top of the Greenplum data warehouse technology that will facilitate statistical analysis and modeling. Examples will include Map/Reduce functions, statistical tests, modeling capabilities, and manipulation of large matrices. The Data Engineers will often work closely with leading academics and industry experts, as well as the engineers who are responsible for the database engine.
Responsibilities:
- Develop tools and technologies that support the use of statistical analysis and modeling to deliver business insights based on data stored in Greenplum data warehouses.
- Create re-usable implementations of statistical tests and models using the available technologies in the Greenplum database.
- Work alongside Data Scientists on client projects.
- Collaborate with Greenplum database engineers to enhance the analytics capabilities of the database.
- Work with the academic and business community to develop new techniques and to contribute to research in the area of business intelligence and analytics on large databases.
Requirements:- A passion for software development and for data
- At least three years experience of software development in languages such as C, C++, Java, and scripting languages such as Perl or Python
- Strong experience of databases and SQL, and experience of working with large data sets
- Familiarity with parallel programming frameworks like Google MapReduce or Apache Hadoop
- Some familiarity with statistical methods, mathematical modeling and business analytics, and preferably experience with statistical languages and packages such as R, SAS and Matlab
- A BS or advanced degree in a technical field (e.g. CS, mathematics, statistics, physics)
- A team player, capable of conducting independent research
- Results-driven, self-motivated, self-starter
- Not afraid to take on hard technical challenges
- Excellent communication skills
| Location: |
San Mateo, CA
United States
|