Data Scientist Technorati
We are seeking a strong, highly motivated Data Scientist with large-scale data mining and Big Data machine learning experience. This role is responsible for developing advanced data mining algorithms and applications, building statistical and predictive models from multiple data sources, and using industry standards and best practices to implement advanced analytics.
Work with large (terabytes of data, billions of daily transactions) structured and unstructured data sets.
Work closely and iterate quickly with product teams throughout the organization.
Summarize and report analytical findings in both oral and written form.
Optimize advertising algorithms for performance and revenue.
Write and interpret MapReduce-style data analyses using Hadoop, Pig, SQL, R and scripting languages.
PhD or Masters in Computer Science, Statistics, Applied Math, Physics, Engineering or other quantitative field.
Minimum of 3 years of hands-on experience (in a corporate environment) in quantitative modeling, analysis and data mining
Strong combination of theoretical knowledge and hands-on experience in statistical techniques, development of predictive models and machine learning algorithms
Comfortable working with large, complex data sets from varying sources
Knowledge of relational database design and methods
Modeling expertise in statistical techniques such as logistic regression, decision trees, neural networks and clustering techniques
Demonstrated experience in developing algorithms and predictive models to solve real world business problems
Proven ability to translate theoretical results into practical applications while working with a diverse and innovative team
Highly motivated with the ability to work on a multitude of projects on an ongoing basis
Team player with an entrepreneurial spirit and strong communication and collaboration skills
Ability to help interview, hire, train and lead a team
Expertise in R, MATLAB and Mahout.
Proficiency in SQL
Experience in at least one compiled language (Java, C/C++ preferred).
Expertise in scripting and command-line operations
Deep understanding and hands-on experience with optimization, data mining and machine learning techniques, in particular in application to large sparse data sets
Experience analyzing internet scale datasets (billions of rows, thousands of columns)
Experience in natural language processing techniques and text analytics is a plus.
Experience with MPP databases or MapReduce (Hadoop)
Experience in high-performance computing
Digital media or web retail experience
665 Third Street
San Francisco, CA 94107