转一个USA的Sr Data Scientist 职位需求描述
因为在一个linkedin的group中,经常收到一些相关话题的讨论组邮件。今天看到一个职位推荐信息,遂打开浏览,发现正是时下流行的data scicentist 。。。个人觉得其职位需求非常值得参考,转载如下:
Sr Data Scientist - NYC 写道Sr Data Scientist - NYCFast growing online technology company seeks a seasoned, hands-on analytics professional to provide all of the company’s data mining & modeling activities, while playing an instrumental role in the development of data-related products and services.
You will:
Develop and implement the company’s data mining and modeling activities in support of our clients’ online targeting and digital media marketing goals?Develop and maintain ongoing data exploratory analyses against internal and client-provided dataSupport the business teams to solve complex segmentation and analysis challenges?Conduct studies, tests and use advanced data mining & modeling techniques to build solutions that optimize the quality and performance of the company’s data?You have:
5 10 years relevant analytics and data mining & modeling work experience?Proven ability to develop and execute sophisticated data mining & modeling solutions?In depth knowledge of the advanced techniques and tools common to the data mining world such as KNIME, RapidMiner, and Weka; knowledge of R and machine learning techniques a very strong plus?Strong attention to detail and excellent quantitative and qualitative analytical ability?Strong experience in marketing analytics, including marketing response modeling and customer segmentation; experience with online media and targeting a big plus?Working experience with relational database management system (RDBMS); with a strong knowledge of SQL?High energy and enthusiasm to develop the next big thing in internet advertising; someone that can think out of the box, solving new and unique analytical challenges?Advanced degree in statistics required???
?
顺便再列举几个开源的数据挖掘系统/软件:
Orange ??Orange?是一个基于组件的数据挖掘和机器学习软件套装,它的功能即友好,又很强大,快速而又多功能的可视化编程前端,以便浏览数据分析和可视化,基绑定了 Python以进行脚本开发。它包含了完整的一系列的组件以进行数据预处理,并提供了数据帐目,过渡,建模,模式评估和勘探的功能。其由C++ 和 Python开发,它的图形库是由跨平台的Qt框架开发。RapidMiner?
RapidMiner, 以前叫 YALE (Yet Another Learning Environment), 其是一个给机器学习和数据挖掘和分析的试验环境,同时用于研究了真实世界数据挖掘。它提供的实验由大量的算子组成,而这些算子由详细的XML 文件记录,并被RapidMiner图形化的用户接口表现出来。RapidMiner为主要的机器学习过程提供了超过500算子,并且,其结合了学习方案 和Weka学习环境的属性评估器。它是一个独立的工具可以用来做数据分析,同样也是一个数据挖掘引擎可以用来集成到你的产品中。