Facebook has launched a Kaggle competition to hire a data scientist:
'This competition tests your text skills on a large dataset from the Stack Exchange sites. The task is to predict the tags (a.k.a. keywords, topics, summaries), given only the question text and its title. The dataset contains content from disparate stack exchange sites, containing a mix of both technical and non-technical questions.'
The question is very similar to building a taxonomy to classify user questions into a number of categories, called tags. Interestingly, we face the same problem at Data Science Central: automatically attaching tags (from a set of 5,000 potential tags - e.g. big data, analytics, hadoop, career, etc.) to all the blog posts posted on our network since 2007. We might hire someone to do this!
Read the article, and get help to win, at http://bit.ly/1anrTdf
You may leave the list at any time by sending the command
SIGNOFF allstat
to [log in to unmask], leaving the subject line blank.
|