KDD Cup 2010: Educational Data Mining Challenge

Evaluation

You will be allowed to train on the training portion of each data set, and will then be evaluated on your performance at providing Correct First Attempt values for the test portion. We will provide feedback for formatting errors in prediction files, but we will not reveal accuracy on test data until the end of the competition. Note that for each test file you submit, an unidentified portion will be used to validate your data and provide scores for the leaderboard, while the remaining portion will be used for determining the winner of the competition.

For a valid submission, the evaluation program will compare the predictions you provided against the undisclosed true values and report the difference as Root Mean Squared Error (RMSE). If a data set file is missing from a submission, the evaluation program will report the RMSE as 1 for that file. The total score for a submission will then be the average of the RMSE values. All data sets will receive equal weight in the final average, independent of their size.

At the end of the competition, the winner will be the team with the lowest total score.

KDD Cup 2010: Educational Data Mining Challenge

Sponsored by the Pittsburgh Science of Learning Center

Evaluation

Important Dates in 2010

March 15
April 1

April 19

June 8
June 14

June 21
July 25

About the KDD Cup

Questions about the competition or website?

Facebook

Elsevier

ACM

Carnegie Learning

IBM Research

DataShop

KDD Cup 2010: Educational Data Mining Challenge

Sponsored by the Pittsburgh Science of Learning Center

Evaluation

Important Dates in 2010

March 15 April 1 April 19 June 8 June 14 June 21 July 25

About the KDD Cup

Questions about the competition or website?

Facebook

Elsevier

ACM

Carnegie Learning

IBM Research

DataShop

March 15
April 1

April 19

June 8
June 14

June 21
July 25