Michael Jahrer commendo research Austria
Andreas Töscher commendo research
Provide a URL to a web page, technical memorandum, or a paper.
No response.
Provide a general summary with relevant background information: Where does the method come from? Is it novel? Name the prior art.
We use an ensemble of collaborative filtering techniques originally developed for the Netflix competition and adopt them for educational data mining. In special we use the idea of matrix factorization, which is successfully used for collaborative filtering and adapt it to include all provided information. Additionally we factorize student/step/group relationships and purely neighborhood based relationships. A neural network combines an ensemble of predictions stemming from the mentioned methods and some baseline predictions.
Summarize the algorithms you used in a way that those skilled in the art should understand what to do. Profile of your methods as follows:
Please describe your data understanding efforts, and interesting observations:
We use all provided information in our factor models.
Details on feature generation:
The features were used at the blending stage.
Details on feature selection:
We do not use feature selection.
Details on latent factor discovery (techniques used, useful student/step features, how were the factors used, etc.):
We use matrix factorization which determines the latent features during the learning process.
More details on preprocessing:
Missing KCs are represented via a separate KC.
Details on classification:
Details on model selection:
We used a 8 fold cross validation for model selection.
Scores shown in the table below are Cup scores, not leaderboard scores. The difference between the two is described on the Evaluation page.
A reader should also know from reading the fact sheet what the strength of the method is.
Please comment about the following:
Details on the relevance of the KC models and latent factors:
The latent factors are most important for high accuracy. KC information can be used to achieve higher accuracy.
Details on software implementation:
Details on hardware implementation. Specify whether you provide a self contained-application or libraries.
Provide a URL for the code (if available):
List references below.