Rafael Perez Mendoza University of Electro-Communications Japan
Provide a URL to a web page, technical memorandum, or a paper.
No response.
Provide a general summary with relevant background information: Where does the method come from? Is it novel? Name the prior art.
This is just a brute force approach to find the most fit combination of averages of some relevant variables. The variables were empirically chosen after receiving results from kNN.
Summarize the algorithms you used in a way that those skilled in the art should understand what to do. Profile of your methods as follows:
Please describe your data understanding efforts, and interesting observations:
Brute force approach with all the possible combination of features using kNN
Details on feature generation:
New features were generated by averaging the Correct First Step using all the possible combinations of "AnonStudentId", "ProblemHierarchy", "ProblemName", and "StepName"
Details on feature selection:
"AnonStudentId", "ProblemHierarchy", "ProblemName", and "StepName" where selected as the most profitable features after doing a brute force approach with kNN
Details on latent factor discovery (techniques used, useful student/step features, how were the factors used, etc.):
More details on preprocessing:
Details on classification:
Details on model selection:
Scores shown in the table below are Cup scores, not leaderboard scores. The difference between the two is described on the Evaluation page.
A reader should also know from reading the fact sheet what the strength of the method is.
Please comment about the following:
No advantages found, this method is big and slow
Details on the relevance of the KC models and latent factors:
Details on software implementation:
Details on hardware implementation. Specify whether you provide a self contained-application or libraries.
Provide a URL for the code (if available):
The data is a bit too big for our computer capabilities, made us redefine the use of databases and parallel processing
List references below.