Sample Selector

Sample Selector is a tool for creating and editing samples, or groups of data you compare across—they're not "samples" in the statistical sense, but more like filters.

By default, a single sample exists: "All Data". With the Sample Selector, you can create new samples to organize your data.

You can use samples to:

A sample is composed of one or more filters, specific conditions that narrow down your sample.

Creating a sample

The general process for creating a sample is to:

The effect of multiple filters

DataShop interprets each filter after the first as an additional restriction on the data that is included in the sample. This is also known as a logical "AND". You can see the results of multiple filters in the sample preview as soon as all filters are "saved".

« Back to DataShop

Student-Step Rollup

The Student-Step Rollup report is a tabular report that arranges transaction data by step. It includes counts of total errors, hints, and opportunity number. Optional columns are knowledge component names, Learning Factors Analysis predicted error rate, and Learning Factors Analysis slope and intercepts.

To display the Student-Step Rollup report:

  1. Click the Learning Curve tab at the top of the screen.
  2. Click the subtab Student-Step Rollup.

As on the Export page, you can export your data using the export button. The Student-Step Rollup export includes only your selected sample(s), and reflects the chosen knowledge component models; however, it includes all knowledge components and students within those samples. (To include a subset of knowledge components and/or students, define a new sample using the sample selector, and include only that sample.)

Column Descriptions

Column Description
# A row counter.
Sample The sample that includes this step. If you select more than one sample to export, steps that occur in more than one sample will be duplicated in the export.
Anon Student ID The student that performed the step.
Problem Hierarchy The location in the curriculum hierarchy where this step occurs.
Problem Name The name of the problem in which the step occurs.
Problem View The number of times the student encountered the problem associated with this step so far. Note that problem view increases regardless of whether or not the step was encountered in previous problem views. For example, a step can have a "Problem View" of "3", indicating the problem was viewed three times by this student, but that same step need not have been encountered by that student in all instances of the problem. If this number does not increase as you expect it to, it might be that DataShop has identified similar problems as distinct: two problems with the same "Problem Name" are considered different "problems" by DataShop if the following logged values are not identical: problem name, context, tutor_flag (whether or not the problem or activity is tutored) and "other" field. For more on the logging of these fields, see the description of the "problem" element in the Guide to the Tutor Message Format.
Step Name Formed by concatenating the "selection" and "action". Also see the glossary entry for "step".
Step Start Time Determined based on the presence of problem events in the log. If a problem event precedes the first transaction toward the step, step start time is equivalent to the problem event time. If the first transaction toward the step is preceded by a transaction toward a different step, the step start time is the time of that preceding transaction. If the first transaction toward the step is not preceded by either a problem event or transaction toward another step, the step start time is the time of the earliest transaction. In the case that the step start time is determined to be more than 10 minutes before the earliest transaction time, the step start time is set to null as it's considered an unreliable value.
For a visual example, see the Examples page.
First Transaction Time The time of the first transaction toward the step.
Correct Transaction Time The time of the correct attempt toward the step, if there was one.
Step End Time The time of the last transaction toward the step.
Step Duration (sec) The elapsed time of the step in seconds, calculated by adding all of the durations for transactions that were attributed to the step. See the glossary entry for more detail. This column was previously labeled "Assistance Time". It differs from "Assistance Time" in that its values are derived by summing transaction durations, not finding the difference between only two points in time (step start time and the last correct attempt).
Correct Step Duration (sec) The step duration if the first attempt for the step was correct. This might also be described as "reaction time" since it's the duration of time from the previous transaction or problem start event to the correct attempt. See the glossary entry for more detail. This column was previously labeled "Correct Step Time (sec)".
Error Step Duration (sec) The step duration if the first attempt for the step was an error (incorrect attempt or hint request).
First Attempt The tutor's response to the student's first attempt on the step. Example values are "hint", "correct", and "incorrect".
Incorrects Total number of incorrect attempts by the student on the step.
Hints Total number of hints requested by the student for the step.
Corrects Total correct attempts by the student for the step. (Only increases if the step is encountered more than once.)
Knowledge Component(s) (Only shown when the "Knowledge Components" option is selected.) Knowledge components associated with the correct performance of this step. If more than one KC is associated with this step, a separate row is created with the only differences between the rows being in the "Knowledge component" and "Opportunity #" columns.
Opportunity # (Only shown when the "Knowledge Components" option is selected.) An opportunity is the first chance on a step for a student to demonstrate whether he or she has learned the associated knowledge component. Opportunity number is therefore a count that increases by one each time the student encounters a step with the listed knowledge component.
Predicted Error Rate A hypothetical error rate based on the Learning Factor Analysis (LFA) algorithm. A value of "1" is a prediction that the student's first attempt will be an error (incorrect attempt or hint request); a value of "0" is a prediction that the student's first attempt will be correct. For more on this calculation, see the Predicted Error Rate.

See the Student-Step Rollup Example for a visual description of how step times, step durations, and correct step durations are calculated.

“Predicted Error Rate” and how it's calculated

Predicted error rate is the probability of getting a step incorrect, as predicted by the Learning Factors Analysis algorithm.

LFA equation

where

  • Υij = the response of student i on item j
  • θi = coefficient for proficiency of student i
  • βk = coefficient for difficulty of knowledge component k
  • γk = coefficient for the learning rate of knowledge component k
  • Τik = the number of practice opportunities student i has had on the knowledge component k

The intuition of this model is that the probability of a student getting a step correct is proportional to the amount of required knowledge the student knows, plus the "easiness" of that knowledge component, plus the amount of learning gained for each practice opportunity.

When there is one knowledge component per step, the above model has the form below called the Additive Factor Model (AFM). The term "Additive" comes from the fact that a linear combination of knowledge component parameters determines logit(pij) in the equation.

Additive Factor Model (AFM)

where

Additive Factor Model (AFM)

Κ = the total number of knowledge components in the Q-matrix

When there is more than one knowledge component associated with a step, DataShop uses the Conjunctive Factor Model (CFM):

Conjunctive Factor Model (CFM)

DataShop runs the Additive Factor Model (AFM) variant of LFA when a knowledge component model specifies one or zero knowledge components for each transaction, and the Conjunctive Factor Model (CFM) when a knowledge component model specifies more than one knowledge component for at least one transaction.

Both variants generate the same models, but AFM runs more quickly.

You can view values of the LFA parameters and see measures of how well the LFA statistical model fits the data on the LFA Values report (click its title on the subtab to the right of Student-Step Rollup Table).

For more information on the LFA algorithm, or for assistance interpreting the predicted error rate, contact Hao Cen. See also Learning Factors Analysis - A General Method for Cognitive Model Evaluation and Improvement.

Student-Step Rollup Example

This example demonstrates how DataShop calculates step start time, step end time, step duration, and correct step duration for a student on a series of steps.

To follow the example, refer to the timeline representation of steps and the table of calculated times (both below), and the definitions of student-step rollup fields. Note that steps alternately appear above and below the gray line to improve the readability of the example.

Step # Start Time End Time Step Duration (sec) Correct Step Duration (sec) Notes
11 15:32 15:42 10 null A problem event precedes the first transaction for the step. DataShop uses the problem event time as the step start time. The step end time is the time of the last attempt on the step. No attempt is correct for this step, so the sum of the durations is the total length of time spent on the step, and there is no Correct Step Duration.
12 15:45 15:49 4 null A problem event signifies a new instance of the same problem; it is used as the step start time. The correct attempt is not the first attempt, so again there is no Correct Step Duration.
2 null 46:00 null null No problem event precedes the first attempt for the step and the preceding transaction is more than 10 minutes before the first transaction on the step. Given this, DataShop does not calculate a step start time, nor a Step Duration or Correct Step Duration.
3 46:00 46:05 5 5 No problem event precedes the first attempt, but the preceding transaction's time is less than 10 minutes prior so it is used as the step start time. Correct Step Duration and Step Duration are equivalent because the first transaction is a correct attempt.
4 46:06 46:25 4+3+3=10 null Step 4 is interrupted by attempts toward Step 5. DataShop excludes time spent toward Step 5 in its calculation of total time spent on Step 4. The step duration is the sum of the durations for transactions at 46:10 (4s), 46:13 (3s), and 46:25 (3s).
5 46:13 46:22 9 null No problem event precedes the first attempt, but the preceding transaction's time is less than 10 minutes prior so it is used as the step start time.
Version 3.6.9 October 28, 2009 LearnLab logo