eye_movements

From MaRDI portal
Revision as of 12:29, 16 April 2024 by Import240416010454 (talk | contribs) (Created automatically from import240416010454)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Dataset:6037080



OpenML44034MaRDI QIDQ6037080

OpenML dataset with id 44034

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22103122/eye_movements.arff

Upload date: 18 June 2022



Dataset Characteristics

Number of classes: 2
Number of features: 24 (numeric: 20, symbolic: 3 and in total binary: 3 )
Number of instances: 7,608
Number of instances with missing values: 0
Number of missing values: 0

Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark,

                         transformed in the same way. This dataset belongs to the "classification on categorical and
                         numerical features" benchmark. Original description: 

Author: Source: Unknown - Date unknown Please cite:

Jarkko Salojarvi, Kai Puolamaki, Jaana Simola, Lauri Kovanen, Ilpo Kojo, Samuel Kaski. Inferring Relevance from Eye Movements: Feature Extraction. Helsinki University of Technology, Publications in Computer and Information Science, Report A82. 3 March 2005. Data set at http://www.cis.hut.fi/eyechallenge2005/

Competition 1 (preprocessed data) A straight-forward classification task. We provide pre-computed feature vectors for each word in the eye movement trajectory, with class labels.

The dataset consist of several assignments. Each assignment consists of a question followed by ten sentences (titles of news articles). One of the sentences is the correct answer to the question (C) and five of the sentences are irrelevant to the question (I). Four of the sentences are relevant to the question (R), but they do not answer it.


  • Features are in columns, feature vectors in rows.
  • Each assignment is a time sequence of 22-dimensional feature vectors.
  • The first column is the line number, second the assignment number and the next 22 columns (3 to 24) are the different features. Columns 25 to 27 contain extra information about the example. The training data set contains the classification label in the 28th column: "0" for irrelevant, "1" for relevant and "2" for the correct answer.
  • Each example (row) represents a single word. You are asked to return the classification of each read sentence.
  • The 22 features provided are commonly used in psychological studies on eye movement. All of them are not necessarily relevant in this context.

The objective of the Challenge is to predict the classification labels (I, R, C).


Please see the technical report for information of eye movements, experimental setup, baseline methods and references:

Jarkko Salojarvi, Kai Puolamaki, Jaana Simola, Lauri Kovanen, Ilpo Kojo, Samuel Kaski. Inferring Relevance from Eye Movements: Feature Extraction. Helsinki University of Technology, Publications in Computer and Information Science, Report A82. 3 March 2005. [PDF]


Modified by TunedIT (converted to ARFF format)


FEATURES

The values in columns marked with an asterisk (*) are same for all occurances of the word.

COL NAME DESCRIPTION 1 #line Line number 2 #assg Assignment Number 3 fixcount Number of fixations to the word 4* firstPassCnt Number of fixations to the word when it is first encountered 5* P1stFixation '1' if fixation occured when the sentence the word was in was encountered the first time 6* P2stFixation '1' if fixation occured when the sentence the word was in was encountered the second time 7* prevFixDur Duration of previous fixation 8* firstfixDur Duration of the first fixation when the word is first encountered 9* firstPassFixDur Sum of durations of fixations when the word is first encountered 10* nextFixDur Duration of the next fixation when gaze initially moves from the word 11 firstSaccLen Length of the first saccade 12 lastSaccLen Distance between fixation on the word and the next fixation 13 prevFixPos Distance between the first fixation preceding the word and the beginning ot the word 14 landingPos Distance between the first fixation on the word and the beginning of the word 15 leavingPos Distance between the last fixation on the word and the beginning of the word 16 totalFixDur Sum of all durations of fixations to the word 17 meanFixDur Mean duration of the fixations to the word 18* nRegressFrom Number of regressions leaving from the word 19* regressLen Sum of durations of regressions initiating from this word 20* nextWordRegress '1' if a regression initiated from the following word 21* regressDur Sum of durations of the fixations on the word during regression 22 pupilDiamMax Maximum pupil diameter 23 pupilDiamLag Maximum pupil diameter 0.5 - 1.5 seconds after the beginning of fixation 24 timePrtctg First fixation duration divided by the total number of fixations 25 nWordsInTitle Number of word in the sentence (title) this word is in 26 titleNo Title number 27 wordNo Word number (ordinal) in this title 28 label Classification for training data ('0'=irrelevant, '1'=relevant, '2'=correct)



This page was built for dataset: eye_movements