Raw data, tidy data, and Codebook

Overview the collected data across languages and laboratory settings.

Raw data files

The all raw data were uploaded to theproject OSF. We will make the raw data public once we submit the final report to PB&R.

Tidy data files

The example analysis in this website were from the tidy data files. Researchers would access the data files by the subtitles.

Sentence-Picture verification

Metadata

Description

Dataset name: results

The dataset has N=71025 rows and 17 columns. 71025 rows have no missing values on any column.

Metadata for search engines
  • Date published: 2021-10-18
x
PSA_ID
SEED
datetime
logfile
subject_nr
task_order
List
Match
Orientation
PList
Probe
Target
response_time
correct
opensesame_codename
opensesame_version
Language

Codebook table

name data_type n_missing complete_rate n_unique empty min median max mean sd whitespace hist label
PSA_ID character 0 1 50 0 7 NA 8 NA NA 0 NA NA
SEED numeric 0 1 NA NA 1 3024 9497 3615.9203 3326.363 NA <U+2587><U+2582><U+2583><U+2583><U+2583> NA
datetime character 0 1 2067 0 10 NA 24 NA NA 0 NA NA
logfile character 0 1 3289 0 8 NA 105 NA NA 0 NA NA
subject_nr numeric 0 1 NA NA 1 68 5375 922.6107 1485.397 NA <U+2587><U+2581><U+2581><U+2581><U+2581> NA
task_order character 0 1 4 0 2 NA 6 NA NA 0 NA NA
List character 0 1 68 0 1 NA 15 NA NA 0 NA NA
Match character 0 1 2 0 1 NA 1 NA NA 0 NA NA
Orientation character 0 1 2 0 1 NA 1 NA NA 0 NA NA
PList character 0 1 34 0 3 NA 14 NA NA 0 NA NA
Probe character 0 1 1075 0 11 NA 132 NA NA 0 NA NA
Target character 0 1 48 0 8 NA 22 NA NA 0 NA NA
response_time numeric 0 1 NA NA 0 638 471343 938.9592 4224.964 NA <U+2587><U+2581><U+2581><U+2581><U+2581> NA
correct numeric 0 1 NA NA 1 1 1 1.0000 0.000 NA <U+2581><U+2581><U+2587><U+2581><U+2581> NA
opensesame_codename character 0 1 3 0 5 NA 17 NA NA 0 NA NA
opensesame_version character 0 1 7 0 5 NA 10 NA NA 0 NA NA
Language character 0 1 18 0 4 NA 20 NA NA 0 NA NA
JSON-LD metadata

The following JSON-LD can be found by search engines, if you share this codebook publicly on the web.

{
  "name": "results",
  "datePublished": "2021-10-18",
  "description": "The dataset has N=71025 rows and 17 columns.\n71025 rows have no missing values on any column.\n\n\n## Table of variables\nThis table contains variable names, labels, and number of missing values.\nSee the complete codebook for more.\n\n|name                |label | n_missing|\n|:-------------------|:-----|---------:|\n|PSA_ID              |NA    |         0|\n|SEED                |NA    |         0|\n|datetime            |NA    |         0|\n|logfile             |NA    |         0|\n|subject_nr          |NA    |         0|\n|task_order          |NA    |         0|\n|List                |NA    |         0|\n|Match               |NA    |         0|\n|Orientation         |NA    |         0|\n|PList               |NA    |         0|\n|Probe               |NA    |         0|\n|Target              |NA    |         0|\n|response_time       |NA    |         0|\n|correct             |NA    |         0|\n|opensesame_codename |NA    |         0|\n|opensesame_version  |NA    |         0|\n|Language            |NA    |         0|\n\n### Note\nThis dataset was automatically described using the [codebook R package](https://rubenarslan.github.io/codebook/) (version 0.9.2).",
  "keywords": ["PSA_ID", "SEED", "datetime", "logfile", "subject_nr", "task_order", "List", "Match", "Orientation", "PList", "Probe", "Target", "response_time", "correct", "opensesame_codename", "opensesame_version", "Language"],
  "@context": "http://schema.org/",
  "@type": "Dataset",
  "variableMeasured": [
    {
      "name": "PSA_ID",
      "@type": "propertyValue"
    },
    {
      "name": "SEED",
      "@type": "propertyValue"
    },
    {
      "name": "datetime",
      "@type": "propertyValue"
    },
    {
      "name": "logfile",
      "@type": "propertyValue"
    },
    {
      "name": "subject_nr",
      "@type": "propertyValue"
    },
    {
      "name": "task_order",
      "@type": "propertyValue"
    },
    {
      "name": "List",
      "@type": "propertyValue"
    },
    {
      "name": "Match",
      "@type": "propertyValue"
    },
    {
      "name": "Orientation",
      "@type": "propertyValue"
    },
    {
      "name": "PList",
      "@type": "propertyValue"
    },
    {
      "name": "Probe",
      "@type": "propertyValue"
    },
    {
      "name": "Target",
      "@type": "propertyValue"
    },
    {
      "name": "response_time",
      "@type": "propertyValue"
    },
    {
      "name": "correct",
      "@type": "propertyValue"
    },
    {
      "name": "opensesame_codename",
      "@type": "propertyValue"
    },
    {
      "name": "opensesame_version",
      "@type": "propertyValue"
    },
    {
      "name": "Language",
      "@type": "propertyValue"
    }
  ]
}`

Picture-Picture verification

Metadata

Description

Dataset name: results

The dataset has N=76142 rows and 16 columns. 76142 rows have no missing values on any column.

Metadata for search engines
  • Date published: 2021-10-18
x
PSA_ID
SEED
datetime
logfile
subject_nr
PPList
Orientation1
Orientation2
Identical
Picture1
Picture2
response_time
correct
opensesame_codename
opensesame_version
Language

Codebook table

name data_type n_missing complete_rate n_unique empty min median max mean sd whitespace hist label
PSA_ID character 0 1 50 0 7 NA 8 NA NA 0 NA NA
SEED numeric 0 1 NA NA 1 2856 9497 3487.6104 3342.1881 NA <U+2587><U+2582><U+2583><U+2583><U+2582> NA
datetime character 0 1 2061 0 10 NA 24 NA NA 0 NA NA
logfile character 0 1 3281 0 8 NA 105 NA NA 0 NA NA
subject_nr numeric 0 1 NA NA 1 73 5375 980.4521 1516.8004 NA <U+2587><U+2581><U+2581><U+2581><U+2581> NA
PPList character 0 1 8 0 1 NA 12 NA NA 0 NA NA
Orientation1 character 0 1 2 0 1 NA 1 NA NA 0 NA NA
Orientation2 character 0 1 2 0 1 NA 1 NA NA 0 NA NA
Identical character 0 1 2 0 1 NA 1 NA NA 0 NA NA
Picture1 character 0 1 48 0 8 NA 22 NA NA 0 NA NA
Picture2 character 0 1 48 0 8 NA 22 NA NA 0 NA NA
response_time numeric 0 1 NA NA 7 589 1995 616.8791 154.4775 NA <U+2581><U+2587><U+2581><U+2581><U+2581> NA
correct numeric 0 1 NA NA 1 1 1 1.0000 0.0000 NA <U+2581><U+2581><U+2587><U+2581><U+2581> NA
opensesame_codename character 0 1 3 0 5 NA 17 NA NA 0 NA NA
opensesame_version character 0 1 6 0 5 NA 6 NA NA 0 NA NA
Language character 0 1 18 0 4 NA 20 NA NA 0 NA NA
JSON-LD metadata

The following JSON-LD can be found by search engines, if you share this codebook publicly on the web.

{
  "name": "results",
  "datePublished": "2021-10-18",
  "description": "The dataset has N=76142 rows and 16 columns.\n76142 rows have no missing values on any column.\n\n\n## Table of variables\nThis table contains variable names, labels, and number of missing values.\nSee the complete codebook for more.\n\n|name                |label | n_missing|\n|:-------------------|:-----|---------:|\n|PSA_ID              |NA    |         0|\n|SEED                |NA    |         0|\n|datetime            |NA    |         0|\n|logfile             |NA    |         0|\n|subject_nr          |NA    |         0|\n|PPList              |NA    |         0|\n|Orientation1        |NA    |         0|\n|Orientation2        |NA    |         0|\n|Identical           |NA    |         0|\n|Picture1            |NA    |         0|\n|Picture2            |NA    |         0|\n|response_time       |NA    |         0|\n|correct             |NA    |         0|\n|opensesame_codename |NA    |         0|\n|opensesame_version  |NA    |         0|\n|Language            |NA    |         0|\n\n### Note\nThis dataset was automatically described using the [codebook R package](https://rubenarslan.github.io/codebook/) (version 0.9.2).",
  "keywords": ["PSA_ID", "SEED", "datetime", "logfile", "subject_nr", "PPList", "Orientation1", "Orientation2", "Identical", "Picture1", "Picture2", "response_time", "correct", "opensesame_codename", "opensesame_version", "Language"],
  "@context": "http://schema.org/",
  "@type": "Dataset",
  "variableMeasured": [
    {
      "name": "PSA_ID",
      "@type": "propertyValue"
    },
    {
      "name": "SEED",
      "@type": "propertyValue"
    },
    {
      "name": "datetime",
      "@type": "propertyValue"
    },
    {
      "name": "logfile",
      "@type": "propertyValue"
    },
    {
      "name": "subject_nr",
      "@type": "propertyValue"
    },
    {
      "name": "PPList",
      "@type": "propertyValue"
    },
    {
      "name": "Orientation1",
      "@type": "propertyValue"
    },
    {
      "name": "Orientation2",
      "@type": "propertyValue"
    },
    {
      "name": "Identical",
      "@type": "propertyValue"
    },
    {
      "name": "Picture1",
      "@type": "propertyValue"
    },
    {
      "name": "Picture2",
      "@type": "propertyValue"
    },
    {
      "name": "response_time",
      "@type": "propertyValue"
    },
    {
      "name": "correct",
      "@type": "propertyValue"
    },
    {
      "name": "opensesame_codename",
      "@type": "propertyValue"
    },
    {
      "name": "opensesame_version",
      "@type": "propertyValue"
    },
    {
      "name": "Language",
      "@type": "propertyValue"
    }
  ]
}`

Corrections

If you see mistakes or want to suggest changes, please create an issue on the source repository.

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at https://github.com/SCgeeker/PSA002_report, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".