SPE Database

This project was inspired by the Confidence Database. We aimed at curating a database that include trial-level data and other meta-data from as many empirical studies that used self-matching task from Sui, He, & Humphreys (2012). 

Currently, the SPE database includes trial-level data from 44 papers, covering 70 datasets and 3603 participants in total. Each dataset includes information on reaction times (RTs), accuracy (ACC), and other information reported in papers. Participants in these included studies come from diverse cultural backgrounds, facilitating cross-study comparisons and meta-analytic investigations.

The SPE Database is continuously updated as new studies and datasets become available. We welcome contributions from researchers who wish to share their data and help expand this resource. If you are interested in contributing or collaborating, please feel free to reach out!

This project is in parallel with an on-going preregistered meta-analysis leading by Hu Chuan-Peng and Zheng Liu (see registry here).

Leading Team

Zhenxin Cai (School of Psychology, Nanjing Normal University,email:czx@nnu.edu.cn)
Wang Qihui(School of Psychology, Nanjing Normal University,email:QAQbigWang@163.com.)
Xinru Sun (School of Psychology, Nanjing Normal University)
Wanke Pan (School of Psychology, Nanjing Normal University)
Mengzheng Hu (School of Psychology, Nanjing Normal University)
Zheng Liu (Division of Applied Psychology, School of Humanities and Social Science, CUHK-Shenzhen)
Jie Sui (School of Psychology, University of Aberdeen)
Hu Chuan-Peng (Corresponding author, School of Psychology, Nanjing Normal University, email: hcp4715@hotmail.com)

Data contributors

Authors of original studies were invited and listed here, if permitted, as contributors. We will adhere to Sage's authorship criteria for authors in our future data descriptor paper. That is, authors of our data descriptor paper must have been responsible for at least one of the following CRediT roles:

Conceptualization
Methodology
Formal Analysis
Investigation

AND at least one of the following:

Writing - Original Draft Preparation
Writing - Review & Editing

Contributors

Marco Bertamini (Department of General Psychology, University of Padova)
Mario Dalmaso (Department of Developmental and Social Psychology, University of Padova)
Michele Vicovaro (Department of General Psychology, University of Padova)
Merryn D. Constable (Department of Psychology, Northumbria University)
Christian Frings (University of Trier)
Céline Haciahmet (University of Trier)
Sarah Schäfer (University of Trier)
Bernhard Pastötter (University of Trier)
Judith Goris (Department of Experimental Psychology, Ghent University)
Letizia Amodeo (Department of Experimental Clinical and Health Psychology, Ghent University)
Annabel D. Nijhof (Department of Experimental Clinical and Health Psychology, Ghent University)
Jan R. Wiersema (Department of Experimental Clinical and Health Psychology, Ghent University)
Lili Guan (School of Psychology, Northeast Normal University)
Luis J. Fuentes (Departamento de Psicología Básica y Metodología, Facultad de Psicología y Logopedia, Universidad de Murcia)
Lucía B. Palmero (Departamento de Psicología Básica y Metodología, Facultad de Psicología y Logopedia, Universidad de Murcia)
Ivar Kolvoort (Department of Psychology, Programme Group Psychological Methods, University of Amsterdam)
Tal Makovski (Department of Psychology, Tel-Hai Academic College)
Víctor Martínez-Pérez (University of Castilla-La Mancha Albacete Campus, Faculty of Medicine (UCLM - Albacete))
Mayan Navon (Department of Education and Psychology, the Open University of Israel)
Georg Northoff (Institute of Mental Health Research, University of Ottawa)
Xiangping Gao (Department of Psychology, Shanghai Normal University)
Haoyue Qian (School of Physics and Shanghai Key Laboratory of Magnetic Resonance, East China Normal University; Department of Psychology, Shanghai Normal University)
Kalai Hung (Tsinghua University)
Michella Feldborg (University of Aberdeen)
Fei Wang (Tsinghua University)
Qiongdan Liang (Tsinghua University)
Yongfa Zhang (Tsinghua University)
Tuo Liu(Goethe University Frankfurt)
Mateusz Wozniak (Social Cognition in Human-Robot Interaction Group, Italian Institute of Technology; Social Mind Center, Department of Cognitive Science, Central European University; Cognition and Philosophy Lab, Department of Philosophy, Monash University; Institute of Psychology, Jagiellonian University)

Data Version

Version 0.1.4 — 2026-04-13

New features/changes

[Visualization Tool]: Added Shiny-based interactive data cleaning tool with batch processing capabilities
[Batch Processing]: Support for processing multiple *_raw.csv files in a directory automatically
[Interactive Interface]: Web-based UI for variable mapping, Identity standardization, and data preview
[Flexible Input]: Support for various path formats (with/without quotes, forward/backward slashes)
[Identity Mapping]: Manual Shape and Label Identity mapping with auto-detection suggestions
[Batch Download]: Results packaged as ZIP archive for easy distribution
[Progress Tracking]: Real-time batch processing progress and detailed results summary

Version 0.1.3 — 2025-10-20

New features/changes

[Data Filtering]: Using R, retaining behavioral variables required for calculating the Self-Prioritization Effect (SPE), including Matching, Shape/Face/Voice, Label, Identity (Shape_Origin_Identity,Shape_English_Identity,Shape_Standardized_Identity,Label_Origin_Identity,Label_English_Identity,Label_Standardized_Identity standardized as: NonPerson, Self, Close, Acquaintance, Celebrity, Stranger), RT_ms, and ACC. Demographic variables (e.g., gender, age, handedness) were also retained when available.
[Floder Structure]: The database is bifurcated into two primary folders: "Clean_Data" and "Raw_Data." The "Clean_Data" folder encompasses micpreprocessed data files, whereas the "Raw_Data" folder houses the original data files sourced from the articles. Within the Clean_Data folder, a JSON file has been added to document the paper's infromation, and a codebook has been included to provide a detailed account of the dataset's contents. Additionally, a codebook is present to meticulously log the data descriptions of the dataset.

Version 0.1.2 — 2025-06-16

New features/changes

[Data Filtering]: Performed initial data filtering using R, retaining behavioral variables required for calculating the Self-Prioritization Effect (SPE), including Matching, Shape/Face/Voice, Label, Identity (Shape_Identity standardized as: NonPerson, Self, Close, Acquaintance, Celebrity, Stranger), RT_ms, and ACC. Demographic variables (e.g., gender, age, handedness) were also retained when available.
[SPE Analysis]: Conducted exploratory analysis of SPE using Clean_Data, calculating sequential dependency effects and analyzing the impact of different Identity categories on RT and ACC.
[Visualization]: Visualized the distribution of SPE for each participant, providing a clear view of SPE performance across different Identity categories.

Bugs/glitches discovered after the release

[Insufficient Preprocessing]: Data filtering was performed rather than full preprocessing, which may lead to invalid values during data exploration (e.g., ACC values may include -1 for no response, 2 for incorrect key press). Users must perform their own preprocessing based on their analysis goals. Details of each article's Clean_Data are available in the Codebook within the Clean_Data folder.

Version 0.1.0 — 2025-05-16

New features/changes

[Data Structure Setup]: Established the initial data structure of the SPE database, including behavioral and demographic data.
[Data Integration]: Integrated raw data from multiple published articles, including behavioral variables (e.g., RT, ACC) and demographic variables.
[README File]: Provided a basic README file explaining the database structure and usage guidelines.

Bugs/glitches discovered after the release

[Inconsistent Variable Names]: Some raw data files contained inconsistent variable names, causing issues during data integration.
[Missing Demographic Variables]: Certain articles lacked demographic variables, resulting in incomplete metadata.

Unreleased

Planned

[Metadata in JSON Format]: Transition metadata storage from .md to .json format for each article, providing a more structured and machine-readable format.

Folder structure

root
│  .gitignore
│  README.md
│  Dataset_inf.xlsx  
├─1_Data 
│   └─ <Author>_<Year>_<Journal>
│       └─ <Author>_<Year>_<Journal>_<Exp-id>_Clean.csv
│       └─ <Author>_<Year>_<Journal>_<Exp-id>_raw_Subject.csv
│       └─ Codebook_<Author>_<Year>_<Journal>_<Exp-id>_Clean.xlsx
│       └─ <Author>_<Year>_<Journal>.json  # Including Meta data for each paper.
│       └─ <Author>_<Year>_<Journal>_<Exp-id>.json  # Including methodological information for the specific experiment.
│       └─ <Author>_<Year>_<Journal>_<Exp-id>_raw.csv
├─2_Code
│   └─ Clean_Data.Rproj
│   └─ Clean_Data.Rmd
│   └─ README.md
└─3_Reports
     │
     └─ README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SPE Database

Leading Team

Data contributors

Data Version

Version 0.1.4 — 2026-04-13

Version 0.1.3 — 2025-10-20

Version 0.1.2 — 2025-06-16

Version 0.1.0 — 2025-05-16

Unreleased

Folder structure

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 310 Commits
1_Data		1_Data
2_Code		2_Code
3_Reports		3_Reports
.gitattributes		.gitattributes
.gitignore		.gitignore
Dataset_inf.xlsx		Dataset_inf.xlsx
README.html		README.html
README.md		README.md
SPE_Database.Rproj		SPE_Database.Rproj

Folders and files

Latest commit

History

Repository files navigation

SPE Database

Leading Team

Data contributors

Data Version

Version 0.1.4 — 2026-04-13

Version 0.1.3 — 2025-10-20

Version 0.1.2 — 2025-06-16

Version 0.1.0 — 2025-05-16

Unreleased

Folder structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages