/data/papers/koncewicz2019ocenasilyswiadectw/REPORT.mdExtraction Report: Koncewicz 2019
Source Crosswalk
- Thesis PDF:
pdf/paper.pdf - Text extraction:
out/text.txt,out/fulltext.md - Raw scenario datasets:
data/*.csv - Shared demographic/end files:
data/demo/625672.csv,data/end/255249.csv - Manual numeric analysis:
data/raport_natka.rmd - Reproducible computation:
analysis/effect_sizes.qmd
The thesis is treated as the source of truth for scenario wording, procedure, scale wording, sample description, and moderator coding. The raw CSV exports and report Rmd are treated as the source of truth for group descriptives and effect sizes.
Extraction Decisions
- Included single-scenario effects: five between-subject high-vs-low contrasts for
przechodzien,znaki,policjant,dziecko, andpijany. - Included paired effects: five within-subject high-vs-low contrasts for the same evidence sources in the two-scenario presentation.
- Effect sign convention:
d = mean(low) - mean(high). - Between-subject effects: SMD via
esc::esc_mean_sd. - Within-subject effects: SMCRP via
metafor::escalc(measure = "SMCRP", correct = FALSE), using the within-person correlation computed from the raw paired responses.
Issues For Review
HUMAN_CHECK_KONCEWICZ_EFFECT_UNIT
The extraction includes both the single-scenario between-subject contrasts and the two-scenario paired contrasts. These are different presentation conditions and may or may not both belong in the same meta-analytic model.
Required human answer: confirm whether to include both sets, only single-scenario effects, or only paired/two-scenario effects.
HUMAN_CHECK_KONCEWICZ_SAMPLE_N
The thesis reports 753 participants and no exclusions. The copied raw scenario files produce
effect-level Ns that are used directly in analysis/effect_sizes.qmd; these scenario-level Ns
do not sum to 753 because the files represent scenario/presentation records from the combined
survey structure.
Required human answer: confirm that thesis-level N = 753 should remain at study level.
HUMAN_CHECK_KONCEWICZ_PIJANY_PAIRED_COLUMNS
The original Rmd notes that the paired drunk-passerby dataset uses PrzechodzenOba column
names for the pijany_oba file and states that this is intentional despite a naming mistake.
The QMD follows that original Rmd behavior.
Required human answer: confirm that 823783.csv with PrzechodzenOba.SA001/SA002
is indeed the paired drunk-passerby source.
Validation Notes
The YAML uses thesis pages for stimuli and question-wording provenance. Dataset-derived
groups use table_ref values pointing to the report Rmd and reproducible QMD.