koncewicz2019ocenasilyswiadectw
/data/papers/koncewicz2019ocenasilyswiadectw/REPORT.md
# Extraction Report: Koncewicz 2019

## Source Crosswalk

- Thesis PDF: `pdf/paper.pdf`
- Text extraction: `out/text.txt`, `out/fulltext.md`
- Raw scenario datasets: `data/*.csv`
- Shared demographic/end files: `data/demo/625672.csv`, `data/end/255249.csv`
- Manual numeric analysis: `data/raport_natka.rmd`
- Reproducible computation: `analysis/effect_sizes.qmd`

The thesis is treated as the source of truth for scenario wording, procedure, scale wording,
sample description, and moderator coding. The raw CSV exports and report Rmd are treated
as the source of truth for group descriptives and effect sizes.

## Extraction Decisions

- Included single-scenario effects: five between-subject high-vs-low contrasts for
  `przechodzien`, `znaki`, `policjant`, `dziecko`, and `pijany`.
- Included paired effects: five within-subject high-vs-low contrasts for the same evidence
  sources in the two-scenario presentation.
- Effect sign convention: `d = mean(low) - mean(high)`.
- Between-subject effects: SMD via `esc::esc_mean_sd`.
- Within-subject effects: SMCRP via `metafor::escalc(measure = "SMCRP", correct = FALSE)`,
  using the within-person correlation computed from the raw paired responses.

## Issues For Review

### HUMAN_CHECK_KONCEWICZ_EFFECT_UNIT

The extraction includes both the single-scenario between-subject contrasts and the
two-scenario paired contrasts. These are different presentation conditions and may or may not
both belong in the same meta-analytic model.

Required human answer: confirm whether to include both sets, only single-scenario effects,
or only paired/two-scenario effects.

### HUMAN_CHECK_KONCEWICZ_SAMPLE_N

The thesis reports 753 participants and no exclusions. The copied raw scenario files produce
effect-level Ns that are used directly in `analysis/effect_sizes.qmd`; these scenario-level Ns
do not sum to 753 because the files represent scenario/presentation records from the combined
survey structure.

Required human answer: confirm that thesis-level `N = 753` should remain at study level.

### HUMAN_CHECK_KONCEWICZ_PIJANY_PAIRED_COLUMNS

The original Rmd notes that the paired drunk-passerby dataset uses `PrzechodzenOba` column
names for the `pijany_oba` file and states that this is intentional despite a naming mistake.
The QMD follows that original Rmd behavior.

Required human answer: confirm that `823783.csv` with `PrzechodzenOba.SA001/SA002`
is indeed the paired drunk-passerby source.

## Validation Notes

The YAML uses thesis pages for stimuli and question-wording provenance. Dataset-derived
groups use `table_ref` values pointing to the report Rmd and reproducible QMD.