Experiments

Pre-registered field experiments run by our team and community based on community-submitted hypotheses. OpenExperiments is committed to open science.

Submit an Experiment

All Experiments

ObservationalCompleted

Reddit ChangeMyView

Logistic regression on 4,000 CMV posts. Features: acknowledgment presence (LLM-annotated), argument length, citation count, response position. Covariates: topic, time of day, author karma. Bonferroni-corrected across 4 refinement steps.

System

p=0.0003

A/B TestCompleted

Fortune 500 Consumer Brand

Online A/B experiment. 50/50 traffic split between business-as-usual control and challenger implementing the hypothesized design. Primary metric: newsletter sign-ups. Two-proportion z-test.

System

p=0.000001

ObservationalCompleted

Reddit ChangeMyView

Within-thread ranking analysis. Computed response position for each reply. Logistic regression predicting delta-award with position rank, controlling for length, time-of-day, and author history. Bonferroni correction over 4 tests.

System

p=0.001

ObservationalCompleted

LaMem

Paired comparison of visually similar images. LLM-annotated features: face presence, context congruence, scene category. Binomial test on preference within face-present pairs, stratified by congruence. Bonferroni correction over 3 tests.

System

p=0.004

ObservationalCompleted

Reddit ChangeMyView

LLM-annotated framing type (binary vs. spectrum) for 2,500 counterarguments. Logistic regression on opinion change controlling for length, acknowledgment, and citations.

System

p=0.002

ObservationalCompleted

Reddit ChangeMyView

Interaction analysis between accuracy-motivating elements (citations, data references) and identity-relevant topic classification. Two-way logistic regression with interaction term, controlling for length and position.

System

p=0.0008

ObservationalCompleted

LaMem

Color palette extraction using k-means clustering on pixel values. Warm/cool classification based on dominant hue. Paired comparison within visually similar pairs, controlling for scene category and complexity.

System

p=0.032

ObservationalCompleted

Reddit ChangeMyView

Narrative structure annotation via LLM (personal story vs. abstract argument). Logistic regression controlling for length, position, citations, and acknowledgment.

System

p=0.018