Datasets
Curated Hugging Face datasets powering our hypothesis testing and experiments. Each dataset comes with a clear task description, schema, and links to related problem statements.
Reddit ChangeMyView
You are a research scientist generating content-based hypotheses about whether a counterargument successfully changes the original poster's opinion in online debates. Analyze linguistic features, argumentation strategies, and persuasive elements that predict opinion change.
LaMem
You are a research scientist generating content-based hypotheses about what makes an image memorable. Analyze visual features, composition, color, semantic content, and perceptual properties that contribute to memorability.
Twitter Persuasion Pairs
You are a research scientist studying persuasive communication in short-form social media. Analyze how brevity, emotional tone, and social proof affect persuasion effectiveness in tweets.
Visual Attention & Memorability
You are a research scientist studying what captures and holds visual attention. Analyze image properties that predict where people look and what they remember from natural scenes.
Have a dataset to suggest?
Our team reviews datasets and problem statements weekly. To suggest a new dataset, open a pull request on GitHub that includes the dataset's Hugging Face link, a task description, the column names, and the target column.
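Before opening the pull request, it may help to check that a suggestion has all four requested items. The following is a minimal sketch, not the project's actual submission format: the `DatasetSuggestion` class, its field names, the `validate` helper, and the example URL and column names are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List
from urllib.parse import urlparse


@dataclass
class DatasetSuggestion:
    """Hypothetical record of the four items a suggestion should include."""
    hf_link: str             # link to the dataset on Hugging Face
    task_description: str    # what the hypotheses should be about
    column_names: List[str]  # every column in the dataset
    target_column: str       # the column to be predicted


def validate(suggestion: DatasetSuggestion) -> List[str]:
    """Return a list of problems; an empty list means nothing obvious is missing."""
    problems = []
    parsed = urlparse(suggestion.hf_link)
    if parsed.scheme != "https" or parsed.netloc != "huggingface.co":
        problems.append("hf_link should be an https://huggingface.co/ URL")
    if not suggestion.task_description.strip():
        problems.append("task_description is empty")
    if not suggestion.column_names:
        problems.append("column_names is empty")
    if suggestion.target_column not in suggestion.column_names:
        problems.append("target_column must be one of column_names")
    return problems


# Placeholder values only; this is not a real dataset.
example = DatasetSuggestion(
    hf_link="https://huggingface.co/datasets/example-org/example-dataset",
    task_description="Predict whether a reply changes the original poster's opinion.",
    column_names=["post", "reply", "opinion_changed"],
    target_column="opinion_changed",
)

for problem in validate(example):
    print(problem)
```

A check like this only catches missing or inconsistent fields. Whether the dataset fits the project is still decided in the weekly review.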