On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions

A web visualization

This is an accompanying website for the paper with the same title. Below we provide experiments you can run to gain better intuition for the experiments and results in the paper.

1. Prompting is ineffective at debiasing

In the paper, we show that prompting fails to debias models in both Admissions and Hiring. In some cases, it may even worsen the bias or reduces the acceptance to nearly zero.

In this visualization, you will engineer a prompt to debias models. The debiasing prompt will be appended to the baseline prompt (before "Answer:"), which is a template for the task and model. We also provide the debiasing prompts used in the paper to get you started.

Select the task and model to start debiasing!

DatasetModel

Below is the baseline prompt template for the selected dataset and model. We use it to get models' decisions.

Prompt

Load default debiasing promptUse a custom prompt:

If you already ran an experiment, retrieve results by run ID: