This post is my pre-registration of a study I will be running to continue the exploratory work I started here. Abstract In continuation of previous work, we test if a Large Language Model (LLM) is more likely to produce factually-incorrect answers if it has previously produced factually-incorrect answers.
Pre-registering a study
Pre-registering a study
Pre-registering a study
This post is my pre-registration of a study I will be running to continue the exploratory work I started here. Abstract In continuation of previous work, we test if a Large Language Model (LLM) is more likely to produce factually-incorrect answers if it has previously produced factually-incorrect answers.