BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

This is the repository of dataset and source code for "BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models".

Installation

Setup the environment by first downloading this repository and then running:

pip install -r requirements.txt

Data

The datasets evaluated in this paper are available in the data/ directory:

probabilistic estimation: common2sense_human_annotation.csv (for evaluation) and common2sense_human_annotation.json ( We provide this in the same format as a decision-making dataset to facilitate easier inference).
decision making: common2sense.json, plasma.json and today.json. Each JSON dataset contains the following columns:
- scenario
- statement
- opposite_statement
- additional_sentence_label (indicates which statement each additional condition supports)
- In common2sense.json, the additional conditions are provided as added_information and oppo_added_information.
- In plasma.json and today.json, the additional conditions are listed under additional_sentences.

Run

Configure files for running the pipeline are in the scripts/ directory:

To run the entire BIRD pipeline:

bash scripts/run_bird.sh

To run the baselines:

bash scripts/baseline.sh

To run the evaluation:

bash scripts/eval.sh

Citation and acknowledgement

If you find the project helpful, please cite:

@inproceedings{
feng2025bird,
title={{BIRD}: A Trustworthy Bayesian Inference Framework for Large Language Models},
author={Yu Feng and Ben Zhou and Weidong Lin and Dan Roth},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=fAAaT826Vv}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
code		code
data		data
scripts		scripts
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Installation

Data

Run

Citation and acknowledgement

About

Releases

Packages

Languages

CogComp/BIRD

Folders and files

Latest commit

History

Repository files navigation

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Installation

Data

Run

Citation and acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages