Skip to content
/ BIRD Public

Official Repository for ICLR 2025 (Oral) BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Notifications You must be signed in to change notification settings

CogComp/BIRD

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

This is the repository of dataset and source code for "BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models".

Installation

Setup the environment by first downloading this repository and then running:

pip install -r requirements.txt

Data

The datasets evaluated in this paper are available in the data/ directory:

  1. probabilistic estimation: common2sense_human_annotation.csv (for evaluation) and common2sense_human_annotation.json ( We provide this in the same format as a decision-making dataset to facilitate easier inference).
  2. decision making: common2sense.json, plasma.json and today.json. Each JSON dataset contains the following columns:
    • scenario
    • statement
    • opposite_statement
    • additional_sentence_label (indicates which statement each additional condition supports)
    • In common2sense.json, the additional conditions are provided as added_information and oppo_added_information.
    • In plasma.json and today.json, the additional conditions are listed under additional_sentences.

Run

Configure files for running the pipeline are in the scripts/ directory:

  1. To run the entire BIRD pipeline:
bash scripts/run_bird.sh
  1. To run the baselines:
bash scripts/baseline.sh
  1. To run the evaluation:
bash scripts/eval.sh

Citation and acknowledgement

If you find the project helpful, please cite:

@inproceedings{
feng2025bird,
title={{BIRD}: A Trustworthy Bayesian Inference Framework for Large Language Models},
author={Yu Feng and Ben Zhou and Weidong Lin and Dan Roth},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=fAAaT826Vv}
}

About

Official Repository for ICLR 2025 (Oral) BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published