# MedLink

MedLink is a web application designed to support clinical decision-making by retrieving and ranking relevant clinical case reports based on physician-provided medical reports. Built with Plotly's Dash in Python, the app utilizes advanced NLP models to help healthcare professionals find similar cases, aiding in complex diagnostic scenarios.
- Overview
- Features
- Methodology
- Installation
- Usage
- File/Directory Explanations
- Support
- Acknowledgments
## Overview
In healthcare, physician-patient interactions are often documented as free-text medical reports, which may be challenging to reference in complex diagnostic cases. MedLink addresses this by retrieving and ranking relevant clinical case reports from health conferences and journals, offering valuable insights based on similar past cases. This tool utilizes advanced NLP techniques, specifically two BERT models, to:
- Retrieve similar cases (using a bi-encoder) and,
- Re-rank the retrieved cases based on relevance (using a cross-encoder).
MedLink’s effectiveness was evaluated by a physician; the ranking model achieved an NDCG@10 of 0.747.
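For reference, NDCG@10 measures how well the top-10 ranking matches the ideal ordering of graded relevance judgments. The sketch below shows a standard computation in plain Python (this is the generic metric definition, not MedLink's evaluation code):

```python
import math

def dcg(relevances):
    """Discounted cumulative gain: each relevance is discounted by log2 of its rank."""
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances))

def ndcg_at_k(relevances, k=10):
    """NDCG@k: DCG of the predicted ranking, normalized by the ideal DCG."""
    rels = relevances[:k]
    ideal = sorted(relevances, reverse=True)[:k]
    idcg = dcg(ideal)
    return dcg(rels) / idcg if idcg > 0 else 0.0
```

A perfectly ordered ranking yields 1.0; placing relevant reports lower in the list lowers the score.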
## Features

- Medical Report Search: Input a medical report to retrieve similar published case reports.
- Ranking and Scoring: Case reports are ranked by relevance using a cross-encoder model.
- Clinical Entity Visualization: Key entities (e.g., symptoms, diagnoses) are highlighted using Named Entity Recognition (NER).
- Textual Explanations: Provides context and explanations to facilitate comparison of case reports.
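To illustrate the entity-visualization idea, here is a deliberately simplified highlighter that wraps recognized terms in `<mark>` tags. The term list and function are illustrative only; MedLink uses a learned NER model rather than a fixed vocabulary:

```python
import re

# Illustrative vocabulary; the real app derives entities from an NER model.
CLINICAL_TERMS = {"fever", "dyspnea", "pneumonia"}

def highlight_entities(text, terms=CLINICAL_TERMS):
    """Wrap each recognized clinical term in <mark> tags for display."""
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, terms)) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(lambda m: f"<mark>{m.group(0)}</mark>", text)
```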
## Methodology

1. Bi-Encoder Retrieval: The retrieval process begins with the bi-encoder model, which independently encodes the set of medical case reports and the input medical report.
2. Top-10 Candidates: Based on the bi-encoder's similarity scores, the top-10 most similar case reports are retrieved.
3. Re-Ranking with Cross-Encoder: The cross-encoder model takes the medical report and each of the top-10 retrieved case reports as input. It computes a ranking score for each pair, reflecting the relevance of each case report to the medical report.
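The two-stage structure above can be sketched as follows. The toy `embed` and `cross_score` functions below are placeholders standing in for the trained BERT bi-encoder and cross-encoder; only the retrieve-then-rerank control flow mirrors the actual pipeline:

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for the bi-encoder: a bag-of-words vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cross_score(query, doc):
    """Toy stand-in for the cross-encoder: scores the pair jointly (token overlap)."""
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / max(len(q | d), 1)

def retrieve_and_rerank(query, corpus, k=10):
    q_vec = embed(query)
    # Stage 1: bi-encoder retrieval — encode independently, rank by similarity.
    top_k = sorted(corpus, key=lambda d: cosine(q_vec, embed(d)), reverse=True)[:k]
    # Stage 2: cross-encoder re-ranking of the top-k candidates.
    return sorted(top_k, key=lambda d: cross_score(query, d), reverse=True)
```

In the real system, case-report embeddings are computed once offline, so only the query must be encoded at search time; the more expensive cross-encoder then runs on just the 10 candidates.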
Note: The models are available on HuggingFace and the dataset for the Dash app is available in this repository. Therefore, it is not necessary to run the data extraction and model training scripts to run the MedLink application.
## Installation

- Python 3.7+
- Virtual Environment Tool: You can use Python's built-in `venv` module or a tool like `virtualenv`.
1. Clone the repository and navigate to the project directory:

   ```shell
   git clone https://github.com/LIAAD/MedLink.git
   cd medlink
   ```

2. Create a virtual environment in the project directory:

   ```shell
   python3 -m venv venv
   ```

3. Activate the virtual environment:

   - On Windows:

     ```shell
     venv\Scripts\activate
     ```

   - On macOS and Linux:

     ```shell
     source venv/bin/activate
     ```

4. Install the required dependencies:

   ```shell
   pip install -r requirements.txt
   ```
## Usage

Run the application:

```shell
python app.py
```

By default, Dash serves the app locally at http://127.0.0.1:8050/.
## File/Directory Explanations

The MedLink repository is divided into three folders, each containing the essential components to replicate and extend the proposed application.

- `spmi_dataset/`: Contains the scripts for extracting and cleaning the dataset used in this project, extracted from SPMI Case Reports, as well as the dataset itself. It also contains the sample dataset used for evaluation.
- `IR/`: Contains the scripts required to reproduce the retrieval and re-ranker models.
- `dash_app/`: Contains the MedLink application and the files necessary to run it.
  - `pages/`: Contains the files for the application pages.
  - `assets/`: Contains the CSS and images for the application.
- `requirements.txt`: Lists all dependencies needed to run the project. Install these with `pip install -r requirements.txt`.
- `README.md`: Documentation file (this file), providing an overview, installation instructions, and usage guidelines.
## Support

For questions, please contact the authors.
## Acknowledgments

This work is co-financed by Component 5 - Capitalization and Business Innovation, integrated in the Resilience Dimension of the Recovery and Resilience Plan within the scope of the Recovery and Resilience Mechanism (MRR) of the European Union (EU), framed in the Next Generation EU, for the period 2021-2026, within project HfPT, with reference 41. The authors also acknowledge project StorySense, with reference 2022.09312.PTDC (DOI 10.54499/2022.09312.PTDC), and the Advanced Computing Project CPCA-IAC/AV/594794/2023 (https://doi.org/10.54499/CPCA-IAC/AV/594794/2023). The authors also thank Dr. Luís Monteiro Dias for his contribution in elaborating the queries (medical reports) and the Sociedade Portuguesa de Medicina Interna (SPMI) for allowing access to the case reports used in this work. Luís Filipe Cunha thanks the Fundação para a Ciência e Tecnologia (FCT), Portugal, for the Ph.D. Grant (2024.04202.BD).