The GRACE project aims to process a given dataset into a graph structure where nodes represent dataset features and edges represent possible feature interactions. We use this graph to constrain an XGBoost model, with two primary objectives:
- Improve ML Performance: By providing the model with domain-informed or empirically discovered feature interactions, we can guide it towards better performance.
- Enhance Explainability & Reduce Complexity: By simplifying the graph structure to a minimal set of nodes and edges, we create a more interpretable and less complex model.
The workflow is as follows:
- Initial Graph Creation: An initial knowledge graph is created. This can be done manually, through an automated agent (`create_kg.py`), or by loading a pre-existing graph. The initial graph is based on feature importance (SHAP-IQ) and known biological/domain mechanisms.
- Graph Optimization: The core of the project is in `graph_reduction.py`. We use a multi-objective optimization process with Optuna to iteratively refine the graph. The optimization seeks a Pareto front of graphs that are optimal in both predictive performance (e.g., AUC or accuracy) and simplicity (number of nodes and edges).
- Constrained Model Training: The optimized graph structure is used to generate `interaction_constraints` for an `XGBoost` classifier. This forces the model to consider only interactions between features connected by an edge in the graph.
- Evaluation: The final constrained model is evaluated on a test set to measure its performance.
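The constraint-generation step can be sketched in plain Python. The helper name `edges_to_constraints` is hypothetical (not the project's actual function): the idea is simply that each graph edge becomes a group of feature indices that XGBoost's `interaction_constraints` parameter allows to interact.

```python
def edges_to_constraints(feature_names, edges):
    """Turn graph edges into XGBoost-style interaction constraints:
    each edge maps to a pair of feature indices allowed to interact."""
    index = {name: i for i, name in enumerate(feature_names)}
    return [[index[u], index[v]] for u, v in edges]

features = ["age", "bmi", "heart_rate", "glucose"]
edges = [("age", "bmi"), ("heart_rate", "glucose")]
constraints = edges_to_constraints(features, edges)
# constraints == [[0, 1], [2, 3]]; pass this as
# XGBClassifier(interaction_constraints=constraints)
```

With no edge between, say, `age` and `glucose`, no tree in the ensemble may split on both features along the same path.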
- Python 3.10+
- A virtual environment (e.g., `venv` or `conda`) is highly recommended.
Clone the repository to your local machine:

```bash
git clone <repository-url>
cd GRACE
```

Create and activate a virtual environment. For example, with `venv`:

```bash
python -m venv venv
source venv/bin/activate
```

Install the required dependencies:

```bash
pip install -r requirements.txt
```

The project requires API keys for an LLM provider (like OpenAI) for the agent-based graph creation.
- Create a `.env` file in the root of the project directory:

  ```
  OPENAI_API_KEY="your-api-key-here"
  ```

- Edit the `params.py` file to configure the project:
  - Set `DATASET_NAME` to either `"mimic"` or `"adni"`.
  - Set `LLM_PROVIDER` to your desired provider (e.g., `"openai"`).
  - Set
Execute the main script from the root directory:

```bash
python main.py
```

The script will load the data, run the graph optimization process, train the final model, and save the results and visualizations in the `images/` and `models/` directories.
For advanced users and domain experts, we provide an interactive web interface for manual graph editing:
```bash
python run_interactive_kg.py
```

This launches a Streamlit app where you can:
- 🎯 Visualize optimized knowledge graphs interactively
- ✏️ Edit graphs by adding/removing nodes and edges
- 🔒 Lock critical edges to preserve domain knowledge
- 🔄 Re-optimize graphs with your constraints
- 📊 Monitor performance metrics in real-time
- 💾 Export modified graphs for further analysis
Perfect for clinicians and researchers who want to inject domain expertise into the automated optimization process. See INTERACTIVE_KG_README.md for detailed instructions.
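The edge-locking behavior can be illustrated with a small plain-Python sketch (`prune_edges` is a hypothetical helper, not the app's actual code): during re-optimization, any unlocked edge may be dropped, but locked edges always survive.

```python
def prune_edges(edges, locked, keep):
    """Keep an edge if it is locked (user-preserved domain knowledge),
    or if the optimizer's keep() predicate decides to retain it."""
    return [e for e in edges if e in locked or keep(e)]

edges = [("age", "bmi"), ("bmi", "glucose"), ("age", "glucose")]
locked = {("bmi", "glucose")}  # clinician-locked edge
pruned = prune_edges(edges, locked, keep=lambda e: "glucose" not in e)
# The locked edge survives even though the predicate would drop it:
# pruned == [("age", "bmi"), ("bmi", "glucose")]
```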
GRACE/
├── datasets/ # Raw CSV datasets
├── kg/ # Knowledge Graphs (GraphML) and agent outputs
├── models/ # Saved trained model files
├── images/ # Saved plots and visualizations
├── main.py # Main script to run the full pipeline
├── graph_reduction.py # Core logic for graph optimization using Optuna
├── create_kg.py # Script for agent-based initial KG creation
├── visualizations.py # Functions for plotting results
├── utils.py # Utility functions for graph manipulation
├── params.py # All user-configurable parameters
└── README.md # This file