
AI File Sorter


AI File Sorter is a powerful, cross-platform desktop application that automates file organization with the help of AI.

It helps tidy up cluttered folders like Downloads, external drives, or NAS storage by automatically categorizing files based on their names, extensions, directory context, taxonomy, and other heuristics for accuracy and consistency.

The app uses a taxonomy-based system: it maintains a consistent internal reference of category and subcategory names for your file types and naming patterns, so similar files receive the same labels across runs rather than slightly different ones each time.

The app intelligently assigns categories and optional subcategories, which you can review and adjust before confirming. Once approved, the necessary folders are created and your files are sorted automatically.

AI File Sorter runs local large language models (LLMs) such as LLaMa 3B and Mistral 7B, and does not require an internet connection unless you choose to use a remote model.

File content–based sorting for certain file types is also in development.


How It Works

  1. Point it at a folder or drive
  2. It runs a local LLM to analyze your files
  3. The LLM suggests categorizations
  4. You review and adjust if needed — done

Download ai-file-sorter




Changelog

[1.1.0] - 2025-11-08

  • New feature: Support for Vulkan. This means that many non-Nvidia graphics cards (GPUs) are now supported for compute acceleration during local LLM inference.
  • New feature: Toggle subcategories in the categorization review dialog.
  • New feature: Undo the recent file sort (move) action.
  • Fixes: Bug fixes and stability improvements.
  • Added a CTest-integrated test suite. Expanded test coverage.
  • Code optimization refactors.

[1.0.0] - 2025-10-30

  • Migrated the entire desktop UI from GTK/Glade to a native Qt6 interface.
  • Added selection boxes for files in the categorization review dialog.
  • Added an internationalization framework and the French translation for the user interface.
  • Added refreshed menu icons, mnemonic behaviour, and persistent File Explorer settings.
  • Simplified cross-platform builds (Linux/macOS) around Qt6; retired the MSYS2/GTK toolchain.
  • Optimized and cleaned up the code. Fixed error-prone areas.
  • Modernized the build pipeline. Introduced CMake for compilation on Windows.

[0.9.7] - 2025-10-19

  • Added paths to files in LLM requests for more context.
  • Added a taxonomy for more consistent assignment of categories across categorizations, narrowing down the number of distinct categories and subcategories.
  • Improved the readability of the categorization progress dialog box.
  • Improved the stability of CUDA detection and interaction.
  • Added more logging coverage throughout the code base.

[0.9.3] - 2025-09-22

  • Added compatibility with CUDA 13.

[0.9.2] - 2025-08-06

  • Bug fixes.
  • Increased code coverage with logging.

[0.9.1] - 2025-08-01

  • Bug fixes.
  • Minor improvements for stability.
  • Removed the deprecated GPU backend from the runtime build.

[0.9.0] - 2025-07-18

  • Local LLM support with llama.cpp.
  • LLM selection and download dialog.
  • Improved Makefile for a more hassle-free build and installation.
  • Minor bug fixes and improvements.

Features

  • AI-Powered Categorization: Classify files intelligently using either a local LLM (LLaMa, Mistral) or a remote LLM (ChatGPT), depending on your preference.
  • Offline-Friendly: Use a local LLM to categorize files entirely offline; no internet connection or API key required.
  • Robust Categorization Algorithm: Taxonomy and heuristics keep category assignments consistent across runs.
  • Customizable Sorting Rules: Automatically assign categories and subcategories for granular organization.
  • Qt6 Interface: Lightweight and responsive UI with refreshed menus and icons.
  • Cross-Platform Compatibility: Works on Windows, macOS, and Linux.
  • Local Database Caching: Speeds up repeated categorization and minimizes remote LLM usage costs.
  • Sorting Preview: See how files will be organized before confirming changes.
  • Secure API Key Encryption: When using the remote model, your API key is stored securely with encryption.
  • Update Notifications: Get notified about updates - with optional or required update flows.

Requirements

  • Operating System: Linux or macOS for source builds (Windows builds are provided as binaries; native Qt/MSVC build instructions are planned).
  • Compiler: A C++20-capable compiler (g++ or clang++).
  • Qt 6: Core, Gui, Widgets modules and the Qt resource compiler (qt6-base-dev / qt6-tools on Linux, brew install qt on macOS).
  • Libraries: curl, sqlite3, fmt, spdlog, and the prebuilt llama libraries shipped under app/lib/precompiled.
  • Optional GPU backends: A Vulkan 1.2+ runtime (preferred) or CUDA 12.x for NVIDIA cards. StartAiFileSorter.exe/run_aifilesorter.sh auto-detect the best available backend and fall back to CPU/OpenBLAS automatically, so CUDA is never required to run the app.
  • Git (optional): For cloning this repository. Archives can also be downloaded.
  • OpenAI API Key (optional): Required only when using the remote ChatGPT workflow.

Installation

File categorization with local LLMs is completely free of charge. If you prefer to use the ChatGPT workflow you will need an OpenAI API key with a small balance (see API Key, Obfuscation, and Encryption).

Linux

Prebuilt Debian/Ubuntu package

  1. Install runtime prerequisites (Qt6, networking, database, math libraries):
    sudo apt update && sudo apt install -y \
      libqt6widgets6 libcurl4 libjsoncpp25 libfmt9 libopenblas0-pthread
    Ensure that the Qt platform plugins are installed (on Ubuntu 22.04 this is provided by qt6-wayland). GPU acceleration additionally requires either a working Vulkan 1.2+ stack (Mesa, AMD/Intel/NVIDIA drivers) or, for NVIDIA users, the matching CUDA runtime (nvidia-cuda-toolkit or vendor packages). The launcher automatically prefers Vulkan when both are present and falls back to CPU if neither is available.
  2. Install the package
    sudo apt install ./aifilesorter_1.0.0_amd64.deb
    Using apt install (rather than dpkg -i) ensures any missing dependencies listed above are installed automatically.
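
If you plan to use GPU acceleration, you can sanity-check the driver stack before launching (a quick check; assumes vulkan-tools is installed and, for the CUDA path, the NVIDIA driver utilities):

    # Vulkan: should list your GPU and driver properties
    vulkaninfo
    # CUDA path (NVIDIA only): should report the driver and GPU
    nvidia-smi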

Build from source

  1. Install dependencies
    • Debian / Ubuntu:
      sudo apt update && sudo apt install -y \
        build-essential cmake git qt6-base-dev qt6-base-dev-tools qt6-tools-dev-tools \
        libcurl4-openssl-dev libjsoncpp-dev libsqlite3-dev libssl-dev libfmt-dev libspdlog-dev
    • Fedora / RHEL:
      sudo dnf install -y gcc-c++ cmake git qt6-qtbase-devel qt6-qttools-devel \
        libcurl-devel jsoncpp-devel sqlite-devel openssl-devel fmt-devel spdlog-devel
    • Arch / Manjaro:
      sudo pacman -S --needed base-devel git cmake qt6-base qt6-tools curl jsoncpp sqlite openssl fmt spdlog
      Optional GPU acceleration also requires either the distro Vulkan 1.2+ driver/runtime (Mesa, AMD, Intel, NVIDIA) or CUDA packages for NVIDIA cards. Install whichever stack you plan to use; the app will fall back to CPU automatically if none are detected.
  2. Clone the repository
    git clone https://github.com/hyperfield/ai-file-sorter.git
    cd ai-file-sorter
    git submodule update --init --recursive --remote

    Submodule tip: If you previously downloaded llama.cpp or Catch2 manually, remove or rename app/include/external/llama.cpp and external/Catch2 before running the git submodule command. Git needs those directories to be empty so it can populate them with the tracked submodules.
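
    For example, a minimal cleanup before re-running the submodule update (destructive; this deletes any local changes in those directories):
    rm -rf app/include/external/llama.cpp external/Catch2
    git submodule update --init --recursive --remote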

  3. Build the llama runtime variants (run once per backend you plan to ship/test)
    # CPU / OpenBLAS
    ./app/scripts/build_llama_linux.sh cuda=off vulkan=off
    # CUDA (optional; requires NVIDIA driver + CUDA toolkit)
    ./app/scripts/build_llama_linux.sh cuda=on vulkan=off
    # Vulkan (optional; requires a working Vulkan 1.2+ stack, e.g. mesa-vulkan-drivers + vulkan-tools)
    ./app/scripts/build_llama_linux.sh cuda=off vulkan=on
    Each invocation stages the corresponding llama/ggml libraries under app/lib/precompiled/<variant> and the runtime DLL/SO copies under app/lib/ggml/w<variant>. The script refuses to enable CUDA and Vulkan simultaneously, so run it separately for each backend. Shipping both directories lets the launcher pick Vulkan when available, then CUDA, and otherwise stay on CPU—no CUDA-only dependency remains.
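    As a quick sanity check after building (directory names as described above; only the variants you built will be present):
    ls app/lib/precompiled   # expect a subdirectory per variant (cpu, cuda, vulkan)
    ls app/lib/ggml          # runtime copies in the w<variant> directories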
  4. Compile the application
    cd app
    make -j4
    The binary is produced at app/bin/aifilesorter.
  5. Install system-wide (optional)
    sudo make install

macOS

  1. Install Xcode command-line tools (xcode-select --install).
  2. Install Homebrew (if required).
  3. Install dependencies
    brew install qt curl jsoncpp sqlite openssl fmt spdlog cmake git
    Add Qt to your environment if it is not already present:
    export PATH="$(brew --prefix)/opt/qt/bin:$PATH"
    export PKG_CONFIG_PATH="$(brew --prefix)/lib/pkgconfig:$(brew --prefix)/share/pkgconfig:$PKG_CONFIG_PATH"
  4. Clone the repository and submodules (same commands as Linux).
  5. Build the llama runtime (Metal-only on macOS)
    ./app/scripts/build_llama_macos.sh
    The macOS helper already produces the Metal-enabled variant the app needs, so no extra GPU-specific invocations are required on this platform.
  6. Compile the application
    cd app
    make -j4
    sudo make install   # optional

Windows

The Windows build now targets native MSVC + Qt6 without MSYS2. Two options are supported; the vcpkg route is the simplest.

Option A - CMake + vcpkg (recommended)

  1. Install prerequisites: Visual Studio 2022 with the Desktop C++ workload, CMake 3.21+, and vcpkg (the same tools listed under Option B; missing libraries are pulled in automatically via vcpkg).
  2. Clone repo and submodules:
    git clone https://github.com/hyperfield/ai-file-sorter.git
    cd ai-file-sorter
    git submodule update --init --recursive
  3. Determine your vcpkg root. It is the folder that contains vcpkg.exe (for example C:\dev\vcpkg).
    • If vcpkg is on your PATH, run this command to print the location:
      Split-Path -Parent (Get-Command vcpkg).Source
    • Otherwise use the directory where you cloned vcpkg.
  4. Build the bundled llama.cpp runtime variants (run from the same x64 Native Tools / VS 2022 Developer PowerShell shell). Invoke the script once per backend you need:

    # CPU / OpenBLAS only
    app\scripts\build_llama_windows.ps1 cuda=off vulkan=off vcpkgroot=C:\dev\vcpkg
    # CUDA (requires matching NVIDIA toolkit/driver)
    app\scripts\build_llama_windows.ps1 cuda=on vulkan=off vcpkgroot=C:\dev\vcpkg
    # Vulkan (requires LunarG Vulkan SDK or vendor Vulkan 1.2+ runtime)
    app\scripts\build_llama_windows.ps1 cuda=off vulkan=on vcpkgroot=C:\dev\vcpkg

Each run emits the appropriate `llama.dll` / `ggml*.dll` pair under `app\lib\precompiled\<cpu|cuda|vulkan>` and copies the runtime DLLs into `app\lib\ggml\w<variant>`. For Vulkan builds, install the latest LunarG Vulkan SDK (or the vendor's runtime), ensure `vulkaninfo` succeeds in the same shell, and then run the script. Supplying both Vulkan and (optionally) CUDA artifacts lets `StartAiFileSorter.exe` detect the best backend at launch—Vulkan is preferred, CUDA is used when Vulkan is missing, and CPU remains the fallback, so CUDA is not required.
  5. Build the Qt6 application using the helper script (still in the VS shell). The helper stages runtime DLLs via `windeployqt`, so `app\build-windows\Release` is immediately runnable:

    ```powershell
    # One-time per shell if script execution is blocked:
    Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass

    app\build_windows.ps1 -Configuration Release -VcpkgRoot C:\dev\vcpkg
    ```
  • Replace C:\dev\vcpkg with the path where you cloned vcpkg; it must contain scripts\buildsystems\vcpkg.cmake.
  • Always launch the app via StartAiFileSorter.exe. This small bootstrapper configures the GGML/CUDA/Vulkan DLLs, auto-selects Vulkan → CUDA → CPU at runtime, and sets the environment before spawning aifilesorter.exe. Launching aifilesorter.exe directly now shows a reminder dialog; developers can bypass it (for debugging) by adding --allow-direct-launch when invoking the GUI manually.
  • -VcpkgRoot is optional if VCPKG_ROOT/VPKG_ROOT is set or vcpkg/vpkg is on PATH.
  • The executable and required Qt/third-party DLLs are placed in app\build-windows\Release. Pass -SkipDeploy if you only want the binaries without bundling runtime DLLs.
  • Pass -Parallel <N> to override the default “all cores” parallel build behaviour (for example, -Parallel 8). By default the script invokes cmake --build … --parallel <core-count> and ctest -j <core-count> to keep both MSBuild and Ninja fully utilized.

Option B - CMake + Qt online installer

  1. Install prerequisites:
    • Visual Studio 2022 with Desktop C++ workload
    • Qt 6.x MSVC kit via Qt Online Installer (e.g., Qt 6.6+ with MSVC 2019/2022)
    • CMake 3.21+
    • vcpkg (for non-Qt libs): curl, jsoncpp, sqlite3, openssl, fmt, spdlog, gettext
  2. Build the bundled llama.cpp runtime (same VS shell). Any missing OpenBLAS/cURL packages are installed automatically via vcpkg:
    pwsh .\app\scripts\build_llama_windows.ps1 [cuda=on|off] [vulkan=on|off] [vcpkgroot=C:\dev\vcpkg]
    This is required before configuring the GUI because the build links against the produced llama static libraries/DLLs.
  3. Configure CMake to see Qt (adapt CMAKE_PREFIX_PATH to your Qt install):
    $env:VCPKG_ROOT = "C:\path\to\vcpkg"   # e.g., C:\dev\vcpkg
    $qt = "C:\Qt\6.6.3\msvc2019_64"         # example
    cmake -S . -B build -G "Ninja" `
      -DCMAKE_PREFIX_PATH=$qt `
      -DCMAKE_TOOLCHAIN_FILE=$env:VCPKG_ROOT\scripts\buildsystems\vcpkg.cmake `
      -DVCPKG_TARGET_TRIPLET=x64-windows
    cmake --build build --config Release

Notes

  • To rebuild from scratch, run .\app\build_windows.ps1 -Clean. The script removes the local app\build-windows directory before configuring.
  • Runtime DLLs are copied automatically via windeployqt after each successful build; skip this step with -SkipDeploy if you manage deployment yourself.
  • If Visual Studio sets VCPKG_ROOT to its bundled copy under Program Files, clone vcpkg to a writable directory (for example C:\dev\vcpkg) and pass vcpkgroot=<path> when running build_llama_windows.ps1.
  • If you plan to ship CUDA or Vulkan acceleration, run the build_llama_* helper for each backend you intend to include before configuring CMake so the libraries exist. The runtime can carry both and auto-select at launch, so CUDA remains optional.

Running tests

Catch2-based unit tests are optional. Enable them via CMake:

cmake -S app -B build-tests -DAI_FILE_SORTER_BUILD_TESTS=ON
cmake --build build-tests
ctest --test-dir build-tests

On Windows you can pass -BuildTests (and -RunTests to execute ctest) to app\build_windows.ps1:

app\build_windows.ps1 -Configuration Release -BuildTests -RunTests

The current suite (under tests/unit) focuses on core utilities; expand it as new functionality gains coverage.
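
When iterating on one area, ctest's standard filters work here too (a sketch; the actual test names depend on the suite):

ctest --test-dir build-tests --output-on-failure
ctest --test-dir build-tests -R Utils   # hypothetical name filter; substitute a real test name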

Selecting a backend at runtime

Both the Linux launcher (app/bin/run_aifilesorter.sh / aifilesorter-bin) and the Windows starter accept the following optional flags:

  • --cuda={on|off} – force-enable or disable the CUDA backend.
  • --vulkan={on|off} – force-enable or disable the Vulkan backend.

When no flags are provided, the app auto-detects available runtimes in priority order (Vulkan → CUDA → CPU). Use the flags to skip a backend (--cuda=off forces Vulkan/CPU even if CUDA is installed; --vulkan=off tests CUDA explicitly) or to validate a newly installed stack (--vulkan=on). Setting both flags to on is rejected, and if neither GPU backend is detected the app automatically stays on CPU.
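
For example (shown with the Linux launcher; the Windows starter accepts the same flags):

./run_aifilesorter.sh --cuda=off --vulkan=off   # force CPU-only inference
./run_aifilesorter.sh --vulkan=on               # verify a freshly installed Vulkan stack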


Uninstallation

  • Linux: cd app && sudo make uninstall
  • macOS: cd app && sudo make uninstall

The command removes the executable and the staged precompiled libraries. You can also delete cached local LLM models in ~/.local/share/aifilesorter/llms (Linux) or ~/Library/Application Support/aifilesorter/llms (macOS) if you no longer need them.
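
For example (paths as above; this permanently deletes the downloaded models):

rm -rf ~/.local/share/aifilesorter/llms                      # Linux
rm -rf ~/Library/Application\ Support/aifilesorter/llms      # macOS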


API Key, Obfuscation, and Encryption

Important: This step is needed only if you are going to use the Remote LLM option.

Before compiling the app:

  1. Get an OpenAI API key from the OpenAI website.
    A minimal balance is required in your OpenAI API account for the app to function.

  2. Generate a 32-character random secret key, e.g., with a random-string generator or the example command shown below.

    Important: If you're compiling on Windows, make sure the generated key contains no = characters; if it does, regenerate it. On Windows it is also best to avoid symbols altogether, as they can cause unpredictable parsing issues.

    Your secret key could look something like sVPV2fWoRg5q62AuCGVQ4p0NbHIU5DEv or du)]--Wg#+Au89Ro6eRMJc"]qx~owL_X.
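
    One way to generate a 32-character key with no symbols (assumes OpenSSL is available on your system):
    openssl rand -hex 16   # 16 random bytes encoded as 32 hex characters, alphanumeric only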

  3. Navigate to the api-key-encryption folder, then make a file named encryption.ini with the following content:

    LLM_API_KEY=sk-...
    SECRET_KEY=your-generated-32-byte-secret-key
  4. Run the compile.sh (or compile_mac.sh) script in the same directory to generate the executable obfuscate_encrypt.

  5. Execute obfuscate_encrypt to generate:

    • Obfuscated Key part 1
    • Obfuscated Key part 2
    • Encrypted data (hex)
  6. Update the application files:

    • Update app/include/CryptoManager.hpp with Obfuscated Key part 1:

      static constexpr char embedded_pc[] = "insert-obfuscated-Key-part-1-here";
    • Add the values to app/resources/.env as shown:

      ENV_PC=obfuscated-key-part2-value
      ENV_RR=encrypted-data-hex-value
  7. Continue with Installation




How to Use

  1. Launch the application (see the last step in Installation according to your OS).
  2. Select a directory to analyze.
  3. Tick the checkboxes on the main window according to your preferences.
  4. Click the "Analyze" button. The app will scan each file and/or directory based on your selected options.
  5. A review dialog will appear. Verify the assigned categories (and subcategories, if enabled in step 3).
  6. Click "Confirm & Sort!" to move the files, or "Continue Later" to postpone. You can always resume where you left off since categorization results are saved.

Sorting a Remote Directory (e.g., NAS)

Follow the steps in How to Use, but modify step 2 as follows:

  • Windows: Assign a drive letter (e.g., Z: or X:) to your network share (in File Explorer, use "Map network drive").

  • Linux & macOS: Mount the network share to a local folder using a command like:

    sudo mount -t cifs //192.168.1.100/shared_folder /mnt/nas -o username=myuser,password=mypass,uid=$(id -u),gid=$(id -g)

(Replace 192.168.1.100/shared_folder with your actual network location path and adjust options as needed.)


Contributing

  • Fork the repository and submit pull requests.
  • Report issues or suggest features on the GitHub issue tracker.
  • Follow the existing code style and documentation format.

Credits

License

This project is licensed under the GNU AFFERO GENERAL PUBLIC LICENSE (GNU AGPL). See the LICENSE file for details.


Donation

Support the development of AI File Sorter and its future features. Every contribution counts!

  • Donate via PayPal
  • Bitcoin: 12H8VvRG9PGyHoBzbYxVGcu8PaLL6pc3NM
  • Ethereum: 0x09c6918160e2AA2b57BfD40BCF2A4BD61B38B2F9
  • Tron: TGPr8b5RxC5JEaZXkzeGVxq7hExEAi7Yaj

USDT is also accepted on the Ethereum and Tron chains.