Yeti is a framework for building agentic AI applications with support for open-source large language models, tool calling, and modular extensions.
Yeti leverages Mistral-Nemo, providing compatibility with the OpenAI API specification without requiring an API subscription. This approach enables:
- Seamless use of open-source models.
- Future capability to train, fine-tune, or update models.
- Flexible model swapping (similar to `LoRA` adapters, but for open-source models).
- Greater control and ownership over intelligence, avoiding proprietary paywalls.
Why Mistral-Nemo?
- Handles meaningful conversations effectively.
- Supports tool and function calling for agentic AI development.
- Fully open-source and powerful.
- Compatible with the OpenAI API, zero-shot and `ReAct`-based flows, and `LangGraph`'s tool-calling framework.
- Can run quantized versions in limited GPU environments.
Out-of-the-box support includes the following (a sample tool definition is sketched after this list):
- Fetching weather for a given city.
- Getting the current date and time.
- Fetching exchange rates (via private API).
- Searching and summarizing results from the internet.
- Text embeddings and vector database for overcoming context limits.
- Session and thread IDs for topic-based conversation classification.
- Integrated search backend for browsing the internet.
- Voice controls and conversational interaction (low priority).
- Image analysis (low priority).
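For illustration, here is a minimal sketch of how a tool such as the date/time helper could be declared with LangChain's `@tool` decorator so the model can call it. The function name and body are hypothetical, not the project's actual implementation:

```python
from datetime import datetime, timezone

from langchain_core.tools import tool


@tool
def get_current_datetime() -> str:
    """Return the current date and time in UTC (ISO 8601)."""
    # Hypothetical helper; Yeti's real tool may format or source this differently.
    return datetime.now(timezone.utc).isoformat()
```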
- Host OS: Runs `llama_cpp` inference.
- Docker: Runs the database, frontend, and `FastAPI` backend.
```bash
git clone https://github.com/deepyes02/yeti-ai
```

- Install `llama_cpp` (compile for your specific architecture; see documentation).
- Install Docker Desktop.
- Download the `Mistral-Nemo` quantized GGUF model from Hugging Face.
- Serve the model locally:

  ```bash
  llama-server -m ~/llms/mistral-nemo-15.gguf --jinja -c 4096  # Adjust context length based on available GPU
  ```

- Run the backend (`FastAPI` + `WebSocket`):

  ```bash
  uvicorn app.main:app --host 0.0.0.0 --port 8000
  ```

- Start `Docker` containers in the project root:

  ```bash
  docker compose up -d
  ```

- Ensure the model name is correctly configured in `load_model.py`.
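Once the model is being served, you can sanity-check the endpoint before starting the rest of the stack. This is a sketch only; it assumes the server is on the default port `8080` (as in the `ChatOpenAI` config below) and that the `openai` Python client is installed (it ships as a dependency of `langchain_openai`):

```python
from openai import OpenAI

# Point the OpenAI client at the local llama-server; any API key string works.
client = OpenAI(base_url="http://localhost:8080/v1", api_key="any_string_here")

resp = client.chat.completions.create(
    model="mistral-nemo",
    messages=[{"role": "user", "content": "Reply with a single word."}],
)
print(resp.choices[0].message.content)
```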
For testing, type checking, and script execution in scripts/, it is recommended to set up a virtual environment in the project root:
```bash
python -m venv env  # Python 3.11 recommended
source ./env/bin/activate
pip install -r requirements.txt
```

Mistral-Nemo is OpenAI API-compatible. Wrapping it in `LangGraph` works just like using OpenAI, except no real API key is required:
```python
from langchain_openai import ChatOpenAI
from pydantic import SecretStr


def load_model():
    model = ChatOpenAI(
        base_url="http://localhost:8080/v1",  # local llama-server endpoint
        model="mistral-nemo",
        api_key=SecretStr("any_string_here"),  # any placeholder string works
        temperature=0.9,
        top_p=0.95,
    )
    return model
```
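As a usage sketch (assumed names, not the project's exact wiring), the loaded model can be dropped into a LangGraph `ReAct` agent with an in-memory checkpointer, so that a `thread_id` groups follow-up messages into the same conversation. `get_current_datetime` here is the hypothetical tool sketched earlier:

```python
from langgraph.checkpoint.memory import MemorySaver
from langgraph.prebuilt import create_react_agent

model = load_model()
# Bind the tool and enable per-thread memory via a checkpointer.
agent = create_react_agent(model, [get_current_datetime], checkpointer=MemorySaver())

reply = agent.invoke(
    {"messages": [("user", "What time is it right now?")]},
    config={"configurable": {"thread_id": "demo-thread"}},
)
print(reply["messages"][-1].content)
```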
Service ports:

- Backend server: Port `8000`
- Frontend server: Port `3000` (see `docker-compose.yml`)
Visit: http://localhost:3000
- **DeepSeek** – Works, but limited by the lack of a quantized non-thinking model.
- **Qwen 3** – Has a “thinking mode” toggle, but not yet supported via `Ollama`. (Issue raised with `LangGraph`.)
- **Llama 3.2** – Handles tools but often produces incoherent results.
- **Granite 3.3 (8B)** – Promising IBM model, but tool calling is not yet functional (needs more testing).

