This project develops a standalone SubRep implementation that turns skill discovery into a certificate-driven, auditable process. SubRep certifies skills via two mathematical tests (CDS/PDS) that guarantee composition safety across motive shifts, preventing negative transfer before skills enter the library. The core mechanism is validated in MO-LunarLander, with certified skills stored as native MeTTa Atoms for future Hyperon integration.
Aligned with Approved Quarter Plan:
| Objective | Goal | Key Results |
|---|---|---|
| 1. Neural Skill Generator | Generate skill summaries from experience | • 2-head MLP (Payoff + Motives) • MDN Interface Defined • TD Error Computation |
| 2. Core Certification | Implement CDS/PDS admission tests | • CDS Test (Universal Benefit) • PDS-ε Test (Acceptable Trade-off) • MO-LunarLander Integration |
| 3. MeTTa Storage | Store certificates as native Atoms | • Certificate Schema Defined • PyMeTTa Bridge (`hyperon`) • Zero-Shot Reuse Demo |
| 4. Minimal Validation | Demonstrate core mechanism works | • Certified Skills Pass Tests • Uncertified Skills Rejected • Admission Rates Documented |
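Objective 1 lists TD error computation as a key result. Since rewards here are vector-valued, a natural reading is a component-wise TD error over the two reward dimensions; the sketch below is an illustrative assumption, not the repo's `utils/` implementation (function name and signature are hypothetical):

```python
import numpy as np

def vector_td_error(reward, value_s, value_next, gamma=0.99, done=False):
    """Component-wise TD error for a vector reward, e.g. [safety, fuel].

    Hypothetical sketch: assumes the value function returns one estimate
    per reward dimension, so the TD error is a 2-vector as well.
    """
    reward = np.asarray(reward, dtype=float)
    # Bootstrap from the next state's value unless the episode ended
    target = reward + (0.0 if done else gamma) * np.asarray(value_next, dtype=float)
    return target - np.asarray(value_s, dtype=float)

# Example: one transition in the 2-objective setting
delta = vector_td_error(reward=[1.0, -0.2],
                        value_s=[0.5, -0.1],
                        value_next=[0.6, -0.15])
print(delta)  # [1.094, -0.2485]
```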
- Python 3.8+
- Git
```bash
# Clone the repository
git clone https://github.com/iCog-Labs-Dev/subrep.git
cd subrep

# Install dependencies
pip install -r requirements.txt

# Configure environment
cp .env.example .env

# Verify environment setup
python tests/test_env.py

# Verify generator output
python tests/test_generator.py

# Run full pipeline (Phase 5+)
python main.py
```

| Folder | Description |
|---|---|
| `env/` | MO-LunarLander wrapper & vector reward handling |
| `generator/` | 2-head MLP skill generator (PyTorch) |
| `certification/` | CDS/PDS admission gate logic |
| `metta/` | PyMeTTa bridge & certificate schema |
| `utils/` | TD error computation, logging, helpers |
| `tests/` | Validation scripts for each component |
- Platform: `mo-gymnasium` (MO-LunarLander-v3)
- Observation Space: `(8,)` – state vector (position, velocity, fuel, etc.)
- Reward Space: `(2,)` – `[Safety_Reward, Fuel_Reward]`
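The native mo-gymnasium MO-LunarLander reward is higher-dimensional than the `(2,)` space above, so the `env/` wrapper presumably folds the native vector into `[Safety_Reward, Fuel_Reward]`. A hypothetical folding function, assuming a native ordering of `[landing, shaping, main_engine_fuel, side_engine_fuel]` (this ordering is an assumption and should be checked against the environment's docs):

```python
import numpy as np

def fold_reward(native_reward):
    """Hypothetical: fold a 4-dim native reward into [Safety_Reward, Fuel_Reward].

    Assumes native ordering [landing, shaping, main_engine_fuel,
    side_engine_fuel]; the actual wrapper lives in env/.
    """
    landing, shaping, main_fuel, side_fuel = np.asarray(native_reward, dtype=float)
    safety = landing + shaping    # crash avoidance + flight stability terms
    fuel = main_fuel + side_fuel  # both engines' fuel costs (non-positive)
    return np.array([safety, fuel])

print(fold_reward([100.0, 1.5, -0.3, -0.03]))  # approximately [101.5, -0.33]
```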
- Architecture: 2-head MLP (Payoff + Motives)
- Input: state vector `(8,)`
- Output: `payoff` scalar `(1,)`; `motives` vector `(2,)`
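A minimal PyTorch sketch of the two-head layout above: a shared trunk feeding a scalar payoff head and a 2-dim motives head. Hidden size, depth, and the class name are illustrative assumptions, not the settings in `generator/`:

```python
import torch
import torch.nn as nn

class SkillGenerator(nn.Module):
    """Sketch of the 2-head MLP: shared trunk, payoff head (1,), motives head (2,)."""

    def __init__(self, obs_dim=8, hidden=64, n_motives=2):
        super().__init__()
        # Shared feature trunk over the (8,) state vector
        self.trunk = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.payoff_head = nn.Linear(hidden, 1)           # scalar payoff estimate
        self.motives_head = nn.Linear(hidden, n_motives)  # per-motive values

    def forward(self, obs):
        h = self.trunk(obs)
        return self.payoff_head(h), self.motives_head(h)

net = SkillGenerator()
payoff, motives = net(torch.randn(4, 8))  # batch of 4 states
print(payoff.shape, motives.shape)  # torch.Size([4, 1]) torch.Size([4, 2])
```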
- CDS: Cone-Dominant Subtask (Universal Benefit)
- PDS-ε: Pareto-Dominant Subtask (Acceptable Trade-off)
- Cones: Full-simplex (Phase 3) -> MDN-learned (Phase 4+)
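One illustrative reading of the two tests, for the full-simplex cone of Phase 3 (the exact definitions live in `certification/`; `delta` below is the skill's per-motive advantage vector, and the functions are hypothetical): a skill is cone-dominant iff its weighted benefit `w · delta` is non-negative for every weight `w` in the simplex, which by linearity reduces to checking the simplex vertices, i.e. `delta >= 0` component-wise. PDS-ε relaxes this to an acceptable trade-off.

```python
import numpy as np

def cds(delta):
    """Cone-Dominant Subtask: benefit under every simplex weight.
    Checking the simplex vertices suffices, so require delta >= 0
    in every motive dimension."""
    return bool(np.all(np.asarray(delta) >= 0))

def pds_eps(delta, eps=0.05):
    """Pareto-Dominant up to eps: some motive strictly improves and no
    motive loses more than eps (hypothetical reading of the trade-off)."""
    delta = np.asarray(delta)
    return bool(np.any(delta > 0) and np.all(delta >= -eps))

print(cds([0.3, 0.1]))        # True:  universal benefit, admitted
print(cds([0.3, -0.01]))      # False: fails strict cone dominance
print(pds_eps([0.3, -0.01]))  # True:  small fuel loss within the eps budget
```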
- Package: `hyperon` (Python bindings)
- Operations: `add_atom`, `match`, `space`
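For intuition, one hypothetical shape for a stored certificate Atom is sketched below; the actual schema is defined in `metta/`, and the field names here are illustrative only. The `hyperon` bridge would add such an atom to a space via `add_atom` and retrieve it with `match`.

```metta
; Hypothetical certificate schema -- all field names are illustrative
(certificate
    (skill hover-stabilize)
    (test CDS)
    (cone full-simplex)
    (advantage (0.31 0.07))
    (status admitted))
```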
- MDN Training: Full Motive Decomposition Network implementation.
- MetaMo Integration: Dynamic weight management & risk budgets.
- Cross-Paradigm Skills: Logic macros & evolutionary programs.
- Benchmarking: Hypervolume efficiency vs. standard MORL baselines.