
DiffSRL: Learning Dynamic-Aware State Representation for Control via Differentiable Simulation

This is pre-release code; bugs are to be expected.

Install

  • Run cd ChamferDistancePytorch
  • Install the Chamfer distance extension: python3 -m pip install -e .
  • Run cd .. to return to the repository root
  • Install the main package: python3 -m pip install -e . (the full sequence is shown below)
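
A minimal sketch of the full install sequence, assuming a fresh clone and that you start from the repository root:

    # build and install the Chamfer distance extension first
    cd ChamferDistancePytorch
    python3 -m pip install -e .
    # then install the main package from the repository root
    cd ..
    python3 -m pip install -e .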

Using the pretrained models

Model-Free Reinforcement Learning on Chopsticks or Rope

  • Run python3 -m plb.algorithms.solve --algo td3 --env_name [Chopsticks-v1/Rope-v1] --exp_name enjoy --model_name rope/encoder --render
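
For example, to replay the pretrained TD3 policy on the rope task (Rope-v1 is one of the two environment names above; the pretrained model name is taken from the bullet as given):

    # render the pretrained model-free (TD3) policy on Rope-v1
    python3 -m plb.algorithms.solve --algo td3 --env_name Rope-v1 --exp_name enjoy --model_name rope/encoder --render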

Model-Based Policy Optimization (MBPO) on Chopsticks or Rope

  • Run python3 -m plb.algorithms.solve --algo torch_nn --env_name [Chopsticks-v1/Rope-v1] --exp_name enjoy --model_name rope/encoder --render
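
For example, on the chopsticks task (assuming, as the bullet suggests, that the same pretrained encoder name is used for both environments):

    # render the model-based policy on Chopsticks-v1
    python3 -m plb.algorithms.solve --algo torch_nn --env_name Chopsticks-v1 --exp_name enjoy --model_name rope/encoder --render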

Training a new model

Collecting data from a new environment

  • Run python3 -m plb.algorithms.solve --algo collect --env_name [EnvName-version] --exp_name [new_environment], which collects raw data and stores it in the raw_data folder.
  • Run python3 preprocess.py --dir raw_data/[new_environment] to pre-process the data; the preprocessed npz file will be stored in data under the name [new_environment] (see the example below).
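
A concrete sketch for a rope dataset; the experiment name rope_demo is a hypothetical stand-in for [new_environment]:

    # collect raw trajectories into raw_data/rope_demo (rope_demo is a made-up name)
    python3 -m plb.algorithms.solve --algo collect --env_name Rope-v1 --exp_name rope_demo
    # pre-process them; the resulting npz file lands in data/ under the same name
    python3 preprocess.py --dir raw_data/rope_demo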

Running state representation learning on the new dataset

  • Run python3 -m plb.algorithms.solve --algo learn_latent --env_name [EnvName-version] --exp_name [EnvName-version] --lr 1e-5. The encoder weights will be saved in pretrained_model (see the example below).
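
For example, on the rope environment (assuming, consistent with the other commands above, that the representation-learning stage is selected via the --algo flag):

    # train the dynamic-aware encoder; weights are written to pretrained_model
    python3 -m plb.algorithms.solve --algo learn_latent --env_name Rope-v1 --exp_name Rope-v1 --lr 1e-5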

Experiment results

All experiment results below are rendered from policies trained with MBPO.

  • Picking up a rope
  • Wrapping a rope around a cylinder

Simulation-to-Real Transfer Experiments

Rope Experiment (The coordinate frame has been flipped to avoid occlusion)

Chopsticks Experiment
