Skip to content

Commit e3c17b0

Browse files
authored
Merge pull request #2093 from rhatdan/VERSION
Bump to v0.14.0
2 parents b7a080f + 1ee89d0 commit e3c17b0

File tree

17 files changed

+66
-22
lines changed

17 files changed

+66
-22
lines changed

docs/ramalama-bench.1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
5959
images based on the accelerator it discovers. For example:
6060
`quay.io/ramalama/ramalama`. See the table below for all default images.
6161
The default image tag is based on the minor version of the RamaLama package.
62-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
62+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
6363

6464
The default can be overridden in the ramalama.conf file or via the
6565
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

docs/ramalama-perplexity.1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
6262
images based on the accelerator it discovers. For example:
6363
`quay.io/ramalama/ramalama`. See the table below for all default images.
6464
The default image tag is based on the minor version of the RamaLama package.
65-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
65+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
6666

6767
The default can be overridden in the ramalama.conf file or via the
6868
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

docs/ramalama-rag.1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
5050
images based on the accelerator it discovers. For example:
5151
`quay.io/ramalama/ramalama-rag`. See the table below for all default images.
5252
The default image tag is based on the minor version of the RamaLama package.
53-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
53+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
5454

5555
The default can be overridden in the ramalama.conf file or via the
5656
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

docs/ramalama-run.1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -73,7 +73,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
7373
images based on the accelerator it discovers. For example:
7474
`quay.io/ramalama/ramalama`. See the table below for all default images.
7575
The default image tag is based on the minor version of the RamaLama package.
76-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
76+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
7777

7878
The default can be overridden in the ramalama.conf file or via the
7979
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

docs/ramalama-serve.1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -120,7 +120,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
120120
images based on the accelerator it discovers. For example:
121121
`quay.io/ramalama/ramalama`. See the table above for all default images.
122122
The default image tag is based on the minor version of the RamaLama package.
123-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
123+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
124124

125125
The default can be overridden in the ramalama.conf file or via the
126126
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

docs/ramalama-version.1.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,9 +18,9 @@ Print usage message
1818

1919
```
2020
$ ramalama version
21-
ramalama version 0.13.0
21+
ramalama version 0.14.0
2222
$ ramalama -q version
23-
0.13.0
23+
0.14.0
2424
>
2525
```
2626
## SEE ALSO

docsite/docs/commands/ramalama/bench.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -63,7 +63,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
6363
images based on the accelerator it discovers. For example:
6464
`quay.io/ramalama/ramalama`. See the table below for all default images.
6565
The default image tag is based on the minor version of the RamaLama package.
66-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
66+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
6767

6868
The default can be overridden in the ramalama.conf file or via the
6969
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

docsite/docs/commands/ramalama/convert.mdx

Lines changed: 14 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -25,14 +25,26 @@ The model can be from RamaLama model storage in Huggingface, Ollama, or a local
2525

2626
Convert Safetensor models into a GGUF with the specified quantization format. To learn more about model quantization, read llama.cpp documentation:
2727
https://github.com/ggml-org/llama.cpp/blob/master/tools/quantize/README.md
28-
Default: Q4_K_M
2928

3029
#### **--help**, **-h**
3130
Print usage message
3231

32+
#### **--image**=IMAGE
33+
Image to use for model quantization when converting to GGUF format (when the `--gguf` option has been specified). The image must have the
34+
`llama-quantize` executable available on the `PATH`. Defaults to the appropriate `ramalama` image based on available accelerators. If no
35+
accelerators are available, the current `quay.io/ramalama/ramalama` image will be used.
36+
3337
#### **--network**=*none*
3438
sets the configuration for network namespaces when handling RUN instructions
3539

40+
#### **--pull**=*policy*
41+
Pull image policy. The default is **missing**.
42+
43+
#### **--rag-image**=IMAGE
44+
Image to use when converting to GGUF format (when then `--gguf` option has been specified). The image must have the `convert_hf_to_gguf.py` script
45+
executable and available in the `PATH`. The script is available from the `llama.cpp` GitHub repo. Defaults to the current
46+
`quay.io/ramalama/ramalama-rag` image.
47+
3648
#### **--type**=*raw* | *car*
3749

3850
type of OCI Model Image to convert.
@@ -59,7 +71,7 @@ Successfully tagged quay.io/rhatdan/tiny:latest
5971

6072
Generate and run an oci model with a quantized GGUF converted from Safetensors.
6173
```bash
62-
$ ramalama --image quay.io/ramalama/ramalama-rag convert --gguf Q4_K_M hf://ibm-granite/granite-3.2-2b-instruct oci://quay.io/kugupta/granite-3.2-q4-k-m:latest
74+
$ ramalama convert --gguf Q4_K_M hf://ibm-granite/granite-3.2-2b-instruct oci://quay.io/kugupta/granite-3.2-q4-k-m:latest
6375
Converting /Users/kugupta/.local/share/ramalama/models/huggingface/ibm-granite/granite-3.2-2b-instruct to quay.io/kugupta/granite-3.2-q4-k-m:latest...
6476
Building quay.io/kugupta/granite-3.2-q4-k-m:latest...
6577
$ ramalama run oci://quay.io/kugupta/granite-3.2-q4-k-m:latest

docsite/docs/commands/ramalama/info.mdx

Lines changed: 13 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -20,14 +20,24 @@ show this help message and exit
2020

2121
## FIELDS
2222

23+
The `Accelerator` field indicates the accelerator type for the machine.
24+
25+
The `Config` field shows the list of paths to RamaLama configuration files used.
26+
2327
The `Engine` field indicates the OCI container engine used to launch the container in which to run the AI Model
2428

2529
The `Image` field indicates the default container image in which to run the AI Model
2630

27-
The `Runtime` field indicates which backend engine is used to execute the AI model:
31+
The `Inference` field lists the currently used inference engine as well as a list of available engine specification and schema files used for model inference.
32+
For example:
33+
34+
- `llama.cpp`
35+
- `vllm`
36+
- `mlx`
37+
38+
The `Selinux` field indicates if SELinux is activated or not.
2839

29-
- `llama.cpp`: Uses the llama.cpp library for model execution
30-
- `vllm`: Uses the vLLM library for model execution
40+
The `Shortnames` field shows the used list of configuration files specifying AI Model short names as well as the merged list of shortnames.
3141

3242
The `Store` field indicates the directory path where RamaLama stores its persistent data, including downloaded models, configuration files, and cached data. By default, this is located in the user's local share directory.
3343

docsite/docs/commands/ramalama/perplexity.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
6666
images based on the accelerator it discovers. For example:
6767
`quay.io/ramalama/ramalama`. See the table below for all default images.
6868
The default image tag is based on the minor version of the RamaLama package.
69-
Version 0.13.0 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
69+
Version 0.14.0 of RamaLama pulls an image with a `:0.14` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
7070

7171
The default can be overridden in the ramalama.conf file or via the
7272
RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

0 commit comments

Comments
 (0)