Releases · sammcj/gollama

gollama --vram --model NousResearch/Hermes-2-Theta-Llama-3-8B --quant q4_k_m --context 2048 --kvcache q4_0 # For GGUF models
gollama --vram --model NousResearch/Hermes-2-Theta-Llama-3-8B --quant 5.0 --context 2048 --kvcache q4_0 # For exl2 models
# Estimated VRAM usage: 5.35 GB

To calculate maximum context for a given memory constraint:

gollama --vram --model NousResearch/Hermes-2-Theta-Llama-3-8B --quant q4_k_m --memory 6 --kvcache q8_0 # For GGUF models
gollama --vram --model NousResearch/Hermes-2-Theta-Llama-3-8B --bpw 5.0 --memory 6 --kvcache q8_0 # For exl2 models
# Maximum context for 6.00 GB of memory: 5069

To find the best BPW:

gollama --vram --model NousResearch/Hermes-2-Theta-Llama-3-8B --memory 6 --quanttype gguf
# Best BPW for 6.00 GB of memory: IQ3_S

The vRAM estimator works by:

Fetching the model configuration from Hugging Face (if not cached locally)
Calculating the memory requirements for model parameters, activations, and KV cache
Adjusting calculations based on the specified quantisation settings
Performing binary and linear searches to optimise for context length or quantisation settings
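
The steps above can be sketched in Go as follows. This is a minimal illustration under stated assumptions, not gollama's actual implementation: the `modelShape` and `quantOption` types, the function names, and the bits-per-weight figures are all hypothetical, and the estimate counts only weights and KV cache (activations and runtime overhead are ignored). The model shape approximates Llama-3-8B.

```go
package main

import "fmt"

// modelShape holds the fields this sketch needs. The names are illustrative
// and are not taken from gollama's source or from a Hugging Face config.
type modelShape struct {
	Params  float64 // total parameter count
	Layers  int     // number of transformer layers
	KVHeads int     // number of key/value heads
	HeadDim int     // dimension of each head
}

// quantOption pairs a quantisation name with an assumed bits-per-weight value.
type quantOption struct {
	Name string
	BPW  float64
}

// estimateGB returns a rough estimate in GiB: weights plus KV cache only.
func estimateGB(m modelShape, weightBPW, kvBPW float64, context int) float64 {
	weightBytes := m.Params * weightBPW / 8
	// K and V each store Layers * KVHeads * HeadDim values per token.
	kvValuesPerToken := 2 * float64(m.Layers*m.KVHeads*m.HeadDim)
	kvBytes := kvValuesPerToken * float64(context) * kvBPW / 8
	return (weightBytes + kvBytes) / (1 << 30)
}

// maxContext binary-searches the largest context in [1, limit] whose estimate
// fits within budgetGB. It returns 0 if even a context of 1 does not fit.
func maxContext(m modelShape, weightBPW, kvBPW, budgetGB float64, limit int) int {
	lo, hi, best := 1, limit, 0
	for lo <= hi {
		mid := lo + (hi-lo)/2
		if estimateGB(m, weightBPW, kvBPW, mid) <= budgetGB {
			best = mid
			lo = mid + 1
		} else {
			hi = mid - 1
		}
	}
	return best
}

// bestQuant linearly scans options, which must be ordered from highest to
// lowest BPW, and returns the first one that fits within budgetGB.
func bestQuant(m modelShape, options []quantOption, kvBPW, budgetGB float64, context int) (quantOption, bool) {
	for _, q := range options {
		if estimateGB(m, q.BPW, kvBPW, context) <= budgetGB {
			return q, true
		}
	}
	return quantOption{}, false
}

func main() {
	// Approximate shape of an 8B Llama-3 model (assumed values).
	m := modelShape{Params: 8.03e9, Layers: 32, KVHeads: 8, HeadDim: 128}

	// Rough, illustrative bits-per-weight figures; not gollama's table.
	options := []quantOption{{"Q8_0", 8.5}, {"Q6_K", 6.59}, {"Q5_K_M", 5.69}, {"Q4_K_M", 4.85}}

	fmt.Printf("estimate: %.2f GiB\n", estimateGB(m, 4.85, 4.5, 2048))
	fmt.Println("max context:", maxContext(m, 4.85, 8.5, 6, 131072))
	if q, ok := bestQuant(m, options, 8.5, 6, 2048); ok {
		fmt.Println("best quant:", q.Name)
	}
}
```

Binary search suits the context length because the estimate grows monotonically with context, while the handful of quantisation levels is small enough that a linear scan is simpler. The figures this sketch prints will not match gollama's output, since the real estimator also accounts for activations.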

1.21.1 (2024-08-01)

What's Changed

chore(renovate): patch Update patch (patch) by @renovate in #77
feat: vram estimator by @sammcj in #86

Full Changelog: v1.20.4...v1.21.1

Contributors

sammcj and renovate

21 Jul 21:40

github-actions

v1.20.4

85d01a4

Release v1.20.4

1.20.4 (2024-07-21)

Documentation

contributor: contributors readme action update (#80) (7a3356b)

What's Changed

chore(renovate): pin Update actions/setup-go digest to 0a12ed9 by @renovate in #71
fix: index out of range error in quantColour function by @anrgct in #79
docs(contributor): contributors readme action update by @github-actions in #80
chore(deps): bump deps by @sammcj in #81

New Contributors

@anrgct made their first contribution in #79

Full Changelog: v1.20.2...v1.20.4

Contributors

sammcj, renovate, and anrgct

14 Jul 07:28

github-actions

v1.20.2

74a3bef

Release v1.20.2

1.20.2 (2024-07-14)

Bug Fixes

tagging: hopefully fix tagging in actions vs makefile (#74) (74a3bef)

What's Changed

fix(tagging): hopefully fix tagging in actions vs makefile by @sammcj in #74

Full Changelog: 1.20.1...v1.20.2

Contributors

sammcj

14 Jul 07:13

github-actions

1.20.1

f3f5a4f

Release 1.20.1

What's Changed

chore(renovate): pin Update actions/upload-artifact digest to 0b2256b by @renovate in #70
feat: pull model updates by @sammcj in #69
feat: pull existing or new model by @sammcj in #72
chore(deps): bump deps by @sammcj in #73

Full Changelog: 1.18.2...1.20.1

Contributors

sammcj and renovate

05 Jul 07:14

github-actions

1.18.2

52e6871

Release 1.18.2

What's Changed

feat: add edit model cli and tui by @sammcj in #64
feat: add -L cli option to link all models by @sammcj in #65
minor updates by @sammcj in #67

Full Changelog: 1.16.0...1.18.2

Contributors

sammcj

04 Jul 03:32

github-actions

1.17.0

174673d

Release 1.17.0

1.17.0 (2024-07-04)

Features

add edit model cli and tui (#64) (174673d)

BREAKING

Update model (u) has been replaced by Edit model (e)

What's Changed

feat: add edit model cli and tui by @sammcj in #64

Full Changelog: 1.16.0...1.17.0

Contributors

sammcj

03 Jul 22:11

github-actions

1.16.0

234511f

Release 1.16.0

1.16.0 (2024-07-03)

Features

add search cli (#63) (234511f)

What's Changed

feat: add search cli by @sammcj in #63

Full Changelog: 1.15.0...1.16.0

Contributors

sammcj

Releases: sammcj/gollama

Release v1.26.0: 1.26.0 (2024-08-03)
Release v1.24.0: 1.24.0 (2024-08-03)
Release v1.22.0: 1.22.0 (2024-08-01)
Release v1.21.1: 1.21.1 (2024-08-01)
Release v1.20.4: 1.20.4 (2024-07-21)
Release v1.20.2: 1.20.2 (2024-07-14)
Release 1.20.1
Release 1.18.2
Release 1.17.0: 1.17.0 (2024-07-04)
Release 1.16.0: 1.16.0 (2024-07-03)