devflowinc/install-embeddings

install-embeddings

Prerequisites

Check AWS quota

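Ensure your AWS account has quota for:

  • At least ${gpu_count}*4 vCPUs under "Running On-Demand G and VT instances" in your chosen region (this quota is counted in vCPUs, and the default g4dn.xlarge has 4 vCPUs)
  • At least one load balancer per model you want to serve (not per running server)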
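The quota arithmetic above can be sketched as a small script. This is a minimal sketch, not part of the repository: the helper names `required_vcpus` and `quota_is_enough` and the `CHECK_AWS` switch are hypothetical, and the 4 vCPUs per node assumes the default g4dn.xlarge instance type. The live lookup filters `aws service-quotas list-service-quotas` by quota name and only runs when you opt in.

```shell
#!/usr/bin/env bash
# Sketch: check whether the G and VT vCPU quota covers the planned GPU nodes.

gpu_count=1            # same value as in create_cluster.sh
region=us-east-2       # same value as in create_cluster.sh
vcpus_per_gpu_node=4   # assumption: g4dn.xlarge, which has 4 vCPUs

# Print the number of vCPUs needed for a node count ($1) and node size ($2).
required_vcpus() {
  echo $(( $1 * $2 ))
}

# Succeed if the quota ($1, possibly decimal such as "8.0") covers $2 vCPUs.
quota_is_enough() {
  local quota_int=${1%.*}
  [ "$quota_int" -ge "$2" ]
}

needed=$(required_vcpus "$gpu_count" "$vcpus_per_gpu_node")
echo "Need at least ${needed} vCPUs of G and VT quota in ${region}"

# Hypothetical switch: set CHECK_AWS=1 to query the live quota (needs AWS credentials).
if [ "${CHECK_AWS:-0}" = "1" ]; then
  quota=$(aws service-quotas list-service-quotas \
    --service-code ec2 --region "$region" \
    --query "Quotas[?QuotaName=='Running On-Demand G and VT instances'].Value" \
    --output text)
  if quota_is_enough "$quota" "$needed"; then
    echo "Quota OK (${quota} vCPUs)"
  else
    echo "Quota too low (${quota} vCPUs): request an increase"
  fi
fi
```

If you pick a larger GPU instance type, adjust `vcpus_per_gpu_node` to match its vCPU count.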

Create the EKS cluster and install the needed plugins

To get your account ID, run

aws sts get-caller-identity

Then modify the following lines in create_cluster.sh, setting account_id to the Account value from that output

account_id=555555555555
region=us-east-2
cluster_name=trieve-gpu
main_instance_type=t3.small
gpu_instance_type=g4dn.xlarge
gpu_count=1
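If you prefer not to copy the account ID by hand, the lookup can be sketched as below. This is an illustration under assumptions: the helper names `is_valid_account_id` and `fetch_account_id` and the `FETCH_ACCOUNT` switch are hypothetical, and the script only prints the value; it does not edit create_cluster.sh. The `--query Account --output text` options make the AWS CLI print just the account ID.

```shell
#!/usr/bin/env bash
# Sketch: look up the account ID and sanity-check it before editing create_cluster.sh.

# Succeed only if $1 is a 12-digit AWS account ID.
is_valid_account_id() {
  case "$1" in
    *[!0-9]*|"") return 1 ;;
  esac
  [ "${#1}" -eq 12 ]
}

# Print only the Account field of the caller identity (needs AWS credentials).
fetch_account_id() {
  aws sts get-caller-identity --query Account --output text
}

# Hypothetical switch: set FETCH_ACCOUNT=1 to call AWS; otherwise use the placeholder.
if [ "${FETCH_ACCOUNT:-0}" = "1" ]; then
  account_id=$(fetch_account_id)
else
  account_id=555555555555
fi

if is_valid_account_id "$account_id"; then
  echo "account_id=${account_id}"
else
  echo "Unexpected account ID: '${account_id}'" >&2
fi
```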

Run ./create_cluster.sh to create the cluster

  1. Specify your embedding models

Edit embedding_models.yaml to specify the models that you want to use

  2. Install the helm chart

helm upgrade -i embedding-release oci://registry-1.docker.io/trieve/embeddings-helm -f embedding_models.yaml

  3. Get your model endpoints

kubectl get ing
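To pull just the endpoint addresses out of the ingress list, something like the following may help. It is a rough sketch with hypothetical helper names (`list_ingress_hosts`, `format_endpoints`). It assumes each ingress exposes a load balancer hostname in its status and that the endpoints are served over plain HTTP; neither is confirmed by this README, so adjust the scheme and field to match your cluster.

```shell
#!/usr/bin/env bash
# Sketch: turn ingress names and load balancer hostnames into endpoint URLs.

# Print "name<TAB>hostname" for every ingress in the current namespace
# (needs access to the cluster; assumes a hostname is set in the status).
list_ingress_hosts() {
  kubectl get ing -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.loadBalancer.ingress[0].hostname}{"\n"}{end}'
}

# Read "name<TAB>hostname" lines on stdin and print "name http://hostname".
# Ingresses that have no hostname yet are reported as pending.
format_endpoints() {
  awk -F '\t' '{ if ($2 == "") print $1, "(pending)"; else print $1, "http://" $2 }'
}

# Usage against a live cluster:
#   list_ingress_hosts | format_endpoints
```

A load balancer can take a few minutes to provision, so an ingress may show as pending right after the helm install.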

Cleanup

helm uninstall embedding-release
./delete_cluster.sh
