LangChain components for Dartmouth-hosted models.
- Install the package:
pip install langchain_dartmouth
- Obtain a Dartmouth API key from developer.dartmouth.edu
- Store the API key as an environment variable called
DARTMOUTH_API_KEY:
export DARTMOUTH_API_KEY=<your_key_here>
- Obtain a Dartmouth Chat API key
- Store the API key as an environment variable called
DARTMOUTH_CHAT_API_KEY:
export DARTMOUTH_CHAT_API_KEY=<your_key_here>
Note
You may want to make the environment variables permanent or use a .env file.
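For example, here is a minimal sketch using the third-party python-dotenv package (install with pip install python-dotenv); the .env file shown is an assumption about your setup:
# .env (placed in your project's working directory)
DARTMOUTH_API_KEY=<your_key_here>
DARTMOUTH_CHAT_API_KEY=<your_key_here>
Then, at the top of your script:
from dotenv import load_dotenv
load_dotenv()  # reads .env and sets the environment variables for this process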
This library provides an integration of Dartmouth-provided generative AI resources with the LangChain framework.
There are three main components currently implemented:
- Large Language Models
- Embedding models
- Reranking models
All of these components are based on corresponding LangChain base classes and can be used seamlessly wherever the corresponding LangChain objects can be used.
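For example, here is a sketch of a Dartmouth chat model dropped into a standard LangChain Expression Language (LCEL) chain; the model name is borrowed from the chat example further below:
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_dartmouth.llms import ChatDartmouth

prompt = ChatPromptTemplate.from_template("Tell me a one-sentence fact about {topic}.")
llm = ChatDartmouth(model_name="meta.llama-3-2-11b-vision-instruct")
chain = prompt | llm | StrOutputParser()  # composes like any LangChain chat model
print(chain.invoke({"topic": "Dartmouth College"}))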
There are three kinds of Large Language Models (LLMs) provided by Dartmouth:
- On-premises:
- Base models without instruction tuning (require no special prompt format)
- Instruction-tuned models (also known as Chat models) requiring specific prompt formats
- Cloud:
- Third-party, pay-as-you-go chat models (e.g., OpenAI's GPT-4.1, Google Gemini)
Using a Dartmouth-hosted base language model:
from langchain_dartmouth.llms import DartmouthLLM
llm = DartmouthLLM(model_name="codellama-13b-hf")
response = llm.invoke("Write a Python script to swap two variables.")
print(response)
Using a Dartmouth-hosted chat model:
from langchain_dartmouth.llms import ChatDartmouth
llm = ChatDartmouth(model_name="meta.llama-3-2-11b-vision-instruct")
response = llm.invoke("Hi there!")
print(response.content)
Note
The required prompt format is enforced automatically when you are using ChatDartmouth.
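For instance (a sketch), you can pass a regular LangChain message list and let ChatDartmouth apply the model-specific chat template for you:
from langchain_core.messages import HumanMessage, SystemMessage
from langchain_dartmouth.llms import ChatDartmouth

llm = ChatDartmouth(model_name="meta.llama-3-2-11b-vision-instruct")
messages = [
    SystemMessage(content="You are a terse assistant."),
    HumanMessage(content="Hi there!"),
]
response = llm.invoke(messages)  # no manual prompt formatting needed
print(response.content)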
Using a Dartmouth-provided third-party chat model:
from langchain_dartmouth.llms import ChatDartmouth
llm = ChatDartmouth(model_name="openai.gpt-4.1-mini-2025-04-14")
response = llm.invoke("Hi there!")
print(response.content)
Using a Dartmouth-hosted embeddings model:
from langchain_dartmouth.embeddings import DartmouthEmbeddings
embeddings = DartmouthEmbeddings()
response = embeddings.embed_query("Hello? Is there anybody in there?")
print(response)
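Continuing the example above: like any LangChain embeddings class, DartmouthEmbeddings can also embed a batch of documents in one call (a brief sketch):
texts = ["Is this the real life?", "Is this just fantasy?"]
vectors = embeddings.embed_documents(texts)  # one embedding vector per input string
print(len(vectors))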
Using a Dartmouth-hosted reranking model:
from langchain_dartmouth.retrievers.document_compressors import DartmouthReranker
from langchain_core.documents import Document
docs = [
Document(page_content="Deep Learning is not..."),
Document(page_content="Deep learning is..."),
]
query = "What is Deep Learning?"
reranker = DartmouthReranker(model_name="bge-reranker-v2-m3")
ranked_docs = reranker.compress_documents(query=query, documents=docs)
print(ranked_docs)
For a list of available models, check the respective list() method of each class.
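For example (a sketch, assuming list() is exposed as a static method on each class, as stated above):
from langchain_dartmouth.embeddings import DartmouthEmbeddings
from langchain_dartmouth.llms import ChatDartmouth, DartmouthLLM

print(DartmouthLLM.list())         # available base LLMs
print(ChatDartmouth.list())        # available chat models
print(DartmouthEmbeddings.list())  # available embedding models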
If you are using langchain_dartmouth as part of a scientific publication, we would greatly appreciate a citation of the following paper:
@inproceedings{10.1145/3708035.3736076,
author = {Stone, Simon and Crossett, Jonathan and Luker, Tivon and Leligdon, Lora and Cowen, William and Darabos, Christian},
title = {Dartmouth Chat - Deploying an Open-Source LLM Stack at Scale},
year = {2025},
isbn = {9798400713989},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
doi = {10.1145/3708035.3736076},
booktitle = {Practice and Experience in Advanced Research Computing 2025: The Power of Collaboration},
articleno = {43},
numpages = {5}
}
Created by Simon Stone for Dartmouth College under Creative Commons CC BY-NC 4.0 License. For questions, comments, or improvements, email Research Computing.
Except where otherwise noted, the example programs are made available under the OSI-approved MIT license.
