Lodging Recommendation Service With Azure OpenAI and YugabyteDB

This is a sample Node.JS and React application that demonstrates how to build generative AI applications using the Azure OpenAI Service and YugabyteDB.

The app provides lodging recommendations for travelers going to San Francisco. It supports two distinct modes:

Azure OpenAI Chat Mode: In this mode, the Node.js backend leverages one of the Azure GPT models to generate lodging recommendations based on the user's input.
YugabyteDB Embeddings Mode: Initially, the backend employs an Azure OpenAI Embeddings model to convert the user's prompt into an embedding (a vectorized representation of the text data). Subsequently, the server does a similarity search in YugabyteDB finding Airbnb properties which descriptions are related to the user's prompt. YugabyteDB relies on the PostgreSQL pgvector extension for the similarity search and other generative AI use cases.

Prerequisites

A Microsoft Azure subscription.
An Azure OpenAI Service resource with GPT and Embedding models deployed. For more information about model deployment, see the resource deployment guide.
The latest Node.js version.
A YugabyteDB cluster of version 2.19.2 or later.

Download Application and Provide Azure OpenAI Settings

Download the application and provide settings specific to your instance of the Azure OpenAI Service:

Clone the repository:

git clone https://github.com/YugabyteDB-Samples/yugabytedb-azure-openai-lodging-service

Initialize the project:

npm i 

cd {project_dir}/frontend
npm i

Open the {project_dir}/application.properties.ini file and fill in the Azure specific settings:

AZURE_OPENAI_KEY= # The Azure OpenAI API key
AZURE_OPENAI_ENDPOINT= # An endpoint for the Language APIs
AZURE_GPT_MODEL_DEPLOYMENT_NAME = # A deployment name for the GPT model
AZURE_EMBEDDING_MODEL_DEPLOYMENT_NAME = # A deployment name for the Embedding model

Follow this Azure guide, if you'd like to know how to deploy Azure models and find their names.

Start YugabyteDB and Load Sample Data Set

Start a YugabyteDB isntance of version 2.19.2 or later:

mkdir ~/yb_docker_data

docker network create custom-network

docker run -d --name yugabytedb_node1 --net custom-network \
    -p 15433:15433 -p 7001:7000 -p 9001:9000 -p 5433:5433 \
    -v ~/yb_docker_data/node1:/home/yugabyte/yb_data --restart unless-stopped \
    yugabytedb/yugabyte:2.19.2.0-b121 \
    bin/yugabyted start \
    --base_dir=/home/yugabyte/yb_data --daemon=false

docker run -d --name yugabytedb_node2 --net custom-network \
    -p 15434:15433 -p 7002:7000 -p 9002:9000 -p 5434:5433 \
    -v ~/yb_docker_data/node2:/home/yugabyte/yb_data --restart unless-stopped \
    yugabytedb/yugabyte:2.19.2.0-b121 \
    bin/yugabyted start --join=yugabytedb_node1 \
    --base_dir=/home/yugabyte/yb_data --daemon=false
    
docker run -d --name yugabytedb_node3 --net custom-network \
    -p 15435:15433 -p 7003:7000 -p 9003:9000 -p 5435:5433 \
    -v ~/yb_docker_data/node3:/home/yugabyte/yb_data --restart unless-stopped \
    yugabytedb/yugabyte:2.19.2.0-b121 \
    bin/yugabyted start --join=yugabytedb_node1 \
    --base_dir=/home/yugabyte/yb_data --daemon=false

The database connectivity settings are provided in the {project_dir}/application.properties.ini file and do not need to be changed if you started the cluster with the command above:

DATABASE_HOST=localhost
DATABASE_PORT=5433
DATABASE_NAME=yugabyte
DATABASE_USER=yugabyte
DATABASE_PASSWORD=yugabyte

Next, load the sample Airbnb data set for the properties in San Francisco:

Create the original schema:

psql -h 127.0.0.1 -p 5433 -U yugabyte -d yugabyte {project_dir}/sql/0_airbnb_listings.sql

Load the data:

psql -h 127.0.0.1 -p 5433 -U yugabyte
\copy airbnb_listing from '{project_dir}/sql/sf_airbnb_listings.csv' DELIMITER ',' CSV HEADER;

Execute the following script to enable the pgvector extension and add the description_embedding column of the vector type:
```
\i {project_dir}/sql/1_airbnb_embeddings.sql
```

Generate Embeddings for Airbnb Listing Descriptions

Airbnb properties provide a detailed property description (rooms number, amenities, location and other perks) in the description column. That information is a perfect fit for the similarity search against user prompts. However, the text data of the description column needs to be transformed into a vectorized representation.

Use the embeddings_generator.js tool to generate embeddings for all Arbnb properties descriptions. The tool leverages the Azure OpenAI Embedding model and stores the generated vectors in the description_embedding column in the database:

node {project_dir}backend/embeddings_generator.js

It can take 10+ minutes to generate embeddings for over 7000 Airbnb properties. You'll see the message below once the generation is over:

....
Processing rows starting from 34746755
Processed 7551 rows
Processing rows starting from 35291912
Finished generating embeddings for 7551 rows

Starting the Application

Start the Node.js backend:
```
cd {project_dir}/backend
npm start
```
Start the React frontend:
```
cd {project_dir}/backend
npm start
```
Access the application's user interface at: http://localhost:3000

Enjoy exploring the app and toggling between the two modes: Azure OpenAI Chat and YugabyteDB Embeddings. The latter is significantly faster.

Here are some sample prompts to get you started:

We're traveling to San Francisco from October 21st through 28th. We need a hotel with parking.

I'm looking for an apartment near the Golden Gate Bridge with a nice view of the Bay.

I'd like a hotel near Fisherman's Wharf with a Bay view.

An apartment close to the Salesforce Tower, within walking distance of Blue Bottle Coffee.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
backend		backend
frontend		frontend
sql		sql
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
application.properties.ini		application.properties.ini
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lodging Recommendation Service With Azure OpenAI and YugabyteDB

Prerequisites

Download Application and Provide Azure OpenAI Settings

Start YugabyteDB and Load Sample Data Set

Generate Embeddings for Airbnb Listing Descriptions

Starting the Application

About

Releases

Packages

Languages

License

YugabyteDB-Samples/yugabytedb-azure-openai-lodging-service

Folders and files

Latest commit

History

Repository files navigation

Lodging Recommendation Service With Azure OpenAI and YugabyteDB

Prerequisites

Download Application and Provide Azure OpenAI Settings

Start YugabyteDB and Load Sample Data Set

Generate Embeddings for Airbnb Listing Descriptions

Starting the Application

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages