DRAFT: Added an updated version of notebook for video script #471

justincastilla · 2025-07-15T01:38:27Z

Updated the notebook to use modern mechanisms and API calls. I ONLY updated the notebook codebase, not the supporting text. I will change the copy at a later time.

gitnotebooks · 2025-07-15T01:38:30Z

Found 1 changed notebook. Review the changes at https://app.gitnotebooks.com/elastic/elasticsearch-labs/pull/471

carlyrichmond · 2025-07-15T10:10:00Z

@justincastilla I see you've added a new notebook rather than updated the existing one. Why is that? I would have expected us to update the existing notebook, especially if the plan is to update the piece as well.

Can you also double check if things are running as expected and using latest possible versions? I see an error in one of the cells in your updated notebook:

ERROR: Could not find a version that satisfies the requirement torch==1.11 (from versions: 2.0.0, 2.0.1, 2.1.0, 2.1.1, 2.1.2, 2.2.0, 2.2.1, 2.2.2, 2.3.0, 2.3.1, 2.4.0, 2.4.1, 2.5.0, 2.5.1, 2.6.0, 2.7.0, 2.7.1)
ERROR: No matching distribution found for torch==1.11

[notice] A new release of pip is available: 23.2.1 -> 25.1.1
[notice] To update, run: python3 -m pip install --upgrade pip
Note: you may need to restart the kernel to use updated packages.

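The pip output suggests that the pinned `torch==1.11` is simply not among the builds pip could find for this environment, since every listed candidate is 2.0.0 or newer. A minimal sketch of that check follows. It uses a simplified version comparison rather than pip's actual resolver, and the helper names `normalize` and `pin_is_available` are hypothetical, introduced only for illustration.

```python
# Versions pip reported as installable in the error output.
AVAILABLE = [
    "2.0.0", "2.0.1", "2.1.0", "2.1.1", "2.1.2", "2.2.0", "2.2.1", "2.2.2",
    "2.3.0", "2.3.1", "2.4.0", "2.4.1", "2.5.0", "2.5.1", "2.6.0", "2.7.0",
    "2.7.1",
]


def normalize(version: str, width: int = 3) -> tuple:
    """Pad a dotted version with zeros so '1.11' and '1.11.0' compare equal."""
    parts = [int(p) for p in version.split(".")]
    return tuple(parts + [0] * (width - len(parts)))


def pin_is_available(pin: str, available: list) -> bool:
    """Return True if the exact pin matches one of the available versions."""
    target = normalize(pin)
    return any(normalize(v) == target for v in available)


print(pin_is_available("1.11", AVAILABLE))   # False: the pin cannot be satisfied
print(pin_is_available("2.7.1", AVAILABLE))  # True
```

Under that reading, the options are to loosen or update the pin, or to drop the dependency entirely if the code no longer needs it.
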
justincastilla · 2025-07-15T15:05:53Z

> @justincastilla I see you've added a new notebook rather than updated the existing one. Why is that? I would have expected us to update the existing notebook, especially if the plan is to update the piece as well.

I'm not going to replace it fully until this new notebook is approved and ready. Also, we'll want it around until the article is updated, as it references the same code.

> Can you also double check if things are running as expected and using latest possible versions? I see an error in one of the cells in your updated notebook:

The new code no longer needs those packages. It's been removed.

carlyrichmond

I like the summary being added. Can you make sure all TODOs are addressed and the comments removed? After that I think we're good.

As discussed on Wednesday, let's also make sure this updated notebook replaces the original once the piece has been updated.

carlyrichmond · 2025-07-16T09:51:18Z

.../lexical-and-semantic-search-with-elasticsearch/updated-ecommerce_dense_sparse_project.ipynb

-    "# Index to load products-ecommerce.json docs\n",
-    "if client.indices.exists(index=\"ecommerce\"):\n",
-    "    client.indices.delete(index=\"ecommerce\")\n",
+    "We define the `e5_description_vector` and the `elser_description_vector` fields to store the inference pipeline results. The field type in `e5_description_vector` is a `dense_vector`. The `.e5_multilingual_small` model has embedding_size of 384, so the dimension of the fector (dims) is set to 384. \n",


Suggested change

-    "We define the `e5_description_vector` and the `elser_description_vector` fields to store the inference pipeline results. The field type in `e5_description_vector` is a `dense_vector`. The `.e5_multilingual_small` model has embedding_size of 384, so the dimension of the fector (dims) is set to 384. \n",
+    "We define the `e5_description_vector` and the `elser_description_vector` fields to store the inference pipeline results. The field type in `e5_description_vector` is a `dense_vector`. The `.e5_multilingual_small` model has embedding_size of 384, so the dimension of the vector (dims) is set to 384. \n",

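As a rough illustration of the mapping that the notebook text describes, the sketch below builds the two field definitions as a plain Python dictionary. The field names and the 384 dimensions come from the notebook text. The `sparse_vector` type for the ELSER field and the `build_vector_mappings` helper are assumptions made for illustration, not code taken from the PR.

```python
# Embedding size of the E5 model, as stated in the notebook text.
E5_DIMS = 384


def build_vector_mappings(dims: int = E5_DIMS) -> dict:
    """Return index mappings for the two inference-result fields."""
    return {
        "properties": {
            # Dense embeddings: dims must match the model's embedding size.
            "e5_description_vector": {"type": "dense_vector", "dims": dims},
            # Assumption: ELSER token weights stored in a sparse_vector field.
            "elser_description_vector": {"type": "sparse_vector"},
        }
    }


mappings = build_vector_mappings()
print(mappings["properties"]["e5_description_vector"])
```

A dictionary like this could then be passed as the `mappings` argument when creating the index with the Elasticsearch Python client, for example `client.indices.create(index="ecommerce", mappings=mappings)`, assuming a configured `client` and a cluster version that supports both field types.
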
carlyrichmond · 2025-07-16T09:52:05Z

.../lexical-and-semantic-search-with-elasticsearch/updated-ecommerce_dense_sparse_project.ipynb

   "source": [
    "# Performs text analysis on a string and returns the resulting tokens.\n",
-    "\n",
+    "# TODO: Partial Smoosh together\n",


Let's remove the TODOs (there are a couple in this file).

carlyrichmond

I think there's still 1 TODO comment in there, but aside from that LGTM.

Added an updated version of notebook for video script

22f913c

Justin Castilla added 2 commits July 15, 2025 07:41

removes old install script

ea28703

updating cell outputs, copy, removes unecessary scripts

b8c06b6

removes older code patterns, adds table of results

b76d5da

carlyrichmond requested changes Jul 16, 2025

Justin Castilla added 4 commits July 17, 2025 13:33

adds copy and semantic_text examples

3cd8fdb

sample object edit

ca99d4f

removes commented code

81af33f

adds e5_semantic_text_search_results

ee98aa5

carlyrichmond requested changes Jul 18, 2025
