
Commit 2fee9e9

load env

1 parent 108b5e7 commit 2fee9e9

File tree

2 files changed: +13 -49 lines changed

main.ipynb

+13 -48
@@ -6,21 +6,23 @@
  "metadata": {},
  "outputs": [],
  "source": [
-  "!pip3 install google\n",
-  "!pip3 install beautifulsoup4"
+  "!pip install google\n",
+  "!pip install beautifulsoup4"
  ]
  },
  {
  "cell_type": "code",
- "execution_count": 24,
+ "execution_count": 5,
  "metadata": {},
  "outputs": [],
  "source": [
  "from googlesearch import search\n",
  "import requests\n",
  "import re\n",
  "import os\n",
-  "from bs4 import BeautifulSoup"
+  "from bs4 import BeautifulSoup\n",
+  "from dotenv import load_dotenv, find_dotenv\n",
+  "_ = load_dotenv(find_dotenv())"
  ]
  },
  {
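
Note on the change above: find_dotenv() walks up from the notebook's working directory until it locates a .env file, and load_dotenv() reads its KEY=value pairs into the process environment, so credentials stay out of the committed notebook. A minimal sketch of the pattern this enables (the OPENAI_API_KEY name is only an illustrative assumption; the diff does not show which variables the notebook reads):

    # Hypothetical .env file next to main.ipynb (keep it out of version control):
    # OPENAI_API_KEY=sk-...

    import os
    from dotenv import load_dotenv, find_dotenv

    # find_dotenv() searches the current directory and its parents for a .env
    # file; load_dotenv() loads its entries into os.environ without overriding
    # variables that are already set.
    _ = load_dotenv(find_dotenv())

    # Downstream code then reads the secret like any other environment variable.
    api_key = os.getenv("OPENAI_API_KEY")  # assumed key name, not shown in this diff

The underscore assignment simply discards the boolean that load_dotenv() returns (True if a .env file was found and loaded).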
@@ -155,7 +157,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 25,
+ "execution_count": 6,
  "metadata": {},
  "outputs": [],
  "source": [
@@ -202,42 +204,18 @@
  },
  {
  "cell_type": "code",
- "execution_count": 12,
+ "execution_count": null,
  "metadata": {},
- "outputs": [
-  {
-   "name": "stdout",
-   "output_type": "stream",
-   "text": [
-    "[Document(page_content='Skip to main contentWhat is a large language model (LLM)?Explore popular open-source LLMsLarge language model definitionA large language model (LLM) is a deep learning algorithm that can perform a variety of natural language processing (NLP) tasks. Large language models use transformer models and are trained using massive datasets — hence, large. This enables them to recognize, translate, predict, or generate text or other content.Large language models are also referred to as neural networks (NNs), which are computing systems inspired by the human brain. These neural networks work using a network of nodes that are layered, much like neurons.In addition to teaching human languages to artificial intelligence (AI) applications, large language models can also be trained to perform a variety of tasks like understanding protein structures, writing software code, and more. Like the human brain, large language models must be pre-trained and then fine-tuned so that they can solve text classification, question', metadata={'source': 'scraped.txt'}), Document(page_content='text classification, question answering, document summarization, and text generation problems. Their problem-solving capabilities can be applied to fields like healthcare, finance, and entertainment where large language models serve a variety of NLP applications, such as translation, chatbots, AI assistants, and so on.Large language models also have large numbers of parameters, which are akin to memories the model collects as it learns from training. Think of these parameters as the model’s knowledge bank.So, what is a transformer model? A transformer model is the most common architecture of a large language model. It consists of an encoder and a decoder. A transformer model processes data by tokenizing the input, then simultaneously conducting mathematical equations to discover relationships between tokens. This enables the computer to see the patterns a human would see were it given the same query.Transformer models work with self-attention mechanisms, which enables the model to learn more quickly than', metadata={'source': 'scraped.txt'}), Document(page_content=\"to learn more quickly than traditional models like long short-term memory models. Self-attention is what enables the transformer model to consider different parts of the sequence, or the entire context of a sentence, to generate predictions.Related: Apply transformers to your search applicationsKey components of large language modelsLarge language models are composed of multiple neural network layers. Recurrent layers, feedforward layers, embedding layers, and attention layers work in tandem to process the input text and generate output content.The embedding layer creates embeddings from the input text. This part of the large language model captures the semantic and syntactic meaning of the input, so the model can understand context.The feedforward layer (FFN) of a large language model is made of up multiple fully connected layers that transform the input embeddings. In so doing, these layers enable the model to glean higher-level abstractions — that is, to understand the user's intent with the text\", metadata={'source': 'scraped.txt'}), Document(page_content=\"user's intent with the text input.The recurrent layer interprets the words in the input text in sequence. It captures the relationship between words in a sentence.The attention mechanism enables a language model to focus on single parts of the input text that is relevant to the task at hand. This layer allows the model to generate the most accurate outputs.There are three main kinds of large language models:Generic or raw language models predict the next word based on the language in the training data. These language models perform information retrieval tasks.Instruction-tuned language models are trained to predict responses to the instructions given in the input. This allows them to perform sentiment analysis, or to generate text or code.Dialog-tuned language models are trained to have a dialog by predicting the next response. Think of chatbots or conversational AI.What is the difference between large language models and generative AI?Generative AI is an umbrella term that refers to artificial intelligence\", metadata={'source': 'scraped.txt'}), Document(page_content='to artificial intelligence models that have the capability to generate content. Generative AI can generate text, code, images, video, and music. Examples of generative AI include Midjourney, DALL-E, and ChatGPT.Large language models are a type of generative AI that are trained on text and produce textual content. ChatGPT is a popular example of generative text AI.All large language models are generative AI1.How do large language models work?A large language model is based on a transformer model and works by receiving an input, encoding it, and then decoding it to produce an output prediction. But before a large language model can receive text input and generate an output prediction, it requires training, so that it can fulfill general functions, and fine-tuning, which enables it to perform specific tasks.Training: Large language models are pre-trained using large textual datasets from sites like Wikipedia, GitHub, or others. These datasets consist of trillions of words, and their quality will affect the', metadata={'source': 'scraped.txt'}), Document(page_content='their quality will affect the language model\\'s performance. At this stage, the large language model engages in unsupervised learning, meaning it processes the datasets fed to it without specific instructions. During this process, the LLM\\'s AI algorithm can learn the meaning of words, and of the relationships between words. It also learns to distinguish words based on context. For example, it would learn to understand whether \"right\" means \"correct,\" or the opposite of \"left', metadata={'source': 'scraped.txt'})]\n"
-   ]
-  }
- ],
+ "outputs": [],
  "source": [
  "chunks = createChunking(doc)"
  ]
  },
  {
  "cell_type": "code",
- "execution_count": 13,
+ "execution_count": null,
  "metadata": {},
- "outputs": [
-  {
-   "data": {
-    "text/plain": [
-     "[Document(page_content='Skip to main contentWhat is a large language model (LLM)?Explore popular open-source LLMsLarge language model definitionA large language model (LLM) is a deep learning algorithm that can perform a variety of natural language processing (NLP) tasks. Large language models use transformer models and are trained using massive datasets — hence, large. This enables them to recognize, translate, predict, or generate text or other content.Large language models are also referred to as neural networks (NNs), which are computing systems inspired by the human brain. These neural networks work using a network of nodes that are layered, much like neurons.In addition to teaching human languages to artificial intelligence (AI) applications, large language models can also be trained to perform a variety of tasks like understanding protein structures, writing software code, and more. Like the human brain, large language models must be pre-trained and then fine-tuned so that they can solve text classification, question', metadata={'source': 'scraped.txt'}),\n",
-     " Document(page_content='text classification, question answering, document summarization, and text generation problems. Their problem-solving capabilities can be applied to fields like healthcare, finance, and entertainment where large language models serve a variety of NLP applications, such as translation, chatbots, AI assistants, and so on.Large language models also have large numbers of parameters, which are akin to memories the model collects as it learns from training. Think of these parameters as the model’s knowledge bank.So, what is a transformer model? A transformer model is the most common architecture of a large language model. It consists of an encoder and a decoder. A transformer model processes data by tokenizing the input, then simultaneously conducting mathematical equations to discover relationships between tokens. This enables the computer to see the patterns a human would see were it given the same query.Transformer models work with self-attention mechanisms, which enables the model to learn more quickly than', metadata={'source': 'scraped.txt'}),\n",
-     " Document(page_content=\"to learn more quickly than traditional models like long short-term memory models. Self-attention is what enables the transformer model to consider different parts of the sequence, or the entire context of a sentence, to generate predictions.Related: Apply transformers to your search applicationsKey components of large language modelsLarge language models are composed of multiple neural network layers. Recurrent layers, feedforward layers, embedding layers, and attention layers work in tandem to process the input text and generate output content.The embedding layer creates embeddings from the input text. This part of the large language model captures the semantic and syntactic meaning of the input, so the model can understand context.The feedforward layer (FFN) of a large language model is made of up multiple fully connected layers that transform the input embeddings. In so doing, these layers enable the model to glean higher-level abstractions — that is, to understand the user's intent with the text\", metadata={'source': 'scraped.txt'}),\n",
-     " Document(page_content=\"user's intent with the text input.The recurrent layer interprets the words in the input text in sequence. It captures the relationship between words in a sentence.The attention mechanism enables a language model to focus on single parts of the input text that is relevant to the task at hand. This layer allows the model to generate the most accurate outputs.There are three main kinds of large language models:Generic or raw language models predict the next word based on the language in the training data. These language models perform information retrieval tasks.Instruction-tuned language models are trained to predict responses to the instructions given in the input. This allows them to perform sentiment analysis, or to generate text or code.Dialog-tuned language models are trained to have a dialog by predicting the next response. Think of chatbots or conversational AI.What is the difference between large language models and generative AI?Generative AI is an umbrella term that refers to artificial intelligence\", metadata={'source': 'scraped.txt'}),\n",
-     " Document(page_content='to artificial intelligence models that have the capability to generate content. Generative AI can generate text, code, images, video, and music. Examples of generative AI include Midjourney, DALL-E, and ChatGPT.Large language models are a type of generative AI that are trained on text and produce textual content. ChatGPT is a popular example of generative text AI.All large language models are generative AI1.How do large language models work?A large language model is based on a transformer model and works by receiving an input, encoding it, and then decoding it to produce an output prediction. But before a large language model can receive text input and generate an output prediction, it requires training, so that it can fulfill general functions, and fine-tuning, which enables it to perform specific tasks.Training: Large language models are pre-trained using large textual datasets from sites like Wikipedia, GitHub, or others. These datasets consist of trillions of words, and their quality will affect the', metadata={'source': 'scraped.txt'}),\n",
-     " Document(page_content='their quality will affect the language model\\'s performance. At this stage, the large language model engages in unsupervised learning, meaning it processes the datasets fed to it without specific instructions. During this process, the LLM\\'s AI algorithm can learn the meaning of words, and of the relationships between words. It also learns to distinguish words based on context. For example, it would learn to understand whether \"right\" means \"correct,\" or the opposite of \"left', metadata={'source': 'scraped.txt'})]"
-    ]
-   },
-   "execution_count": 13,
-   "metadata": {},
-   "output_type": "execute_result"
-  }
- ],
+ "outputs": [],
  "source": [
  "chunks"
  ]
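
The outputs deleted above still document behavior: createChunking(doc) splits the scraped page into overlapping Documents tagged metadata={'source': 'scraped.txt'}, and each chunk repeats the tail of its predecessor ("...text classification, question"), the signature of an overlapping splitter. The implementation is not part of this commit; the following is a minimal sketch under the assumption that it uses LangChain's RecursiveCharacterTextSplitter, with chunk sizes that are guesses:

    from langchain.text_splitter import RecursiveCharacterTextSplitter

    def createChunking(doc):
        # Split the scraped text into roughly 1,000-character pieces; the
        # chunk_overlap is what would make each chunk repeat the tail of the
        # previous one, as seen in the deleted outputs. Both numbers are guesses.
        splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
        return splitter.create_documents([doc], metadatas=[{"source": "scraped.txt"}])

    chunks = createChunking(doc)  # doc: the scraped page text (assumed to be a str)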
@@ -322,22 +300,9 @@
  },
  {
  "cell_type": "code",
- "execution_count": 20,
+ "execution_count": null,
  "metadata": {},
- "outputs": [
-  {
-   "name": "stdout",
-   "output_type": "stream",
-   "text": [
-    "Human: \n",
-    "Answer the question based only on the following context:\n",
-    "[(Document(page_content='Skip to main contentWhat is a large language model (LLM)?Explore popular open-source LLMsLarge language model definitionA large language model (LLM) is a deep learning algorithm that can perform a variety of natural language processing (NLP) tasks. Large language models use transformer models and are trained using massive datasets — hence, large. This enables them to recognize, translate, predict, or generate text or other content.Large language models are also referred to as neural networks (NNs), which are computing systems inspired by the human brain. These neural networks work using a network of nodes that are layered, much like neurons.In addition to teaching human languages to artificial intelligence (AI) applications, large language models can also be trained to perform a variety of tasks like understanding protein structures, writing software code, and more. Like the human brain, large language models must be pre-trained and then fine-tuned so that they can solve text classification, question', metadata={'source': 'scraped.txt'}), 0.8211523777245459), (Document(page_content='text classification, question answering, document summarization, and text generation problems. Their problem-solving capabilities can be applied to fields like healthcare, finance, and entertainment where large language models serve a variety of NLP applications, such as translation, chatbots, AI assistants, and so on.Large language models also have large numbers of parameters, which are akin to memories the model collects as it learns from training. Think of these parameters as the model’s knowledge bank.So, what is a transformer model? A transformer model is the most common architecture of a large language model. It consists of an encoder and a decoder. A transformer model processes data by tokenizing the input, then simultaneously conducting mathematical equations to discover relationships between tokens. This enables the computer to see the patterns a human would see were it given the same query.Transformer models work with self-attention mechanisms, which enables the model to learn more quickly than', metadata={'source': 'scraped.txt'}), 0.7714181820213781), (Document(page_content='to artificial intelligence models that have the capability to generate content. Generative AI can generate text, code, images, video, and music. Examples of generative AI include Midjourney, DALL-E, and ChatGPT.Large language models are a type of generative AI that are trained on text and produce textual content. ChatGPT is a popular example of generative text AI.All large language models are generative AI1.How do large language models work?A large language model is based on a transformer model and works by receiving an input, encoding it, and then decoding it to produce an output prediction. But before a large language model can receive text input and generate an output prediction, it requires training, so that it can fulfill general functions, and fine-tuning, which enables it to perform specific tasks.Training: Large language models are pre-trained using large textual datasets from sites like Wikipedia, GitHub, or others. These datasets consist of trillions of words, and their quality will affect the', metadata={'source': 'scraped.txt'}), 0.7623094323313777), (Document(page_content='their quality will affect the language model\\'s performance. At this stage, the large language model engages in unsupervised learning, meaning it processes the datasets fed to it without specific instructions. During this process, the LLM\\'s AI algorithm can learn the meaning of words, and of the relationships between words. It also learns to distinguish words based on context. For example, it would learn to understand whether \"right\" means \"correct,\" or the opposite of \"left', metadata={'source': 'scraped.txt'}), 0.7468296416864612), (Document(page_content=\"user's intent with the text input.The recurrent layer interprets the words in the input text in sequence. It captures the relationship between words in a sentence.The attention mechanism enables a language model to focus on single parts of the input text that is relevant to the task at hand. This layer allows the model to generate the most accurate outputs.There are three main kinds of large language models:Generic or raw language models predict the next word based on the language in the training data. These language models perform information retrieval tasks.Instruction-tuned language models are trained to predict responses to the instructions given in the input. This allows them to perform sentiment analysis, or to generate text or code.Dialog-tuned language models are trained to have a dialog by predicting the next response. Think of chatbots or conversational AI.What is the difference between large language models and generative AI?Generative AI is an umbrella term that refers to artificial intelligence\", metadata={'source': 'scraped.txt'}), 0.7287668130767889)]\n",
-    "\n",
-    "Answer the question based only on the above context: what is Large Language models\n",
-    "\n"
-   ]
-  }
- ],
+ "outputs": [],
  "source": [
  "print(prompt)"
  ]
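
The deleted stdout here exposes the retrieval step: (Document, score) pairs from a vector-store similarity search are interpolated, as a raw Python list, into a template that repeats the question, and the leading "Human:" is how a LangChain chat prompt renders when printed. A hedged reconstruction follows; the Chroma store, OpenAIEmbeddings (a plausible reason the notebook now needs load_dotenv), and the similarity_search_with_relevance_scores call are assumptions consistent with the removed output, not code shown in this commit:

    from langchain.prompts import ChatPromptTemplate
    from langchain.vectorstores import Chroma
    from langchain.embeddings import OpenAIEmbeddings  # picks up its API key from the environment

    PROMPT_TEMPLATE = (
        "\nAnswer the question based only on the following context:\n{context}\n"
        "\nAnswer the question based only on the above context: {question}\n"
    )

    question = "what is Large Language models"

    # Build a vector store over the chunks created earlier; the store and the
    # embedding model are both assumptions, not shown in this commit.
    db = Chroma.from_documents(chunks, OpenAIEmbeddings())

    # Returns (Document, relevance_score) pairs, the exact shape visible in
    # the removed output (scores descending from 0.82 to 0.73).
    results = db.similarity_search_with_relevance_scores(question, k=5)

    # The raw list of pairs is formatted straight into the template, which
    # matches the deleted output: it shows tuple reprs, not joined page text.
    prompt = ChatPromptTemplate.from_template(PROMPT_TEMPLATE).format(
        context=results, question=question
    )
    print(prompt)  # prints with a "Human: " prefix, as in the removed output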
