@@ -20,7 +20,8 @@ and other applications.
2020
2121These applications can answer questions about specific sources of information,
2222for example using techniques like Retrieval Augmented Generation (RAG).
23- RAG is a technique for augmenting LLM knowledge with additional data.
23+ RAG is a technique for augmenting LLM knowledge with additional data,
24+ often private or real-time.
2425:::
2526
2627::::{grid} 2
@@ -52,6 +53,7 @@ using SQL: CrateDB is all you need.
5253:link-type: ref
5354LangChain is a framework for developing applications powered by language models,
5455written in Python, and with a strong focus on composability.
56+ It supports retrieval-augmented generation (RAG).
5557+++
5658The LangChain adapter for CrateDB provides support to use CrateDB as a vector
5759store database, to load documents using LangChain’s DocumentLoader, and also
@@ -61,10 +63,13 @@ supports LangChain’s conversational memory subsystem.
6163::::
6264
6365
64- ## Text-to-SQL and MCP
66+ (text-to-sql)=
67+ ## Text-to-SQL
6568
66- The adapters enumerated below integrate CrateDB for Text-to-SQL purposes,
69+ :::{div} sd-text-muted
70+ Integrate CrateDB with Text-to-SQL solutions,
6771and provide MCP and AI enterprise data integrations.
72+ :::
6873
6974::::{grid} 2
7075:gutter: 4
@@ -76,7 +81,7 @@ Text-to-SQL is a technique that converts natural language queries into SQL
7681queries that can be executed by a database.
7782:::
7883
79- :::{grid-item-card} MCP
84+ :::{grid-item-card} All about MCP
8085:link : mcp
8186:link-type: ref
8287The Model Context Protocol (MCP), is an open protocol that enables seamless
@@ -92,8 +97,54 @@ MindsDB is the platform for customizing AI from enterprise data.
9297::::
9398
9499
100+ ## Time series analysis
101+
102+ :::{div} sd-text-muted
103+ Load and analyze data from database systems for
104+ time series anomaly detection and forecasting.
105+ :::
106+
107+ ::::{grid} 2
108+ :gutter: 4
109+
110+ :::{grid-item-card} Statistical analysis and visualization on huge datasets
111+ :link : r-tutorial
112+ :link-type: ref
113+ Learn how to create a machine learning pipeline using R and CrateDB.
114+ :::
115+
116+ :::{grid-item-card} Regression analysis with pandas and scikit-learn
117+ :link : scikit-learn
118+ :link-type: ref
119+ Use pandas and scikit-learn to run a regression analysis within a Jupyter Notebook.
120+ :::
121+
122+ :::{grid-item-card} Build model for predictive maintenance with TensorFlow
123+ :link : tensorflow-tutorial
124+ :link-type: ref
125+ Learn how to build a machine learning model that will predict whether
126+ a machine will fail within a specified time window in the future.
127+ :::
128+
129+ :::{grid-item-card} Advanced time series analysis with MLflow and PyCaret
130+ :link : ml-timeseries
131+ :link-type: ref
132+ Learn how to conduct advanced data analysis on large time series datasets
133+ with CrateDB, MLflow, and PyCaret:
134+ Anomaly detection and forecasting, time series decomposition,
135+ Exploratory data analysis (EDA).
136+ :::
137+
138+ ::::
139+
140+
95141## MLOps and model training
96142
143+ :::{div} sd-text-muted
144+ CrateDB supports MLOps procedures through adapters to best-of-breed software
145+ frameworks.
146+ :::
147+
97148:::{div}
98149Training a machine learning model, running it in production, and maintaining
99150it, requires a significant amount of data processing and bookkeeping
@@ -103,16 +154,14 @@ Machine Learning Operations [MLOps] is a paradigm that aims to deploy and
103154maintain machine learning models in production reliably and efficiently,
104155including experiment tracking, and in the spirit of continuous development
105156and DevOps.
106-
107- CrateDB supports MLOps procedures through adapters to best-of-breed software
108- frameworks.
109157:::
110158
111159::::{grid} 2
112160:gutter: 4
113161
114162:::{grid-item-card} MLflow
115163:link : mlflow
164+ :link-type: ref
116165MLflow is an open-source platform to manage the whole ML lifecycle,
117166including experimentation, reproducibility, deployment, and a central
118167model registry.
@@ -122,36 +171,29 @@ CrateDB can be used as a storage database for the MLflow Tracking subsystem.
122171
123172:::{grid-item-card} PyCaret
124173:link : pycaret
174+ :link-type: ref
125175PyCaret is an open-source, low-code machine learning library for Python
126176that automates machine learning workflows (AutoML).
127177+++
128178CrateDB can be used as a storage database for training and production datasets.
129179:::
130180
131- ::::
132-
133-
134- ## Time-series anomaly detection and forecasting
135-
136- Load and analyze data from database systems.
137-
138- ::::{grid} 2
139- :gutter: 4
140-
141- :::{grid-item-card} Statistical analysis and visualization on huge datasets
142- :link : r-tutorial
143- Learn how to create a Machine Learning pipeline using R and CrateDB.
181+ :::{grid-item-card} Advanced time series analysis with MLflow and PyCaret
182+ :link : ml-timeseries
183+ :link-type: ref
184+ :columns: 12
185+ Learn how to conduct advanced data analysis on large time series datasets
186+ with CrateDB, MLflow, and PyCaret.
187+ +++
188+ ** What's inside:** Anomaly detection and forecasting, time series decomposition,
189+ exploratory data analysis (EDA).
144190:::
145191
146- :::{grid-item-card} Regression analysis with pandas and scikit-learn
147- :link : scikit-learn-tutorial
148- Use pandas and scikit-learn to run a regression analysis within a Jupyter Notebook.
149- :::
192+ ::::
150193
151- :::{grid-item-card} Use TensorFlow and CrateDB for predictive maintenance
152- :link : tensorflow-tutorial
153- Learn how to build a machine learning model that will predict whether
154- a machine will fail within a specified time window in the future.
155- :::
156194
157- ::::
195+ :::{toctree}
196+ :maxdepth: 1
197+ :hidden:
198+ time-series
199+ :::
0 commit comments