You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pages/public_cloud/ai_machine_learning/endpoints_guide_01_getting_started/guide.de-de.md
+53-57
Original file line number
Diff line number
Diff line change
@@ -1,13 +1,11 @@
1
1
---
2
2
title: AI Endpoints - Getting started
3
3
excerpt: Discover AI Endpoints, the secure serverless platform by OVHcloud for developers to access top AI models with easy-to-use APIs. No AI expertise needed.
4
-
updated: 2025-02-10
4
+
updated: 2025-04-28
5
5
---
6
6
7
7
> [!primary]
8
8
>
9
-
> AI Endpoints is currently in **Beta**. Although we aim to offer a production-ready product even in this testing phase, service availability may not be guaranteed. Please be careful if you use endpoints for production, as the Beta phase is not yet complete.
10
-
>
11
9
> AI Endpoints is covered by the **[OVHcloud AI Endpoints Conditions](https://storage.gra.cloud.ovh.net/v1/AUTH_325716a587c64897acbef9a4a4726e38/contracts/48743bf-AI_Endpoints-ALL-1.1.pdf)** and the **[OVHcloud Public Cloud Special Conditions](https://storage.gra.cloud.ovh.net/v1/AUTH_325716a587c64897acbef9a4a4726e38/contracts/d2a208c-Conditions_particulieres_OVH_Stack-WE-9.0.pdf)**.
12
10
>
13
11
@@ -21,73 +19,60 @@ With no extensive AI expertise required, AI Endpoints is an ideal choice for dev
21
19
22
20
The objective of this guide is to help developers interested in AI quickly and easily get started with [AI Endpoints](https://endpoints.ai.cloud.ovh.net/).
23
21
24
-
It explains how to obtain an access token, access AI models, and interact with AI APIs on the [AI Endpoints](https://endpoints.ai.cloud.ovh.net/) platform. By following this guide, you will learn how to integrate AI capabilities into your applications with ease.
22
+
It explains how to obtain an access key, access AI models, and interact with AI APIs on the [AI Endpoints](https://endpoints.ai.cloud.ovh.net/) platform. By following this guide, you will learn how to integrate AI capabilities into your applications with ease.
25
23
26
24
## Requirements
27
25
28
26
- A [Public Cloud project](/links/public-cloud/public-cloud) in your OVHcloud account
29
27
30
28
## Instructions
31
29
32
-
### Getting an access token
33
-
34
-
Getting a token enables you to use the models available in our [catalog](https://endpoints.ai.cloud.ovh.net/catalog) and test their integration into your solutions. To obtain an access token, please follow the steps below:
35
-
36
-
**1\. Visit the AI Endpoints access token page**
37
-
38
-
Click [here](https://endpoints.ai.cloud.ovh.net/) to access AI Endpoints. You can also find the product from the [OVHcloud Control Panel](/links/manager), in the `AI & Machine Learning` category, from the left-hand vertical menu.
30
+
### Generating your first API access key
39
31
40
-
**2\. Generate an access token**
32
+
Getting an API key enables you to use the models available in our [catalog](https://endpoints.ai.cloud.ovh.net/catalog) and test their integration into your solutions. To obtain an API access key, please follow the steps below:
41
33
42
-
Once on the site page, click the `Get it`{.action} button. You will be asked to confirm that you accept the [terms of service](https://storage.gra.cloud.ovh.net/v1/AUTH_325716a587c64897acbef9a4a4726e38/contracts/48743bf-AI_Endpoints-ALL-1.1.pdf) of the product.
Log in to the [OVHcloud Control Panel](/links/manager), navigate to the `Public Cloud`{.action} section, select your desired Public Cloud project, then go to the `AI & Machine Learning`{.action} category in the left menu and choose `AI Endpoints`{.action}.
45
37
46
-
**3\. Scope created token to one of your Public Cloud Projects**
This will redirect you to the **Public Cloud**. If you haven't already done so, log in to your account, and confirm the following AI OVH OAUTH authorization:
From there, click the `Generate my first access Key`{.action} blue button to create your API access key. Next, click the `+ Create a new API key`{.action} button. You will be asked to provide a **name** for the key and an optional **description**. You can also set an **expiration date** for the key if desired.
51
43
52
-
You will then need to choose the lifetime of your token, and also select one of the Public Cloud projects of your account to which you want to attach the token.
53
-
54
-
> [!warning]
55
-
>
56
-
> If you don't have any projects created in your Public Cloud account, you will receive an error message telling you to create one, with a link to follow.
57
-
>
44
+
Once you have filled in the required information, click the `Create`{.action} button to confirm the creation of your API key.
58
45
59
-
Once the authorization process is completed, confirm the token creation by clicking the `Generate token`{.action} button.
Once created, the key will be displayed in the API keys table. You will see your new access key in this table, with its information (name, description, expiry date).
66
53
67
-
This will redirect you to the AI Endpoints site, where you will find your new token in the token field. You will be reminded of the expiry date of your token, and you will be able to copy it.
68
-
69
-
{.thumbnail}
54
+
Your key value will be displayed and you can copy it by clicking the copy icon.
70
55
71
56
> [!primary]
72
57
>
73
-
> It is essential that you **keep your API token private and confidential**.
58
+
> It is essential that you **keep your API key private and confidential**.
74
59
>
75
-
> Moreover, the token displayed will not be stored in the website's memory, so please **make sure to save it securely on your side** for future usage.
60
+
> Moreover, the API key displayed will not be stored in the website's memory, so please **make sure to store it securely on your side** for future usage.
76
61
>
77
62
78
-
With your access token in hand, you are now ready to access the AI models and their easy-to-use APIs.
63
+
With your access API key in hand, you are now ready to access the AI models and their easy-to-use APIs.
79
64
80
65
### Accessing AI models
81
66
82
-
Once your token has been generated, you can navigate to the [Catalog page](https://endpoints.ai.cloud.ovh.net/catalog) to choose the AI model you want to interact with.
67
+
Once your API key has been generated, you can navigate to the [Catalog page](https://endpoints.ai.cloud.ovh.net/catalog) to choose the AI model you want to interact with.
83
68
84
69
AI Endpoints offers a variety of world-renowned AI models to choose from, including:
85
70
86
71
-**Large Language Models (LLM)**: Use models like LLaMa 3, Mistral and more, for conversations and RAG use cases.
87
72
-**Reasoning LLM**: Use reasoning models like DeepSeek-R1 distillations for maths, coding or complex tasks.
88
-
-**Code LLM**: Code generation and code completion from an IDE with models like Codestral or CodeLLaMa.
73
+
-**Code LLM**: Code generation and code completion from an IDE with models like Qwen Coder or Codestral.
89
74
-**Visual LLM**: Multimodal models such as LLaVa-Next, that are able to process images and text inputs, for image understanding or OCR use cases.
90
-
-**Embeddings**: Generate embeddings for use in machine learning applications (BGE base, Multilingual E5).
75
+
-**Embeddings**: Generate embeddings for use in machine learning applications (BGE Base, BGE Multilingual Gemma2, ...).
91
76
-**Natural Language Processing**: Use models like RoBERTa, Bert, and T5 for NLP tasks like sentiment analysis, entity recognition, and text summarization.
92
77
-**Image Generation**: Generate images using Stable Diffusion XL.
93
78
-**Audio Analysis**: Automatic Speech Recognition and Text to Speech using NVIDIA models.
@@ -107,27 +92,27 @@ This will take you to a dedicated page with several options for interacting with
107
92
>>
108
93
>> This option allows you to quickly try out the model by playing with it to see if it meets your needs. This is a great way to get a feel for the model without having to code.
>> The section provides detailed documentation for the model, including example Python code that demonstrates how to interact with the model using its API. The documentation also includes the OpenAI specification codes, as our **LLM APIs are compatible with the OpenAI specifications**.
115
100
>>
116
-
>> To ensure that these code examples work as intended, you should replace the placeholder value `(os.getenv('OVH_AI_ENDPOINTS_ACCESS_TOKEN'))` with your own API token and set it as an environment variable.
101
+
>> To ensure that these code examples work as intended, you should replace the placeholder value `(os.getenv('OVH_AI_ENDPOINTS_ACCESS_TOKEN'))` with your own API key and set it as an environment variable.
>> There, you will find blog articles related to AI Endpoints that you may find helpful in learning how to use the model more effectively. Whether you're building a chatbot with Langchain and JavaScript or creating a video translator app, we provide step-by-step guidance to support your AI projects.
107
+
>> There, you will find guides related to AI Endpoints that you may find helpful in learning how to use the model more effectively. Whether you're building a chatbot with Langchain and JavaScript or creating a video translator app, we provide step-by-step guidance to support your AI projects.
Alternatively, you can also revoke your API key using the `Revoke API key`{.action} button from the [AI Endpoints](https://endpoints.ai.cloud.ovh.net/) website. However, please note that this button will only allow you to revoke the most recently created API key.
146
+
147
+
Once done, you can confirm its deletion by trying to send a request using your revoked API key.
148
+
149
+
### Model rate limit
150
+
151
+
When using AI Endpoints, the **following rate limits apply**:
152
+
153
+
-**Anonymous**: 2 requests per minute, per IP and per model.
154
+
-**Authenticated with an API access key**: 400 requests per minute, per PCI project and per model.
159
155
160
-
Alternatively, you can also revoke your API token using the `Revoke API token`{.action} button from the [AI Endpoints](https://endpoints.ai.cloud.ovh.net/) website. However, please note that this button will only allow you to revoke the most recently created token.
156
+
If you exceed this limit, a **429 error code**will be returned.
161
157
162
-
Once done, you can confirm its deletion by trying to send a request using your revoked token.
158
+
If you require higher usage, please **[get in touch with us](https://help.ovhcloud.com/csm?id=csm_get_help)** to discuss increasing your rate limits.
163
159
164
160
### Billing and usage
165
161
166
-
AI Endpoints remains **free** during its beta phase.
162
+
For information on pricing and the models lifecycle of the platform, please refer to the [AI Endpoints - Billing and lifecycle](/pages/public_cloud/ai_machine_learning/endpoints_guide_04_billing_concept) documentation.
167
163
168
-
In the interest of transparency towards our customers, you will soon be able to monitor your estimated consumption and resource usage via the [OVHcloud Control Panel](/links/manager), in the `AI Endpoints`{.action} section of the `AI & Machine Learning` category, in the left-hand vertical menu.
164
+
For your convenience, you can monitor your estimated consumption and resource usage through the [OVHcloud Control Panel](/links/manager). To do so, navigate to the `AI Endpoints`{.action} section of the `AI & Machine Learning` category, in the left-hand vertical menu.
169
165
170
166
## Going further
171
167
172
-
To discover how to build complete and powerful applications using AI Endpoints, explore our dedicated [AI Endpoints blog page](https://blog.ovhcloud.com/tag/ai-endpoints/). This blog offers a wealth of knowledge and inspiration, including the following articles:
168
+
To discover how to build complete and powerful applications using AI Endpoints, explore our dedicated AI Endpoints guides which offer a wealth of knowledge and inspiration, including the following subjects:
173
169
174
-
-[Create your own Audio Summarizer assistant with AI Endpoints](https://blog.ovhcloud.com/create-audio-summarizer-assistant-with-ai-endpoints/)
175
-
-[Implement chatbot memory management with LangChain and AI Endpoints](https://blog.ovhcloud.com/chatbot-memory-management-with-langchain-and-ai-endpoints/)
176
-
-[Discover how to create a Retrieval Augmented Generation (RAG) system](https://blog.ovhcloud.com/reference-architecture-retrieval-augmented-generation-rag/)
170
+
-[Create your own Audio Summarizer assistant with AI Endpoints](/pages/public_cloud/ai_machine_learning/endpoints_tuto_01_audio_summarizer)
171
+
-[Implement chatbot memory management with LangChain and AI Endpoints](/pages/public_cloud/ai_machine_learning/endpoints_tuto_09_chatbot_memory_langchain)
172
+
-[Discover how to create a Retrieval Augmented Generation (RAG) system](/pages/public_cloud/ai_machine_learning/endpoints_tuto_11_rag_chatbot_langchain)
177
173
-[Discover more about AI Endpoints features and limitations](/pages/public_cloud/ai_machine_learning/endpoints_guide_02_capabilities)
178
174
179
175
If you need training or technical assistance to implement our solutions, contact your sales representative or click on [this link](/links/professional-services) to get a quote and ask our Professional Services experts for a custom analysis of your project.
0 commit comments