Skip to content

Commit eba662e

Browse files
authored
Merge branch 'main' into refresh_ubi_docs_part_deux
2 parents 638ba70 + ac2b3a5 commit eba662e

File tree

284 files changed

+6996
-1041
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

284 files changed

+6996
-1041
lines changed

.github/PULL_REQUEST_TEMPLATE.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
_Describe what this change achieves._
33

44
### Issues Resolved
5-
Closes #[_insert issue number_]
5+
Closes #[_delete this text, including the brackets, and replace with the issue number_]
66

77
### Version
88
_List the OpenSearch version to which this PR applies, e.g. 2.14, 2.12--2.14, or all._
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
extends: existence
2+
message: "In links, use '{{site.url}}{{site.baseurl}}' instead of 'https://www.opensearch.org/docs/latest'."
3+
level: error
4+
nonword: true
5+
scope: raw
6+
tokens:
7+
- '\]\(https:\/\/www.opensearch.org\/docs\/latest'
8+
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
extends: substitution
2+
message: "Use '%s' instead of '%s'. Note the correct capitalization."
3+
ignorecase: false
4+
level: error
5+
action:
6+
name: replace
7+
swap:
8+
'Retrieval-Augmented Generation': retrieval-augmented generation

.github/vale/styles/OpenSearch/SubstitutionsError.yml

+1
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,7 @@ swap:
3737
'pre-trained': pretrained
3838
'premigration': pre-migration
3939
're-enable': reenable
40+
'retrieval augmented generation': retrieval-augmented generation
4041
'screen shot': screenshot
4142
'sample request': example request
4243
'sample response': example response

.github/vale/styles/Vocab/OpenSearch/Products/accept.txt

+2
Original file line numberDiff line numberDiff line change
@@ -68,6 +68,7 @@ OpenSearch Assistant
6868
OpenSearch Assistant Toolkit
6969
OpenSearch Benchmark
7070
OpenSearch Dashboards
71+
OpenSearch Flow
7172
OpenSearch Playground
7273
OpenSearch Project
7374
OpenSearch Service
@@ -99,4 +100,5 @@ Tribuo
99100
VisBuilder
100101
Winlogbeat
101102
XGBoost
103+
Zipf
102104
Zstandard
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
1+
[Aa]ss

.github/vale/tests/test-style-pos.md

+4
Original file line numberDiff line numberDiff line change
@@ -38,6 +38,8 @@ This sentence tests [links end slash]({{site.url}}{{site.baseurl}}/opensearch).
3838

3939
This sentence tests [links mid slash]({{site.url}}{{site.baseurl}}opensearch).
4040

41+
This sentence tests [links explicit](https://www.opensearch.org/docs/latest/double-slash/).
42+
4143
This sentence tests log-in as a noun. To login, we test this as a verb.
4244

4345
To test merge conflicts, remove tick marks in `<<<<<<< HEAD`.
@@ -74,6 +76,8 @@ This sentence tests splling.
7476

7577
This sentence tests substitution error by using the word indices.
7678

79+
This sentence tests substitution case-sensitive error by using the word Retrieval-Augmented Generation.
80+
7781
This sentence tests substitution suggestion due to its nature.
7882

7983
This Table | tests capitalization

.github/workflows/update-api-components.yml

+1-1
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ jobs:
2424
- run: bundle install
2525

2626
- name: Download spec and insert into documentation
27-
run: bundle exec jekyll spec-insert
27+
run: bundle exec jekyll spec-insert -F -R
2828

2929
- name: Get current date
3030
id: date

.vale.ini

+2
Original file line numberDiff line numberDiff line change
@@ -38,6 +38,7 @@ OpenSearch.LatinismsSubstitution = YES
3838
OpenSearch.LinksDoubleParentheses = YES
3939
OpenSearch.LinksDoubleSlash = YES
4040
OpenSearch.LinksEndSlash = YES
41+
OpenSearch.LinksExplicit = YES
4142
OpenSearch.LinksMidSlash = YES
4243
OpenSearch.LoginNoun = YES
4344
OpenSearch.LoginVerb = YES
@@ -60,6 +61,7 @@ OpenSearch.SpacingSlash = YES
6061
OpenSearch.SpacingWords = YES
6162
OpenSearch.Spelling = YES
6263
OpenSearch.StackedHeadings = YES
64+
OpenSearch.SubstitutionsErrorCaseSensitive = YES
6365
OpenSearch.SubstitutionsError = YES
6466
OpenSearch.SubstitutionsSuggestion = YES
6567
OpenSearch.TableHeadings = YES

DEVELOPER_GUIDE.md

+17-6
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@
77
- [Spec insert components](#spec-insert-components)
88
- [Query parameters](#query-parameters)
99
- [Path parameters](#path-parameters)
10-
- [Paths and HTTP methods](#paths-and-http-methods)
10+
- [Endpoints](#endpoints)
1111

1212
## Introduction
1313

@@ -49,7 +49,13 @@ bundle exec jekyll spec-insert
4949
If you are working on multiple Markdown files and do not want to keep running the `jekyll spec-insert` command, you can add the `--watch` (or `-W`) flag to the command to watch for changes in the Markdown files and automatically render the API components:
5050

5151
```shell
52-
bundle exec jekyll spec-insert --watch
52+
bundle exec jekyll spec-insert -W
53+
```
54+
55+
By default, when the plugin encounters an error when processing a file, the plugin prints out the error than moves on to the next file. If you want it to short-circuit when an error occurs, add the `--fail-on-error` (or `-F`) flag to the command:
56+
57+
```shell
58+
bundle exec jekyll spec-insert -F
5359
```
5460

5561
Depending on the text editor you are using, you may need to manually reload the file from disk to see the changes applied by the plugin if the editor does not automatically reload the file periodically.
@@ -63,21 +69,26 @@ bundle exec jekyll spec-insert --refresh-spec
6369
### Ignoring files and folders
6470
The `spec-insert` plugin ignores all files and folders listed in the [./_config.yml#exclude](./_config.yml) list, which is also the list of files and folders that Jekyll ignores.
6571

72+
### Configuration
73+
You can update the configuration settings for this plugin through the [config.yml](./spec-insert/config.yml) file.
74+
75+
_Note that tests for this plugin use a mock configuration [file](./spec-insert/spec/mock_config.yml) to assure that the tests still pass when the config file is altered. The expected output for the tests is based on the mock configuration file and will look different from the actual output when the plugin is run._
76+
6677
## CI/CD
6778
The `spec-insert` plugin is run as part of the CI/CD pipeline to ensure that the API components are up to date in the documentation. This is performed through the [update-api-components.yml](.github/workflows/update-api-components.yml) GitHub Actions workflow, which creates a pull request containing the updated API components every Sunday.
6879

6980
## Spec insert components
7081
All spec insert components accept the following arguments:
7182
- `api` (String; required): The name of the API to render the component from. This is equivalent to the `x-operation-group` field in the OpenSearch OpenAPI Spec.
72-
- `component` (String; required): The name of the component to render, such as `query_parameters`, `path_parameters`, or `paths_and_http_methods`.
83+
- `component` (String; required): The name of the component to render, such as `query_parameters`, `path_parameters`, or `endpoints`.
7384
- `omit_header` (Boolean; Default is `false`): If set to `true`, the markdown header of the component will not be rendered.
7485

75-
### Paths and HTTP methods
76-
To insert paths and HTTP methods for the `search` API, use the following snippet:
86+
### Endpoints
87+
To insert endpoints for the `search` API, use the following snippet:
7788
```markdown
7889
<!-- spec_insert_start
7990
api: search
80-
component: paths_and_http_methods
91+
component: endpoints
8192
-->
8293
<!-- spec_insert_end -->
8394
```

FORMATTING_GUIDE.md

+6
Original file line numberDiff line numberDiff line change
@@ -399,6 +399,12 @@ Some Markdown paragraph. Here's a formula:
399399
And back to Markdown.
400400
```
401401

402+
Alternatively, you can use double dollar signs (`$$`) for both display and inline math directly in Markdown:
403+
404+
```
405+
The probability of selecting pair $$i$$ is proportional to $$1 \over i^\alpha$$.
406+
```
407+
402408
## Tables
403409

404410
Markdown table columns are automatically sized, and there is no need to specify a different number of dashes in the formatting.

STYLE_GUIDE.md

+5-2
Original file line numberDiff line numberDiff line change
@@ -23,11 +23,11 @@ The following naming conventions should be observed in OpenSearch Project conten
2323

2424
#### Product names
2525

26-
Capitalize product names. The OpenSearch Project has three products: OpenSearch, OpenSearch Dashboards, and Data Prepper. For example:
26+
Capitalize product names. The OpenSearch Project has three products: OpenSearch, OpenSearch Dashboards, and OpenSearch Data Prepper. For example:
2727

2828
* "To install *OpenSearch*, download the Docker image."
2929
* "To access *OpenSearch Dashboards*, open your browser and navigate to http://localhost:5601/app/home."
30-
* "*Data Prepper* contains the following components:"
30+
* "*OpenSearch Data Prepper* contains the following components:"
3131

3232
Capitalize the names of clients and tools. For example:
3333

@@ -128,6 +128,7 @@ The following table lists acronyms that you don't need to spell out.
128128
| CSV | comma-separated values |
129129
| DNS | Domain Name System |
130130
| DOS | disk operating system |
131+
| Faiss | Facebook AI Similarity Search |
131132
| FAQ | frequently asked questions |
132133
| FTP | File Transfer Protocol |
133134
| GIF | Graphics Interchange Format |
@@ -141,8 +142,10 @@ The following table lists acronyms that you don't need to spell out.
141142
| JPEG | Joint Photographic Experts Group |
142143
| JSON | JavaScript Object Notation |
143144
| k-NN | k-nearest neighbors |
145+
| MS MARCO | Microsoft Machine Reading Comprehension |
144146
| NAT | network address translation |
145147
| NGINX | engine x |
148+
| NMSLIB | Non-Metric Space Library |
146149
| PDF | Portable Document Format |
147150
| RAM | random access memory |
148151
| REST | Representational State Transfer |

TERMS.md

+18-2
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ Use to describe a list of items that are allowed (not blocked). Do not use as a
4646

4747
**Amazon OpenSearch Service**
4848

49-
Amazon OpenSearch Service is a managed service that makes it easy to deploy, operate, and scale OpenSearch clusters in the AWS Cloud. Amazon OpenSearch Service is the successor to Amazon Elasticsearch Service (Amazon ES) and supports OpenSearch and legacy Elasticsearch OSS (up to 7.10, the final open-source version of the software).
49+
Use "Amazon OpenSearch Service" on first appearance; "OpenSearch Service" is acceptable for subsequent appearances. Amazon OpenSearch Service is a managed service that makes it easy to deploy, operate, and scale OpenSearch clusters in the AWS Cloud. Amazon OpenSearch Service is the successor to Amazon Elasticsearch Service (Amazon ES) and supports OpenSearch and legacy Elasticsearch OSS (up to 7.10, the final open-source version of the software).
5050

5151
**Anomaly Detection**
5252

@@ -196,6 +196,10 @@ Use data is, not data are. Don't use datas. Use pieces of data or equivalent to
196196

197197
**data center**
198198

199+
**OpenSearch Data Prepper**
200+
201+
Use "OpenSearch Data Prepper" on first appearance; "Data Prepper" is acceptable for subsequent appearances. OpenSearch Data Prepper is a server-side data collector capable of filtering, enriching, transforming, normalizing, and aggregating data for downstream analytics and visualization. Data Prepper also lets users build custom pipelines to improve the operational view of applications.
202+
199203
**dataset**
200204

201205
**data source**
@@ -299,6 +303,8 @@ Exception: *Execution* is unavoidable for third-party terms for which no alterna
299303

300304
**Faiss**
301305

306+
Facebook AI Similarity Search. Do not define on first appearance. Faiss is a library that allows developers to quickly search for embeddings of multimedia documents that are similar to each other.
307+
302308
**file name**
303309

304310
**frontend (n., adj.)**
@@ -501,6 +507,10 @@ Do not use. Use *management account* instead.
501507

502508
Avoid. Use _can_ or _might_ instead.
503509

510+
**MS MARCO**
511+
512+
Microsoft Machine Reading Comprehension. Do not define on first appearance. MS MARCO is a collection of datasets focused on deep learning in search.
513+
504514
**multilayer, multilayered**
505515

506516
**must, shall, should**
@@ -521,6 +531,10 @@ Use _near real time_ as a noun; use near real-time as an adjective. Don't add a
521531

522532
Spell out _near real time_ on first mention; _NRT_ can be used on subsequent mentions.
523533

534+
**NMSLIB**
535+
536+
Non-Metric Space Library. Do not define on first appearance. NMSLIB is an efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
537+
524538
**node**
525539

526540
A server that stores your data and processes search requests with OpenSearch, usually as part of a cluster. Do not use _master node_ and avoid using _worker node_.
@@ -650,6 +664,8 @@ Copy of a primary shard. Helps improve performance when using indexes across mul
650664

651665
Use as a synonym for repository, on second and subsequent use.
652666

667+
**retrieval-augmented generation (RAG)**
668+
653669
**RPM Package Manager (RPM)**
654670

655671
Formerly known as RedHat Package Manager. An open-source package management system for use with Linux distributions.
@@ -698,7 +714,7 @@ Do not use. Use *replica*, *secondary*, or *standby* instead.
698714

699715
**Snapshot Management (SM)**
700716

701-
**solid state drive (SSD)**
717+
**solid-state drive (SSD)**
702718

703719
**standalone**
704720

_about/version-history.md

+1
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,7 @@ permalink: /version-history/
99

1010
OpenSearch version | Release highlights | Release date
1111
:--- | :--- | :---
12+
[2.19.0](https://github.com/opensearch-project/opensearch-build/blob/main/release-notes/opensearch-release-notes-2.19.0.md) | Adds workload management, additional query insights, and template queries. Introduces a query insights page to OpenSearch Dashboards. Includes improvements and bug fixes to snapshots, search statistics, star-tree search, and index management. For a full list of release highlights, see the Release Notes. | 11 February 2025
1213
[2.18.0](https://github.com/opensearch-project/opensearch-build/blob/main/release-notes/opensearch-release-notes-2.18.0.md) | Adds a redesigned home page, updated Discover interface, and collaborative workspaces to OpenSearch Dashboards. Includes improvements to ML inference processor and query grouping. Introduces reranking by field and paginated CAT APIs. Includes experimental OpenSearch Dashboards Assistant capabilities. For a full list of release highlights, see the Release Notes. | 05 November 2024
1314
[2.17.1](https://github.com/opensearch-project/opensearch-build/blob/main/release-notes/opensearch-release-notes-2.17.1.md) | Includes bug fixes for ML Commons, anomaly detection, k-NN, and security analytics. Adds various infrastructure and maintenance updates. For a full list of release highlights, see the Release Notes. | 1 October 2024
1415
[2.17.0](https://github.com/opensearch-project/opensearch-build/blob/main/release-notes/opensearch-release-notes-2.17.0.md) | Includes disk-optimized vector search, binary quantization, and byte vector encoding in k-NN. Adds asynchronous batch ingestion for ML tasks. Provides search and query performance enhancements and a new custom trace source in trace analytics. Includes application-based configuration templates. For a full list of release highlights, see the Release Notes. | 17 September 2024

_aggregations/bucket/terms.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -112,7 +112,7 @@ While the `doc_count` field provides a representation of the number of individua
112112
* The field does not support nested arrays; only positive integers can be used.
113113
* If a document does not contain the `_doc_count` field, aggregation uses the document to increase the count by 1.
114114

115-
OpenSearch features that rely on an accurate document count illustrate the importance of using the `_doc_count` field. To see how this field can be used to support other search tools, refer to [Index rollups](https://opensearch.org/docs/latest/im-plugin/index-rollups/index/), an OpenSearch feature for the Index Management (IM) plugin that stores documents with pre-aggregated data in rollup indexes.
115+
OpenSearch features that rely on an accurate document count illustrate the importance of using the `_doc_count` field. To see how this field can be used to support other search tools, refer to [Index rollups]({{site.url}}{{site.baseurl}}/im-plugin/index-rollups/index/), an OpenSearch feature for the Index Management (IM) plugin that stores documents with pre-aggregated data in rollup indexes.
116116
{: .tip}
117117

118118
#### Example request

0 commit comments

Comments
 (0)