You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In Ion 1.0, each stream has a [symbol table](https://amazon-ion.github.io/ion-docs/docs/symbols.html#processing-of-symbol-tables). The symbol table stores text values that can be referred to by their integer index in the table, providing a much more compact representation than repeating the full UTF-8 text bytes each time the value is used. Symbol tables do not store any other information used by the reader or writer.
4
4
@@ -9,102 +9,101 @@ Ion 1.1 also introduces the concept of a _module_, an organizational unit that h
9
9
> [!TIP]
10
10
> You can think of an Ion 1.0 symbol table as a module with an empty macro table.
11
11
12
-
In Ion 1.1, each stream has an [encoding module](modules/encoding_module.md)--the active `(symbol table, macro table)` pair that is being used to encode the stream.
12
+
In Ion 1.1, each stream has an [encoding module](modules/encoding_module.md)—the active `(symbol table, macro table)` pair that is being used to encode the stream.
13
13
14
-
## Identifiers
14
+
## Module interface
15
15
16
-
Many of the grammatical elements used to define modules and macros are _identifiers_--symbols that do not require quotation marks.
16
+
The interface to a module consists of:
17
17
18
-
More explicitly, an identifier is a sequence of one or more ASCII letters, digits, or the characters `$` (dollar sign) or `_` (underscore), not starting with a digit. It also cannot be of the form `$\d+`, which is the syntax for symbol IDs. (For example: `$3`, `$10`, `$458`, etc.)
18
+
* its _spec version_, denoting the Ion version used to define the module
19
+
* its _exported symbols_, an array of strings denoting symbol content
20
+
* its _exported macros_, an array of `<name, macro>` pairs, where all names are unique identifiers (or null).
19
21
20
-
## Defining a module
22
+
The spec version is external to the module body and the precise way it is determined depends on the type of module being defined. This is explained in further detail in [Module Versioning](#module-versioning).
21
23
22
-
A module has four kinds of subclauses:
24
+
The exported symbol array is denoted by the `symbol_table` clause of a module definition, and
25
+
by the `symbols` field of a shared symbol table.
23
26
24
-
1.`symbol_table` - an exported list of text values.
25
-
2.`macro_table` - an exported list of macro definitions.
26
-
3.`module` - a nested module definition.
27
-
4.`import` - a reference to a shared module definition
27
+
The exported macro array is denoted by the module’s `macro_table` clause, with addresses
28
+
allocated to macros or macro bindings in the order they are declared.
28
29
29
-
<!-- TODO: `export` -->
30
+
The exported symbols and exported macros are defined in the [module body](body.md).
30
31
31
-
### `symbol_table`
32
32
33
-
The `symbol_table` clause assembles a list of text values for the module to export. It takes any number of arguments.
33
+
## Types of modules
34
34
35
-
#### Syntax
36
-
```ion
37
-
(symbol_table arg1 arg2 ... argN)
38
-
```
35
+
There are multiple types of modules.
36
+
All modules share the same interface, but vary in their implementation in order to support a variety of different use cases.
|[Encoding Module](modules/encoding_module.md)| Defining the local encoding context |
41
+
|[System Module](modules/system_module.md)| Defining system symbols and macros |
42
+
|[Inner Module](modules/inner_modules.md)| Organizing symbols and macros and limiting the scope of macros |
43
+
|[Shared Module](modules/shared_modules.md)| Defining symbols and macros outside of the data stream |
41
44
42
-
When the `symbol_table` clause is encountered, the reader constructs an empty list. The arguments to the clause are then processed from left to right.
43
45
44
-
For each `arg`:
45
-
***If the `arg` is a list of text values**, the nested text values are appended to the end of the symbol table being constructed.
46
-
* When `null`, `null.string`, `null.symbol`, or `$0` appear in the list of text values, this creates a symbol with unknown text.
47
-
* The presence of any other Ion value in the list raises an error.
48
-
***If the `arg` is the name of a module**, the symbols in that module's symbol table are appended to the end of the symbol table being constructed.
49
-
***If the `arg` is anything else**, the reader must raise an error.
46
+
## Module versioning
50
47
51
-
#### Example `symbol_table`
48
+
Every module definition has a _spec version_ that determines the syntax and semantics of the module body.
49
+
A module’s spec version is expressed in terms of a specific Ion version; the meaning of the module is as defined by that version of the Ion specification.
52
50
53
-
```ion
54
-
(symbol_table // Constructs an empty symbol table (list)
55
-
["a", b, 'c'] // The text values in this list are appended to the table
56
-
foo // Module `foo`'s symbol table values are appended to the table
57
-
['''g''', "h", i]) // The text values in this list are appended to the table
58
-
```
59
-
If module `foo`'s symbol table were `[d, e, f]`, then the symbol table defined by the above clause would be:
60
-
```ion
61
-
["a", "b", "c", "d", "e", "f", "g", "h", "i"]
62
-
```
51
+
The spec version for an encoding module is implicitly derived from the Ion version of its containing segment.
52
+
The spec version for a shared module is denoted via a required annotation.
53
+
The spec version of an inner module is always the same as its containing module.
54
+
The spec version of a system module is the Ion version in which it was specified.
63
55
64
-
### `macro_table`
56
+
To ensure that all consumers of a module can properly understand it, a module can only import
57
+
shared modules defined with the same or earlier spec version.
65
58
66
-
The `macro_table` clause assembles a list of macro definitions for the module to export. It takes any number of arguments.
59
+
#### Examples
60
+
The spec version of a shared module must be declared explicitly using an annotation of the form `$ion_1_N`.
61
+
This allows the module to be serialized using any version of Ion, and its meaning will not change.
67
62
68
-
#### Syntax
69
63
```ion
70
-
(macro_table arg1 arg2 ... argN)
64
+
$ion_shared_module::
65
+
$ion_1_1::("com.example.symtab" 3
66
+
(symbol_table ...)
67
+
(macro_table ...))
71
68
```
72
-
#### Processing
73
-
74
-
When the `macro_table` clause is encountered, the reader constructs an empty list. The arguments to the clause are then processed from left to right.
75
69
76
-
For each `arg`:
77
-
***If the `arg` is a `macro` clause**, the clause is processed and the resulting macro definition is appended to the end of the macro table being constructed.
78
-
***If the `arg` is the name of a module**, the macro definitions in that module's macro table are appended to the end of the macro table being constructed.
79
-
***If the `arg` is anything else**, the reader must raise an error.
70
+
The spec version of an encoding module is always the same as the Ion version of its enclosing segment.
80
71
72
+
```ion
73
+
$ion_1_1
74
+
$ion_encoding::(
75
+
// Module semantics specified by Ion 1.1
76
+
...
77
+
)
78
+
79
+
// ...
80
+
81
+
$ion_1_3
82
+
$ion_encoding::(
83
+
// Module semantics specified by Ion 1.3
84
+
...
85
+
)
86
+
//... // Assuming no IVM
87
+
$ion_encoding::(
88
+
// Module semantics specified by Ion 1.3
89
+
...
90
+
)
91
+
```
81
92
82
-
Macro definitions being added to the macro table must have a unique name. If a macro is added whose name conflicts with one already present in the table, the reader must raise an error.
93
+
## Identifiers
83
94
84
-
### `macro`
95
+
Many of the grammatical elements used to define modules and macros are _identifiers_--symbols that do not require quotation marks.
85
96
86
-
The `macro` clause defines a new macro. See _[Defining macros](macros/defining_macros.md)_.
97
+
More explicitly, an identifier is a sequence of one or more ASCII letters, digits, or the characters `$` (dollar sign) or `_` (underscore), not starting with a digit. It also cannot be of the form `$\d+`, which is the syntax for symbol IDs. (For example: `$3`, `$10`, `$458`, etc.)
87
98
88
-
## Grammar
99
+
```bnf
100
+
identifier ::= identifier-start identifier-char*
89
101
90
-
Literals appear in `code blocks`. Terminals are described in _italic text_.
0 commit comments