diff --git a/text/000-symbol-sets.md b/text/000-symbol-sets.md new file mode 100644 index 00000000..773bcc7b --- /dev/null +++ b/text/000-symbol-sets.md @@ -0,0 +1,80 @@ +- Title: Symbol Sets + +- Drivers: Michael Soegtrop + +---- + +# Summary + +In automation the most reliable and fastest way to reduce terms is frequently to use the `lazy` reduction with an explicit list of symbols to expand, that is: +``` +lazy [ ]. +``` + +The problem with this is that the symbol sets can be lengthy - a few 100 is not rare in practice - and difficult to maintain. +Typically the symbol sets are all symbols from specific modules with individual inclusions / exclusions. +An effective solution would be to be able to declare named symbol sets using +- individual symbols +- all symbols in a module without recursion into sub modules +- all symbols in a module with recursion into sub modules +- set operations union, intersection and difference + +# Motivation + +In proof automation one frequently has to reduce terms which contain user supplied terms, that is terms whose content the automation does not restrict. +Applying `simpl` or `cbn` to such terms does not work reliably, because there are examples for term where `simpl` and `cbn` take a very long time. +If the user supplied terms are of this nature, the overall tactic will also take a long time, even though the task at hand in the automation would be trivial. +Besides in automation one usually does not want the user supplied terms to be changed at all. + +What is done in some Coq developments is to hide the user supplied terms behind opaque definitions and then use full evaluation. +This works, but the hiding and unhiding of user terms can take substantially longer than the reduction itself. + +The only really good option is using `lazy` or `cbv` with a delta symbol set, which just contains private definitions of the automated tactic itself. +This requires an effective way to manage symbol sets. + +Note that this method frequently involves copying some basic functions like list append. +This is usually not a problem - it can become a problem if the automation needs e.g. arithmetic. +It would make sense to have a mechanism to copy the contents of a module under a new logical path, so that one can add the copy to the delta reduction list, but leave the original symbols unreduced. +But this is an independent topic and should be discussed in a separate CEP. + +Note that this related to [Declare Reduction](https://coq.inria.fr/doc/V8.19.0/refman/proofs/writing-proofs/equality.html#coq:cmd.Declare-Reduction). + +# Detailed design + +The syntax doesn't really matter a lot - it is fine to choose something which is easy to implement. + +A possible syntax would be: +- Command **Declare Symbol Set** *ident* **:=** *symbol_set_expr* +- *symbol_set_expr* ::= **Module** *qualid* | **Module Recursive** *qualid* | *reference* | *symbol_set_expr* [**+ * -**] *symbol_set_expr* | **(** *symbol_set_expr* **)** + +where +- **Module** *qualid* adds all symbols directly found in the module or library specified by *qualid* +- **Module Recursive** *qualid* adds all symbols found in the module or library specified by *qualid* and all its sub modules +- *reference* adds the one symbol *reference* +- *A* **+** *B* adds the union of *A* and *B* +- *A* **-** *B* adds the *A* excluding *B* (set difference) +- *A* * *B* adds the intersection of *A* and *B* + +The Coq grammar token [reductions](https://coq.inria.fr/doc/V8.19.0/refman/proofs/writing-proofs/equality.html#grammar-token-reductions) should be extended with a new item `set `. + +It is required to be able to extend the symbol set in the tactic call either by allowing an additional `delta` (not not `delta -`), of by providing APIs in the common tactic languages to create new symbol sets from existing ones. +It would also make sense to provide APIs to define symbol sets from tactics. + +# Drawbacks + +None I am aware of. + +# Alternatives + +One can create symbol sets using Elpi programs and there is work in progress to add functionality to enumerate modules to Ltac2 as well. +This does work, but the concept is so basic, that direct support in Coq does make sense. +I guess the idea behind [Declare Reduction](https://coq.inria.fr/doc/V8.19.0/refman/proofs/writing-proofs/equality.html#coq:cmd.Declare-Reduction) is similar, but a comfortable way to handle symbol sets is required. +Good support for creating symbol sets will likely encourage more people to use this very effective technique. +It is also imaginable that for very large list there might be more effective ways to process the reduction tactic if the symbol set is stored internally in an appropriate format. +If symbol sets are provided as lists, the reduction tactic has to at least traverse this list once. + +There are ways to control the opacity of symbols, including local control using `with_strategy`, but this proved to be more useful for manual proofs than for proof automation. + +# Unresolved questions + +The precise syntax of the symbol set declaration command.