Lean Workbook: A large-scale Lean problem set formalized from natural language math problems
Enhancing Formal Theorem Proving: A Comprehensive Dataset for Training AI Models on Coq Code
Building a Large Annotated Corpus of English: The Penn Treebank
Mann et Thompson: Rhethorical structure organisation: a theory of text organisation
Enhanced rhethorical structure theory