DIVE Lab, Texas A&M University
Data Integration, Visualization, and Exploration
- 46 followers
- United States of America
- http://people.tamu.edu/~sji/
Pinned Loading
Repositories
Showing 10 of 55 repositories
- Sys2Bench Public
Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, logical, arithmetic, and common-sense reasoning tasks.
divelab/Sys2Bench’s past year of commit activity