Hi~ Thank you for this interesting open-source work! Could the authors provide evaluation scripts for BAGEL or other open-source models?