Add 3 Sentence Transformers training skills #136

Merged
burtenshaw merged 2 commits into huggingface:main from tomaarsen:skill/add-sentence-transformers-training
May 7, 2026

Conversation

@tomaarsen
Member

Hello!

Pull Request overview

  • Add 3 Sentence Transformers training skills
  • Fix Windows-specific issue

Details

This adds 3 training skills, which will also be hosted under https://github.com/huggingface/sentence-transformers and updated every time sentence-transformers has an update via a syncing script. I'd like to get this PR in first, and then the syncing can take care of the updates.

  • Tom Aarsen

  "name": name,
  "description": description,
- "path": str(skill_md.parent.relative_to(ROOT)),
+ "path": skill_md.parent.relative_to(ROOT).as_posix(),
Member Author

This should be the only "outlier" change in this PR: this fixes Windows support for generate_agents.py. Not a big deal, but nice to have fixed.
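
To illustrate why the change matters, here is a minimal sketch using `pathlib`'s pure path classes, which can model Windows paths on any operating system. The directory name `skills/train-sentence-transformers` is a hypothetical example, not necessarily the layout of this repository. It shows that `str()` on a Windows path emits backslashes, while `as_posix()` always emits forward slashes, so a generated manifest stays identical across platforms.

```python
from pathlib import PurePath, PurePosixPath, PureWindowsPath


def manifest_path(relative: PurePath) -> str:
    """Return a platform-independent path string for a generated manifest."""
    # as_posix() always joins the parts with forward slashes.
    return relative.as_posix()


# Hypothetical relative skill directory, modelled as a Windows and a POSIX path.
win_rel = PureWindowsPath("skills") / "train-sentence-transformers"
posix_rel = PurePosixPath("skills") / "train-sentence-transformers"

print(str(win_rel))              # backslash-separated on Windows
print(manifest_path(win_rel))    # forward slashes
print(manifest_path(posix_rel))  # forward slashes
```

With `str()`, a manifest generated on Windows would differ from one generated on Linux or macOS even though nothing changed; `as_posix()` removes that difference.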

@burtenshaw
Collaborator

Thanks @tomaarsen. One thing that strikes me from a high level is that there are three skills with a lot of duplicated references. Would it be feasible to achieve this in one skill? Note that progressive disclosure means an agent will only ever load the references it needs for the task it is doing. Also, multiple skills can hurt adoption because we're asking users to do more.

In concrete terms, we could move cross-encoder.md and sparse_encoder.md into train-sentence-transformers and remove those two skills.
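
The progressive-disclosure idea can be sketched roughly as follows. This is an illustration only, not how any particular agent implements skill loading: the `references/` subdirectory, the `load_skill_context` helper, and the way needed files are selected are all assumptions made for the example. The point is that a single skill can hold several reference files while only the relevant ones are read for a given task.

```python
import tempfile
from pathlib import Path


def load_skill_context(skill_dir: Path, needed: set[str]) -> dict[str, str]:
    """Always read SKILL.md; read a reference file only if the task needs it."""
    context = {"SKILL.md": (skill_dir / "SKILL.md").read_text(encoding="utf-8")}
    # Hypothetical layout: reference documents live in a references/ folder.
    for ref in sorted((skill_dir / "references").glob("*.md")):
        if ref.name in needed:
            context[ref.name] = ref.read_text(encoding="utf-8")
    return context


def demo() -> list[str]:
    """Build a throwaway skill folder and load it for a cross-encoder task."""
    with tempfile.TemporaryDirectory() as tmp:
        skill_dir = Path(tmp) / "train-sentence-transformers"
        (skill_dir / "references").mkdir(parents=True)
        (skill_dir / "SKILL.md").write_text("entry point\n", encoding="utf-8")
        for name in ("cross-encoder.md", "sparse_encoder.md"):
            (skill_dir / "references" / name).write_text(name, encoding="utf-8")
        # Only the cross-encoder reference is requested, so only it is read.
        return list(load_skill_context(skill_dir, {"cross-encoder.md"}))


print(demo())
```

In this sketch the sparse encoder reference sits in the same skill but is never read for a cross-encoder task, which is why merging the skills need not increase what the agent loads.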

@tomaarsen tomaarsen marked this pull request as draft May 6, 2026 15:47
@tomaarsen
Member Author

Working on a merged variant here: huggingface/sentence-transformers#3752

I'm also experimenting with making the SKILL.md tiny to force the agent to look at the references: https://github.com/huggingface/sentence-transformers/pull/3752/changes#diff-9edabb53f64ed74ad022f5ba6bcd5d52b0d5e56d9530f5243b6f3beb42ef1c8a
Just 100 lines right now.

  • Tom Aarsen

@tomaarsen tomaarsen marked this pull request as ready for review May 7, 2026 11:24
@tomaarsen
Member Author

I've successfully merged the 3 skills into one, and in my tests the agents load the correct files for each of the 3 archetypes. Let me know your thoughts, @burtenshaw!

  • Tom Aarsen

Collaborator

@burtenshaw burtenshaw left a comment

Nice. Tested in Codex and Claude Code.

@burtenshaw burtenshaw merged commit 00bc1b8 into huggingface:main May 7, 2026
1 check passed

2 participants