load from csv component #21

Merged
RobbeSneyders merged 4 commits into main from 16-implement-component-to-load-evaluation-data
Nov 20, 2023

Conversation

Hakimovich99
Contributor

No description provided.

@Hakimovich99 Hakimovich99 linked an issue Nov 20, 2023 that may be closed by this pull request
Member

@RobbeSneyders RobbeSneyders left a comment

Thanks @Hakimovich99! Looks good apart from some minor comments.

@@ -0,0 +1,29 @@
name: Load from csv file
description: Component that loads a dataset from huggingface hub
image: hakimovich99/load_from_csv:dev #TODO: change it to fndnt once implemented
Member

You can use the ml6team GitHub Container Registry for any components related to a use case.

Contributor Author

Just to make sure, like this: ghcr.io/ml6team/load_from_csv:dev ?

Member

Yes. You can label the images to link them to this repo.
See this script, for instance; it uses the fondant build command.

Contributor Author

Thanks!

@@ -0,0 +1,29 @@
name: Load from csv file
description: Component that loads a dataset from huggingface hub
Member

Should be updated.
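
A sketch of how the spec header might look once both comments on it are addressed. The image path is the one confirmed in the thread above; the description wording is only an assumption, since the review asks for an update without giving the new text.

```yaml
name: Load from csv file
# Assumed wording: the review only says the description should be updated.
description: Component that loads a dataset from a csv file
# Image path as confirmed in the review thread (ml6team container registry).
image: ghcr.io/ml6team/load_from_csv:dev
```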

Member

This file should not be in here.

Contributor Author

The pre-commit check fails if I don't add this file: src/components/text_cleaning/src/main.py: error: Duplicate module named "main" (also at "src/components/load_from_csv/src/main.py")

Member

Yes, the mypy pre-commit hook doesn't work well in a multi-repo. However, adding the __init__.py files is only a band-aid and will not solve it. Feel free to remove the mypy pre-commit hook; we did the same in other use case repositories, e.g. here.

Contributor Author

Ah nice, I was wondering if there was a way to deactivate it instead. Thanks!
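
For illustration, dropping the hook could look like the sketch below. It assumes mypy was wired in through the pre-commit/mirrors-mypy repository and that the config contains other hooks; the trailing-whitespace entry and its rev are placeholders for whatever the repo actually uses and are not taken from this PR.

```yaml
# .pre-commit-config.yaml (hypothetical contents)
repos:
  # Placeholder for the hooks that stay in place.
  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v4.5.0
    hooks:
      - id: trailing-whitespace

  # mypy hook removed: it reports 'Duplicate module named "main"' because
  # every component ships its own src/main.py.
  # - repo: https://github.com/pre-commit/mirrors-mypy
  #   rev: <pinned rev>
  #   hooks:
  #     - id: mypy
```

With the hook gone, the extra __init__.py files added only to silence the error are no longer needed.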

Thanks for the feedback!
Member

I don't think this file needs to be in here?

Contributor Author

Indeed thanks!

Member

@RobbeSneyders RobbeSneyders left a comment

Thanks @Hakimovich99!

@RobbeSneyders RobbeSneyders merged commit 36dad86 into main Nov 20, 2023
1 check passed
@RobbeSneyders RobbeSneyders deleted the 16-implement-component-to-load-evaluation-data branch November 20, 2023 12:42
Development

Successfully merging this pull request may close these issues.

Implement component to load evaluation data
2 participants