Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[#47] Adding filesystem support for save_df #48

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from
Draft

Conversation

gauglertodd
Copy link
Collaborator

@gauglertodd gauglertodd commented Oct 23, 2024

Responding to some requests to allow for better save_df functionality, specifically saving the internal dataframe to cloud storage rather than saving to local path.

I'll play around with this approach and write some tests.

fixes #47

Copy link

github-actions bot commented Oct 23, 2024

Test Results

3 tests  +1   3 ✅ +1   49s ⏱️ +4s
1 suites ±0   0 💤 ±0 
1 files   ±0   0 ❌ ±0 

Results for commit 2036a41. ± Comparison against base commit 8868835.

♻️ This comment has been updated with latest results.

@gauglertodd gauglertodd force-pushed the tbg/47 branch 3 times, most recently from d47af0c to 11c3f24 Compare October 23, 2024 14:07
path: Optional[str] = None,
enable_perspective_dashboard: bool = False,
):
def __init__(self, name: str, namespace: str, enable_perspective_dashboard: bool = False, filesystem: Type[fs.FileSystem] = fs.LocalFileSystem):
Copy link
Member

@timkpaine timkpaine Oct 23, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this take an instance, otherwise how would you pass args to e.g. fs.S3FileSystem?

Copy link
Collaborator Author

@gauglertodd gauglertodd Oct 23, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

\we've seen some issues with serializing certain filesystem objects - the alternative is default construction of a filesystem, which you can toggle through env variables. We could probably just take like a filesystem_kwargs=dict() parameter too, I don't have strong feelings about this

@gauglertodd gauglertodd force-pushed the tbg/47 branch 2 times, most recently from 032e423 to 0558819 Compare October 23, 2024 18:19
@timkpaine timkpaine changed the title [#47] Adding filesystem support for save_df #47 Adding filesystem support for save_df Oct 23, 2024
@timkpaine timkpaine changed the title #47 Adding filesystem support for save_df [#47] Adding filesystem support for save_df Oct 23, 2024
@gauglertodd gauglertodd requested a review from timkpaine October 23, 2024 19:22
raydar/task_tracker/task_tracker.py Outdated Show resolved Hide resolved
...

Signed-off-by: Todd Gaugler <[email protected]>

...

...

.

...
@timkpaine timkpaine added type: enhancement Minor improvements type: feature Feature requests and removed type: enhancement Minor improvements labels Nov 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: feature Feature requests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add Better Support for save_df
2 participants