-
Notifications
You must be signed in to change notification settings - Fork 102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add support for optional progress updates channel #661
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -112,6 +112,25 @@ | |
// source storage to fetch large blobs. | ||
// If FindSuccessors is nil, content.Successors will be used. | ||
FindSuccessors func(ctx context.Context, fetcher content.Fetcher, desc ocispec.Descriptor) ([]ocispec.Descriptor, error) | ||
|
||
// UpdateChannel is an optional channel to receive progress updates. | ||
// Each update will include the number of bytes copied for a particular blob | ||
// or manifest, the expected total size, and the descriptor of the blob or | ||
// manifest. It is up to the consumer of the channel to differentiate | ||
// between updates among different blobs and manifests; no mechanism is | ||
// provided for distinguishing between them, other than the descriptor | ||
// passed with each update. The total size of downloads of all blobs and | ||
// manifests is not provided, as it is not known. You can calculate the | ||
// percentage downloaded for a particular blob in an individual update | ||
// based on the total size of that blob, which is provided in the | ||
// descriptor, and the number of bytes copied, which is provided in the | ||
// update. | ||
// Updates are sent each time a block is copied. The number of bytes copied | ||
// depends upon whatever calls the io.ReadCloser of the source Target. | ||
// This may be io.Copy, which, by default, is 32KB, or it may be some other | ||
// implementation. | ||
// The caller is responsible for closing the channel. | ||
UpdateChannel chan<- CopyUpdate | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Channels are blocking. If someone sets There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yeah, that is a good point. I was thinking about that, wasn't sure quite how to handle it. Here are some possibilities:
For either of the above, maybe we have one goroutine and one channel we own. Main routine publishes to our channel (we control it, so we can ensure no blocking issues), which then either publishes to the passed channel (option 1 above) or calls the passed function (option 2). At least we don't block our main routine, although if their function blocks, our goroutine still is blocked. Is a func call better than a goroutine? In theory, they could use a channel in there. Or have a slow func call. |
||
} | ||
|
||
// Copy copies a rooted directed acyclic graph (DAG) with the tagged root node | ||
|
@@ -266,11 +285,17 @@ | |
} | ||
|
||
// doCopyNode copies a single content from the source CAS to the destination CAS. | ||
func doCopyNode(ctx context.Context, src content.ReadOnlyStorage, dst content.Storage, desc ocispec.Descriptor) error { | ||
func doCopyNode(ctx context.Context, src content.ReadOnlyStorage, dst content.Storage, desc ocispec.Descriptor, ch chan<- CopyUpdate) error { | ||
rc, err := src.Fetch(ctx, desc) | ||
if err != nil { | ||
return err | ||
} | ||
if ch != nil { | ||
rc = &progressReader{ | ||
c: ch, | ||
r: rc, | ||
} | ||
} | ||
Comment on lines
+293
to
+298
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Wrapping a There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Are you saying that it is performance impact by wrapping the What do you suggest? That we check for How would we then capture the updates? |
||
defer rc.Close() | ||
err = dst.Push(ctx, desc, rc) | ||
if err != nil && !errors.Is(err, errdef.ErrAlreadyExists) { | ||
|
@@ -291,7 +316,7 @@ | |
} | ||
} | ||
|
||
if err := doCopyNode(ctx, src, dst, desc); err != nil { | ||
if err := doCopyNode(ctx, src, dst, desc, opts.UpdateChannel); err != nil { | ||
return err | ||
} | ||
|
||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,45 @@ | ||
/* | ||
Copyright The ORAS Authors. | ||
Licensed under the Apache License, Version 2.0 (the "License"); | ||
you may not use this file except in compliance with the License. | ||
You may obtain a copy of the License at | ||
|
||
http://www.apache.org/licenses/LICENSE-2.0 | ||
|
||
Unless required by applicable law or agreed to in writing, software | ||
distributed under the License is distributed on an "AS IS" BASIS, | ||
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
See the License for the specific language governing permissions and | ||
limitations under the License. | ||
*/ | ||
|
||
package oras | ||
|
||
import ( | ||
"io" | ||
|
||
ocispec "github.com/opencontainers/image-spec/specs-go/v1" | ||
) | ||
|
||
type CopyUpdate struct { | ||
Copied int64 | ||
Descriptor ocispec.Descriptor | ||
} | ||
|
||
type progressReader struct { | ||
desc ocispec.Descriptor | ||
r io.ReadCloser | ||
c chan<- CopyUpdate | ||
} | ||
|
||
func (p *progressReader) Close() error { | ||
return p.r.Close() | ||
} | ||
|
||
func (p *progressReader) Read(buf []byte) (int, error) { | ||
n, err := p.r.Read(buf) | ||
if n > 0 { | ||
p.c <- CopyUpdate{Copied: int64(n), Descriptor: p.desc} | ||
} | ||
return n, err | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Presenting a progress bar of an operation isn't an easy job. Eventually, we will end up with an MVC pattern if we continue iterating.
Therefore, we need to define data models first for oras operations like
Copy
,PushBytes
, etc.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In fact,
oras
CLI has a data model so as thebuildkit
.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/cc @qweeah
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
True, but I am not concerned with that here.
oras
CLI might, but theCopy()
and such library calls should not care. They only should worry about some library-centric way of publishing updates. A CLI (likeoras
) or other consumer can do with them what they want.The data model is simple: stream of defined
struct
(orinterface
, if you prefer), each of which contains the amount of bytes transferred and descriptor to which it applies. Anything higher level would be outside the scope of this type of option, I think.