Performance issue in /datasets/dataloader.py (by P3)

Hello! I've found a performance issue in /datasets/dataloader.py: `dataset.batch(self.config['batch_size'])`[(here)](https://github.com/bilylee/SiamFC-TensorFlow/blob/f572dca95f2b3b2861f54de467259753428e468c/datasets/dataloader.py#L95) should be calle before `dataset.map(transform_fn, num_parallel_calls=self.config['prefetch_threads'])`[(here)](https://github.com/bilylee/SiamFC-TensorFlow/blob/f572dca95f2b3b2861f54de467259753428e468c/datasets/dataloader.py#L92), which could make your program more efficient.

Here is [the tensorflow document](https://tensorflow.google.cn/guide/data_performance?hl=zh_cn#vectorized_mapping) to support it.

Besides, you need to check the function `transform_fn` called in `dataset.map(transform_fn, num_parallel_calls=self.config['prefetch_threads'])` whether to be affected or not to make the changed code work properly. For example, if `transform_fn` needs data with shape (x, y, z) as its input before fix, it would require data with shape (batch_size, x, y, z) after fix.

Looking forward to your reply. Btw, I am very glad to create a PR to fix it if you are too busy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance issue in /datasets/dataloader.py (by P3) #117

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Performance issue in /datasets/dataloader.py (by P3) #117

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions