Dataset problem #16

M11202226HSU · 2024-10-24T11:47:26Z

Since I couldn't find your [Rendered WB dataset], I decided to download the raw dataset. It contains 62,532 images and 1,991 ground truth images.
Could you please help me understand how to set up my dataset? I have a total of 62,532 images and 1,991 ground truth images. What should the data format be? Additionally, how should I divide the dataset into training, validation, and testing sets?

Thank you!

mahmoudnafifi · 2024-10-24T11:58:30Z

Hi, the dataset is available here. If links are not working, please check the Google Drive mirror.

Hope that helps.

M11202226HSU · 2024-10-24T12:11:05Z

Hello,

I have downloaded the dataset, and now I have input images and ground truth images. I noticed file names like 8D5U5524_C_FF, 8D5U5524_C_A4, 8D5U5524_C_AS, 8D5U5524_D_AS, 8D5U5524_C_L, and 8D5U5524_C_N. What do the suffixes C_FF, C_A4, C_AS, C_N, and C_L mean? Are they necessary for training?

Thank you for your response!

mahmoudnafifi · 2024-10-24T13:08:15Z

Yes they are necessary for training. File format is as follows: name_WB_picStyle and the corresponding ground truth image (for auto white balance): name_G_AS. G here refers to ground-truth (manually corrected white balance), C (for example) refers to cloudy white balance, and so on.

M11202226HSU · 2024-10-24T15:56:03Z

To train your model, should I place all the images (including 62,535 images) and ground truth (including 1,991 images) into one folder, or should I split them into training, validation, and testing sets? If splitting is required, what should the format be?

Thank you for your help !

M11202226HSU · 2024-10-24T16:12:30Z

I understand that C stands for cloudy and D stands for daylight, but I'm not sure about the following notations: C_"A4", C_"CS", C_"FF", C_"L", and C_"P". I think C_AS might represent the ground truth, but could you please explain the other labels?
Thank you for your response !!!

mahmoudnafifi · 2024-10-24T16:47:16Z

AS, CS, L, … are picture styles.
Deep wb performs three different wb settings (auto, indoor, outdoor) and thus we have three gt white balance (T, S, G: tungsten, shade, manual) with AS (as shot) picture style.

M11202226HSU · 2024-10-25T05:03:18Z

Should I simply place all the images and ground truth (GT) in the same folder for training, or is there a need to split them into training, validation, and testing sets?

Thank you for your help

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset problem #16

Dataset problem #16

M11202226HSU commented Oct 24, 2024

mahmoudnafifi commented Oct 24, 2024

M11202226HSU commented Oct 24, 2024

mahmoudnafifi commented Oct 24, 2024

M11202226HSU commented Oct 24, 2024

M11202226HSU commented Oct 24, 2024

mahmoudnafifi commented Oct 24, 2024

M11202226HSU commented Oct 25, 2024

Dataset problem #16

Dataset problem #16

Comments

M11202226HSU commented Oct 24, 2024

mahmoudnafifi commented Oct 24, 2024

M11202226HSU commented Oct 24, 2024

mahmoudnafifi commented Oct 24, 2024

M11202226HSU commented Oct 24, 2024

M11202226HSU commented Oct 24, 2024

mahmoudnafifi commented Oct 24, 2024

M11202226HSU commented Oct 25, 2024