
33 transforms group #51

Merged: philswatton merged 17 commits into develop from 33-transforms-group on May 15, 2023

Conversation

@philswatton (Contributor) commented May 5, 2023

This PR:

  • Resolves New Experiment Group With Transforms #33, although we may wish to add more experiment groups in the future. I've raised Consider Adding Further Experiment Groups #52 to cover this
  • Adds a new experiment group called 'none-vs-transforms' to the datasets config, which compares an untransformed dataset A against a transformed dataset B
    • There are 4 transforms:
      • Little blur (kernel 3x3, sigma 1)
      • Big blur (kernel 3x3, sigma 3)
      • Grayscale
      • Flipping the image 180 degrees
    • There are 6 different drop % pairs: (0,0), (0,0.5), (0.5,0), (0.5,0.5), (0.25,0.75), (0.75,0.25)
      • These correspond to: no data dropping (0,0); imbalanced data with overlap, dropping 50% from one side only ((0,0.5) and (0.5,0)); balanced data without overlap, dropping 50% from both (0.5,0.5); and imbalanced data without overlap, dropping 75% from one side and 25% from the other ((0.25,0.75) and (0.75,0.25))
    • This creates a total of 4x6 = 24 dataset pairs. Once the three seeds are considered, this brings us up to 72 total additional dataset pairs
  • Adds a compose_transforms function to modsim2.utils.config, which draws on the previous code for loading transforms but also allows None to be supplied as an input (returning None in that case); a sketch of this behaviour follows this list
    • This 1) preserves backwards compatibility: the previous experiment group will work the same as before
    • And 2) means that we can read in the transforms from the dataset config
  • Uses this in the opts2dmpairArgs function in scripts/utils.py
  • For transforms to be applied, DMPair.A.setup() and DMPair.B.setup() may need to be called. DMPair.compute_similarity() will therefore throw an error if the transforms on the datamodules do not match those supplied to DMPair() (when the supplied transforms are not None)
  • Some fixes to a couple of files to reflect changes in the code
  • Develop was merged in to get the changes from 7 implement ot #47
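
For illustration, here's a minimal sketch of the compose_transforms behaviour described above. The config schema and the torchvision transform names below are assumptions for the sake of the example, not the exact code in modsim2.utils.config:

```python
# Illustrative sketch only -- the real compose_transforms in
# modsim2.utils.config may use a different config schema. The key behaviour
# from this PR: a None input returns None, so the previous experiment group
# (which supplies no transforms) keeps working unchanged.
from typing import Optional

from torchvision import transforms


def compose_transforms(transform_list: Optional[list]) -> Optional[transforms.Compose]:
    if transform_list is None:
        return None  # backwards compatibility: no transforms configured
    # Assumed schema: each entry names a torchvision transform plus kwargs,
    # e.g. {"name": "GaussianBlur", "kwargs": {"kernel_size": 3, "sigma": 1}}
    built = [
        getattr(transforms, spec["name"])(**spec.get("kwargs", {}))
        for spec in transform_list
    ]
    return transforms.Compose(built)


# The four transforms above, expressed in this assumed schema:
little_blur = compose_transforms([{"name": "GaussianBlur", "kwargs": {"kernel_size": 3, "sigma": 1}}])
big_blur = compose_transforms([{"name": "GaussianBlur", "kwargs": {"kernel_size": 3, "sigma": 3}}])
grayscale = compose_transforms([{"name": "Grayscale"}])
# A 180-degree flip can be written as a fixed rotation:
flip_180 = compose_transforms([{"name": "RandomRotation", "kwargs": {"degrees": (180, 180)}}])

assert compose_transforms(None) is None  # the None passthrough
```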

@philswatton philswatton linked an issue May 5, 2023 that may be closed by this pull request
@philswatton philswatton marked this pull request as ready for review May 9, 2023 12:56
@joannacknight (Contributor) left a comment

I've added one suggestion.
The tests all passed on my machine.

@lannelin (Contributor) left a comment

looks good so far, a couple of suggested changes. Finishing for now and will pick up again!

Contributor

thinking about this more in the interest of cutting down experiment space

> There are 6 different drop % pairs: (0,0), (0,0.5), (0.5,0), (0.5,0.5), (0.25,0.75), (0.75,0.25)

what purpose do we think each of these serves, and do we need all of them?
do we get anything further from the last two that we don't already have from (0.5,0.5)?

Contributor Author

I think my rough line of thinking on this is:

  • I'd like to test how much of an effect having no vs. some overlapping data has, given our theory of data similarity
  • I'd like to also account for the fact that data imbalance may have an effect on transfer success
  • We should make sure that in our experiment these two things are fully independent of each other. If we got rid of the last two groups, we'd only be testing data imbalance in the case where there is overlap between the observations in both datasets

That said, this isn't a particularly strong justification, so I'm open to removing them if we have a need to cut down the experiment space!
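
To make that concrete, here's a toy sketch of how a (drop_A, drop_B) pair controls both overlap and balance. This is illustrative only, not the repo's actual dropping code, and the front/back split scheme is an assumption:

```python
# Toy illustration (not the actual modsim2 dropping code) of how a
# (drop_A, drop_B) pair controls overlap and balance. A drops from the
# front of a shared shuffled index list and B drops from the back, so
# (0.5, 0.5) leaves the two halves disjoint, while (0, 0.5) leaves B
# fully contained in A.
import numpy as np


def split_indices(n: int, drop_a: float, drop_b: float, seed: int = 0):
    idx = np.random.default_rng(seed).permutation(n)
    a = idx[int(n * drop_a):]       # A keeps everything after its dropped front slice
    b = idx[: n - int(n * drop_b)]  # B keeps everything before its dropped back slice
    return a, b


for drop_a, drop_b in [(0, 0), (0, 0.5), (0.5, 0.5), (0.25, 0.75)]:
    a, b = split_indices(1000, drop_a, drop_b)
    print(f"drop=({drop_a}, {drop_b}): |A|={len(a)}, |B|={len(b)}, overlap={len(set(a) & set(b))}")

# drop=(0, 0):       balanced, full overlap
# drop=(0, 0.5):     imbalanced, with overlap (B is a subset of A)
# drop=(0.5, 0.5):   balanced, no overlap
# drop=(0.25, 0.75): imbalanced, no overlap -- the case only the last two pairs cover
```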

Contributor

let's see how long things take to run but sounds reasonable to keep for now, thanks!

@philswatton (Contributor Author) commented

I've added some new commits that somewhat change the PR, in line with in-person discussion. We now have:

  • several smaller experiment groups, each with its own file in a subfolder of the config folder
  • code updated to reflect the new structure
  • some unused source code removed
  • tests updated
  • an extra test added for the compute_similarity error logic (a toy sketch of this guard follows below)
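
For reference, here's a self-contained toy sketch of the guard that the new test exercises. The real DMPair lives in this repo and its constructor signature and error type may differ; everything below is illustrative:

```python
# Toy sketch of the compute_similarity error logic: raise if the transforms
# on a datamodule no longer match those supplied to the pair's constructor,
# but only when the supplied transforms are not None. Names, signatures, and
# the error type here are assumptions, not the repo's confirmed API.
import pytest


class _ToyDataModule:
    def __init__(self, transforms):
        self.transforms = transforms


class ToyDMPair:
    def __init__(self, transforms_a=None, transforms_b=None):
        self._supplied = (transforms_a, transforms_b)
        self.A = _ToyDataModule(transforms_a)
        self.B = _ToyDataModule(transforms_b)

    def compute_similarity(self):
        for supplied, module in zip(self._supplied, (self.A, self.B)):
            # Guard only applies when transforms were supplied (not None)
            if supplied is not None and module.transforms is not supplied:
                raise ValueError("datamodule transforms do not match those supplied to the pair")
        return 0.0  # placeholder for the real similarity computation


def test_mismatched_transforms_raise():
    pair = ToyDMPair(transforms_a=object())
    pair.A.transforms = object()  # simulate a mismatch after setup()
    with pytest.raises(ValueError):
        pair.compute_similarity()
```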

@joannacknight (Contributor) left a comment

The pytest tests all passed for me. I also had a look at the generate-scripts files and think I found one tiny issue there. I only ran the generate_metrics_scripts.py file, but I think the issue is across all three files.

@philswatton philswatton requested a review from joannacknight May 15, 2023 08:37
@joannacknight (Contributor) left a comment

The tests all pass for me

@lannelin (Contributor) left a comment

looks good to me! tests pass and debug made sense.

minor comment on removal of a file, then good to go

Contributor

I think we can also remove transforms.yaml?

Contributor Author

Have removed!

@philswatton philswatton merged commit 54559a1 into develop May 15, 2023
@philswatton philswatton deleted the 33-transforms-group branch May 15, 2023 11:36