amd and pdd functions and tests #4

tinatn29 · 2025-04-24T17:57:00Z

amd.py file in src/diffpy/similarity/metrics (functions to compute amd and pdd)
tests (test_amd.py)
test_data

sbillinge

Please remove the __init__.py file from the src directory. We don't want it there.

Let's discuss the package structure a bit. Is there a reason for creating an amd.py module and putting pdd in it? As things stand, I think a better structure might be to make a metrics.py module in similarity, and then we would have imports that look more like:

from diffpy.similarity.metrics import amd, pdd

and so on.
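To make the suggested layout concrete, here is a minimal sketch of what a single metrics.py module could look like. The signatures, the `k` parameter, and the docstring wording are assumptions for illustration, not the PR's actual code, and the bodies are deliberately left unimplemented.

```python
# Hypothetical contents of src/diffpy/similarity/metrics.py.
# Signatures and docstrings are illustrative assumptions, not the PR's code.


def amd(structure, k):
    """Return the average minimum distance (AMD) vector of a structure.

    Parameters
    ----------
    structure : object
        The crystal structure to describe (type still to be decided).
    k : int
        The number of nearest neighbors to include.
    """
    raise NotImplementedError("amd is not implemented yet")


def pdd(structure, k):
    """Return the pointwise distance distribution (PDD) of a structure.

    Takes the same arguments as amd.
    """
    raise NotImplementedError("pdd is not implemented yet")
```

With both functions living in one module, the import line `from diffpy.similarity.metrics import amd, pdd` works without needing a separate module per metric.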

news/amd.rst

sbillinge · 2025-04-24T21:25:19Z

btw, this is failing tests. Things are missing from the requirements (and possibly pyproject.toml)

Actually, in early commits I do want the tests to fail, because I prefer that we write tests capturing the behavior we want before we implement the functionality in the functions. But we don't want them to fail because of things missing from the package structure.

Reach out if you have questions. This is a great start, but I prefer to go a bit more slowly initially.

tinatn29 · 2025-04-24T21:25:21Z

I can separate amd and pdd. I agree that it makes more sense to import them separately. I'll fix the tests and the news accordingly.

tinatn29 · 2025-04-24T21:27:48Z

Tests are failing because amd and pandas aren't listed in requirements. Should I add both of them to pip.txt?

sbillinge · 2025-04-24T21:36:25Z

> Tests are failing because amd and pandas aren't listed in requirements. Should I add both of them to pip.txt?

both conda.txt and pip.txt, but could we start a new PR from a clean main and work on the behavior? What Use Cases (UCs) do these address?

tinatn29 · 2025-04-24T21:40:24Z

> > Tests are failing because amd and pandas aren't listed in requirements. Should I add both of them to pip.txt?
>
> both conda.txt and pip.txt, but could we start a new PR from a clean main and work on the behavior? What Use Cases (UCs) do these address?

The tests are passing locally btw! (when I run pytest), so I think the only issue is the requirements. I have 3 test cases for each.

* amd_compare(cif1, cif1) returns 0 because the two structures are the same
* amd_compare(cif1, cif2) returns the amd distance between the two structures
* amd_compare([cif1, cif2], [cif1, cif2]) returns a distance matrix (DataFrame) between the two lists (a 2 x 2 matrix for the test case)

The same three cases apply to pdd_compare. cif1 and cif2 are the files I put in test_data.

I'll start from a clean main and do this one by one. Sorry for the messy branch!
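The three cases listed above could be written as tests along these lines. Because the PR's `amd_compare` and its CIF inputs are not shown in this thread, the sketch uses a stand-in that compares pre-computed descriptor vectors and returns nested lists rather than a DataFrame. The stand-in, the vectors `CIF1` and `CIF2`, and the L-infinity metric are all assumptions.

```python
import math


def _distance(u, v):
    # L-infinity distance between two descriptor vectors (assumed metric).
    return max(abs(x - y) for x, y in zip(u, v))


def amd_compare(a, b):
    """Stand-in for the PR's amd_compare, operating on descriptor vectors.

    A single vector pair gives one number; two lists of vectors give a
    distance matrix as nested lists (the PR returns a DataFrame instead).
    """
    if isinstance(a[0], (int, float)):
        return _distance(a, b)
    return [[_distance(u, v) for v in b] for u in a]


# Hypothetical descriptors standing in for the two CIF files.
CIF1 = [1.0, 1.0, 1.4]
CIF2 = [1.2, 1.3, 1.5]


def test_identical_structures_give_zero():
    assert amd_compare(CIF1, CIF1) == 0


def test_different_structures_give_a_distance():
    assert math.isclose(amd_compare(CIF1, CIF2), 0.3)


def test_lists_give_a_distance_matrix():
    matrix = amd_compare([CIF1, CIF2], [CIF1, CIF2])
    assert len(matrix) == 2 and all(len(row) == 2 for row in matrix)
    assert matrix[0][0] == 0 and matrix[1][1] == 0
    assert matrix[0][1] == matrix[1][0]
```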

sbillinge · 2025-04-24T22:02:48Z

> > > Tests are failing because amd and pandas aren't listed in requirements. Should I add both of them to pip.txt?
> >
> > both conda.txt and pip.txt, but could we start a new PR from a clean main and work on the behavior? What Use Cases (UCs) do these address?
>
> The tests are passing locally btw! (when I run pytest), so I think the only issue is the requirements. I have 3 test cases for each.
> * amd_compare(cif1, cif1) returns 0 because two structures are the same
> * amd_compare(cif1, cif2) returns amd distance between two structures
> * amd_compare([cif1, cif2], [cif1, cif2]) returns a distance matrix (DataFrame) between the two lists (2 x 2 matrix for the test case)
>
> same thing for pdd_compare. cif1 and cif2 are the files I put in test_data
>
> I'll start from a clean main and do this one by one. Sorry for the messy branch!

Sounds good. The preferred approach for me is to figure out the UC (the behavior someone using the code wants), then the code architecture and the tests that capture the behavior, and only then the functions themselves (though we generally define the functions and their API (inputs and outputs) and write the docstrings as part of the architecture discussion).

I am not 100% sure who our target audience is and what they want to do, so I need guidance from you, but I could imagine the following UCs:

UC 1 - compute amd from a structure

1. Simon wants to compute the amd from a structure as he is working in diffpy-cmi
2. Simon passes the structure to diffpy.similarity (ds)
3. ds computes the amd and returns it to Simon

UC 2 - pdd

As UC 1.1-1.3, but Simon wants to compute the pdd

UC 3 - amd-similarity

1. Tina wants to compute the amd similarity (amd-s) between two structures
2. Tina gives both structures to amd_s()
3. amd_s computes the amd-s and returns it to Tina

UC 4 - pdd-similarity

As UC 3.1-3.3, but Tina wants the pdd-s

UC 5 - to scale

As UC 3, but a pairwise computation of amd or pdd (or whatever similarity measure we have in the future) over a large set of structures with good performance
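To make the use cases tangible, here is a toy, pure-Python sketch of the quantities involved. It is a brute-force illustration under stated assumptions, not the algorithm used by any existing package: a structure is just a `(lattice, frac_coords)` tuple, the `shells` parameter (how many neighboring cells to search) is invented for this sketch, the PDD omits the merging of identical rows into weighted rows, and the L-infinity metric for `amd_s` is an assumed choice.

```python
import itertools
import math


def pdd(lattice, frac_coords, k, shells=2):
    """Toy PDD: for each motif point, its k smallest distances to other points.

    lattice holds three Cartesian lattice vectors (rows); frac_coords holds
    fractional positions. shells must be large enough for the chosen k.
    """

    def to_cart(frac):
        return [sum(frac[i] * lattice[i][j] for i in range(3)) for j in range(3)]

    motif = [to_cart(f) for f in frac_coords]
    # Periodic images of the motif in a block of (2 * shells + 1)^3 cells.
    images = []
    for shift in itertools.product(range(-shells, shells + 1), repeat=3):
        offset = to_cart(shift)
        for idx, point in enumerate(motif):
            images.append((shift, idx, [point[j] + offset[j] for j in range(3)]))

    rows = []
    for idx, point in enumerate(motif):
        dists = sorted(
            math.dist(point, image)
            for shift, image_idx, image in images
            if not (shift == (0, 0, 0) and image_idx == idx)  # skip the point itself
        )
        rows.append(dists[:k])
    return sorted(rows)


def amd(lattice, frac_coords, k, shells=2):
    """UC 1: AMD vector, the column-wise average of the PDD rows."""
    rows = pdd(lattice, frac_coords, k, shells)
    return [sum(column) / len(rows) for column in zip(*rows)]


def amd_s(struct_a, struct_b, k):
    """UC 3: L-infinity distance between the AMD vectors of two structures."""
    amd_a, amd_b = amd(*struct_a, k), amd(*struct_b, k)
    return max(abs(x - y) for x, y in zip(amd_a, amd_b))


def pairwise_amd_s(structures, k):
    """UC 5: all-pairs distances, computing each AMD vector only once."""
    vectors = [amd(*s, k) for s in structures]
    return [
        [max(abs(x - y) for x, y in zip(u, v)) for v in vectors]
        for u in vectors
    ]


def cubic(a):
    return [[a, 0.0, 0.0], [0.0, a, 0.0], [0.0, 0.0, a]]


simple_cubic = (cubic(1.0), [[0.0, 0.0, 0.0]])
body_centered = (cubic(1.0), [[0.0, 0.0, 0.0], [0.5, 0.5, 0.5]])

print(amd(*simple_cubic, 6))
print(amd_s(simple_cubic, body_centered, 6))
print(pairwise_amd_s([simple_cubic, body_centered], 6))
```

Computing each descriptor once and only then comparing, as `pairwise_amd_s` does, is one simple way to keep the cost of UC 5 down: the expensive neighbor search runs once per structure rather than once per pair.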

sbillinge · 2025-04-24T22:07:36Z

btw, please don't put the cif files in the PR yet. I would prefer that we pass in structures as diffpy structure objects so we are not wedded to a particular back-end (i.e., a file system). In general we want to separate the I/O from the functionality to make the functions more reusable. This means we will code up the test structures in the test function itself. If we do test with files, we prefer to build a pytest fixture that creates a file system at test time to test the I/O. But in this case I am not sure we even want to do that.
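A small sketch of that separation, under stated assumptions: `ToyStructure` is a hypothetical stand-in for a diffpy structure object, `n_sites` stands in for the real metric functions, and the text format read by `load_toy_structure` is made up for the example. The first test builds its structure inline; the second exercises the I/O layer against a directory created at test time.

```python
import tempfile
from dataclasses import dataclass
from pathlib import Path


@dataclass
class ToyStructure:
    """Hypothetical stand-in for a diffpy structure object."""

    lattice_a: float
    frac_coords: list


def n_sites(structure):
    """Core logic: works on an in-memory object and never touches the disk."""
    return len(structure.frac_coords)


def load_toy_structure(path):
    """I/O layer: parse a made-up format (lattice constant, then one site per line)."""
    lines = [line for line in Path(path).read_text().splitlines() if line.strip()]
    coords = [[float(x) for x in line.split()] for line in lines[1:]]
    return ToyStructure(lattice_a=float(lines[0]), frac_coords=coords)


def test_n_sites_with_inline_structure():
    # The test structure is written out in the test itself; no files involved.
    structure = ToyStructure(1.0, [[0.0, 0.0, 0.0], [0.5, 0.5, 0.5]])
    assert n_sites(structure) == 2


def test_load_toy_structure(tmp_path):
    # Under pytest, tmp_path is the built-in fixture providing a fresh directory.
    path = tmp_path / "toy.txt"
    path.write_text("1.0\n0.0 0.0 0.0\n0.5 0.5 0.5\n")
    assert n_sites(load_toy_structure(path)) == 2


if __name__ == "__main__":
    # Run without pytest by supplying a temporary directory by hand.
    test_n_sites_with_inline_structure()
    with tempfile.TemporaryDirectory() as tmp:
        test_load_toy_structure(Path(tmp))
```

Because `n_sites` only sees an object, the same function can be fed from a file, a database, or another program without modification.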

sbillinge · 2025-04-24T22:08:53Z

I copy-pasted the UCs as an issue. Let's leave that issue open and use it to collect our UCs. Feel free to add UCs (try to use the same pattern) if you can think of any. It is like a storyboard. Just because we make a UC doesn't mean that we will implement the functionality, so don't hold back. This is a chance to capture everyone's ideas of what we want to be able to do with this code.

tinatn29 and others added 17 commits April 23, 2025 15:00

719b01e add test for amd
3c64958 add cif files for tests
771a24d edit docstring
7596957 edit amd and pdd functions
d6ac6ea edit tests
001b632 add test data
2b81942 Merge pull request #1 from tinatn29/tn-add-amd (tn add amd)
787ff97 [pre-commit.ci] auto fixes from pre-commit hooks
75a9d1e exclude tests/test_data and notebooks/
12e1632 fix end of file
4539dba fix flake8
b55765e fix codespell
13c88cf add init files
1fd556b add news item
5840608 [pre-commit.ci] auto fixes from pre-commit hooks
0d7812a whitespace trimming
97efc32 Merge branch 'tn-fix-flake8' of https://github.com/tinatn29/diffpy.similarity into tn-fix-flake8

sbillinge reviewed Apr 24, 2025

8765d72 delete __init__ from src/

tinatn29 added 3 commits April 24, 2025 17:30

3d3a5e4 separate amd and pdd
b5af572 separate tests
c2301fe edit news item
