feat(9586): implement freetext search in cht datasource #9625

sugat009 · 2024-11-07T07:26:49Z

Description

Closes: #9586

Code review checklist

Readable: Concise, well named, follows the style guide, documented if necessary.
Documented: Configuration and user documentation on cht-docs
Tested: Unit and/or e2e where appropriate
Internationalised: All user facing text
Backwards compatible: Works with existing data and configuration or includes a migration. Any breaking changes documented in the release notes.

Compose URLs

If Build CI hasn't passed, these may 404:

License

The software is provided under AGPL-3.0. Contributions to this project are accepted under the same license.

shared-libs/cht-datasource/src/local/report.ts

…text-search-in-cht-datasource

sugat009 · 2024-11-29T10:00:01Z

@jkuester PR is ready for review.

dianabarsan

I did a quick partial review and overall this is quite cool. I did leave some requests and questions inline.

api/src/controllers/report.js

dianabarsan · 2024-11-29T10:08:12Z

api/src/controllers/report.js

+
+module.exports = {
+  v1: {
+    get: serverUtils.doOrError(async (req, res) => {


I'm not a big fan of this callback style.

We've been doing this pattern for all the REST endpoints that call cht-datasource. What's the alternative?

IMHO doOrError is a nice way to reduce duplicated code and ensure we are handling errors consistently.

I think a try-catch block is not so much duplication, and it's more transparent than a nested callback.
I understand this already exists. I'm not a fan.

dianabarsan · 2024-11-29T10:09:55Z

api/src/controllers/contact.js

+        Object.assign(qualifier, Qualifier.byContactType(req.query.type));
+      }
+
+      const limit = req.query.limit ? Number(req.query.limit) : req.query.limit;


this seems strange that we assign a random non-truthy value (as in: whatever is in req.query.limit) instead of being specific.
Same applies to the reports controller.

Suggested change

const limit = req.query.limit ? Number(req.query.limit) : req.query.limit;

const limit = req.query.limit ? Number(req.query.limit) : false;

The false is a random pick.

The reason why req.query.limit is being passed when the conditional is falsy is that in the cht-datasource there is already a validation for this and also a default value. This is to ensure that the validation does not happen twice and also the default value.
Reference.

Then why not have cht-datasource also do the Number conversion then? Why have this validation here? Is limit ever expected to not be a number?

Is limit ever expected to not be a number?

Yeah, in cases like the one above, where it is passed as a query param in REST API, it is expected to be a stringified number. However, cht-datasource can also be used in non-REST API codes where end-users will have to pass in a number to cht-datasource because PouchDB expects the limit value to be a number. I think that's a reasonable approach to make the limit variable an explicit Number type, as that would align with the expected input for the PouchDB Adapter. This would provide better type safety and clarity in the code. The validation being present in cht-datasource still makes sense, and whether to apply the same validation elsewhere should be at the discretion of the end-user, based on their specific use case.

So even if it's a stringified number or a number, we still only ever evaluate it as a number. so it makes sense to only have validation in one spot, right?

Yes. Are you considering a type conversion from string to number a validation?

Ok, I'll use a different word than validation: processing.
It's possible to only do processing of the limit parameter in a single place: this includes validation, conversion and all the other things. We should do processing in a single place.

In most cases, I agree but I'm not sure why the conversion of limit from string to number should be designated to cht-datasource, it's not its concern.
Edit 1:
I might have misunderstood the above. Are you suggesting we do both conversion and validation in another place than both the API controller and cht-datasource?

I'm suggesting that conversion and validation can both be done in a single place: cht-datasource.

dianabarsan · 2024-11-29T10:11:49Z

api/src/controllers/report.js

+    getIds: serverUtils.doOrError(async (req, res) => {
+      await checkUserPermissions(req);
+
+      const qualifier = Qualifier.byFreetext(req.query.freetext);


So ... this endpoint .. if it doesn't get neither a freetext query param or a limit query param, it will end up returning ALL reports?

nope. it returns a 400 - Bad request error because freetext is required whereas limit is set to a default of 10000.

dianabarsan · 2024-11-29T10:12:53Z

api/src/routing.js

@@ -492,6 +494,12 @@ app.postJson('/api/v1/people', function(req, res) {
 app.get('/api/v1/person', person.v1.getAll);
 app.get('/api/v1/person/:uuid', person.v1.get);

+app.get('/api/v1/contact/id', contact.v1.getIds);


maybe /api/v1/contact/ids is more suitable.
The idea is that the URL isn't suggestive at all, without reading the implementation, I would never guess what this endpoint does.

Yes, REST API conventions are to name the API endpoint in a plural way like /api/v1/contacts or /api/v1/contacts/ids but this design decision had already been taken even before this ticket. I couldn't find the link to the conversation for this though.

That discussion happened in the parent ticket before we spun off the child isssue: #9544 (comment)

Thanks @jkuester . Your argument here is that "we've already decided and your input is not welcome?"

Sorry for being aggressive and confrontational in the above comment.

I maintain my comment about /api/v1/contact/id being quite unsuggestive, we shouldn't need thorough explanations and reasoning behind the naming choice in order for an api name to make sense.

I was just trying to provide the context for the discussion that Sugat referenced. 😬

I am happy to continue the design discussion here to come to an agreed upon approach. It will just be most efficient if we all understand what was already said to get us here. When starting work on new REST endpoints for the cht-datasource code, we chose to go with the pattern of singular entity names (so /api/v1/person instead of /api/v1/persons). When the endpoint can return 0-n entities I do not really see a compelling reason to prefer either singular or plural (since either might make more sense depending on the context). Two things seem clear to me though:

Under normal circumstances, we should not duplicate endpoints for the same resource (e.g. having both /api/v1/contact/id and /api/v1/contact/ids).

We should be consistent with our naming across our go-forward REST endpoints. Either using singular or plural, but not mixing both.

shared-libs/cht-datasource/src/remote/report.ts

dianabarsan · 2024-11-29T11:36:34Z

shared-libs/cht-datasource/test/local/contact.spec.ts

+        expect(getLineageDocsByIdOuter.calledOnceWithExactly(localContext.medicDb)).to.be.true;
+        expect(getDocsByIdsOuter.calledOnceWithExactly(localContext.medicDb)).to.be.true;


Same comment about assertions in afterEach and afterEach run order.

shared-libs/cht-datasource/test/remote/contact.spec.ts

dianabarsan · 2024-11-29T11:46:49Z

tests/integration/api/controllers/contact.spec.js

+  const expectedPlaces = [place0, clinic1, clinic2];
+  const expectedPlacesIds = expectedPlaces.map(place => place._id);
+
+  before(async () => {


All these tests should call the API endpoints, instead of requesting datasource code.

This was heavily discussed with QA when laying down the original pattern for these tests. The summary of the the IRL convos is here: #9090 (comment)

TLDR is we could duplicate tests, but the value of that seems nearly non-existant (while adding a non-zero cost in terms of additional tests to run).

Pleaaaase duplicate the tests :)

@sugat009 How feasible would it be to parameterize the tests where we are currently using cht-datasource so that we run the same test with both the cht-datasource remote wrapper and with the test utils making the same REST call?

I am thinking of something like this (but have not actually tried to run this yet):

[ (placeId) => Place.v1.get(dataContext)(Qualifier.byUuid(placeId)), (placeId) => { const opts = { path: `/api/v1/place/${placeId}`, }; return utils.request(opts); } ].forEach((getPlace) => { it('returns the place matching the provided UUID', async () => { const place = await getPlace(place0._id); expect(place).excluding(['_rev', 'reported_date']).to.deep.equal(place0); }); });

That would ensure we test both the cht-datasource Remote adapter as well as just using the test utils to hit the REST api "directly" without actually needing to duplicate every test...

This is a nice workaround and would probably be feasible and runnable but let's just duplicate the test code, it probably will be more maintainable and readable, and usually, test code doesn't need reusability.

dianabarsan · 2024-11-29T11:48:21Z

tests/integration/api/controllers/report.spec.js

+    const getReport = Report.v1.get(dataContext);
+
+    it('should return the report matching the provided UUID', async () => {
+      const resReport = await getReport(Qualifier.byUuid(report0._id));


All these tests should call the api endpoints, instead of request datasource code.

sugat009 added 2 commits November 6, 2024 18:29

Initial setup

ac0ab67

add /api/v1/contacts/:uuid endpoint

cd11adc

sugat009 linked an issue Nov 7, 2024 that may be closed by this pull request

Implement freetext search in cht-datasource #9586

Open

sugat009 added 4 commits November 7, 2024 18:21

add /api/v1/contact/:uuid?with_lineage=<option> endpoint

b7c5d1f

add missing files

f91e393

add missing files

52ac11b

add endpoint /api/v1/report

c1b68cf

sugat009 commented Nov 8, 2024

View reviewed changes

shared-libs/cht-datasource/src/local/report.ts Show resolved Hide resolved

sugat009 added 8 commits November 11, 2024 18:39

add additional checks for report validity

45c0ccf

add /api/contact/id endpoint (not tested yet)

acf903a

add endpoint /api/v1/report/id --untested

89927b1

implement search feature in /api/contact/id endpoint

45abf58

add search functionality to /api/report/id endpoint

6f5c326

add async API Contact.v1.getIdsAll

192af16

add async API Report.v1.getIds

bb98dcf

add JSDocs for functions

43efbef

sugat009 force-pushed the 9586-implement-freetext-search-in-cht-datasource branch from eba7aac to 43efbef Compare November 18, 2024 08:59

sugat009 added 13 commits November 18, 2024 16:53

Merge remote-tracking branch 'origin/master' into 9586-implement-free…

ede85fd

…text-search-in-cht-datasource

add unit tests for index.ts, qualifier.ts and contact.ts

2b33c07

add unit tests for report.ts

3bfcace

add tests for local/contact.js

074b0c1

add some additional tests in local/contact.spec.ts

1907907

add tests for local/report.ts

a629116

add unit tests for remote/contact.ts

c45cee8

add unit tests for remote/report.ts

5400736

add unit tests for contact-types.ts

bf46e4a

remove unused variables

b1cf669

add unit tests for api/src/controllers/contact.js

98fd4e2

add tests for api/src/controllers/report.js

ee32262

Merge remote-tracking branch 'origin/master' into 9586-implement-free…

b5bc6e6

…text-search-in-cht-datasource

sugat009 added 4 commits November 27, 2024 15:53

Merge remote-tracking branch 'origin/master' into 9586-implement-free…

4a18fc8

…text-search-in-cht-datasource

add integration test for api/src/controller/contact.js

eba1b61

add integration tests for api/src/controllers/report.js

9157059

fix eslint issues

d5fed9c

sugat009 marked this pull request as ready for review November 29, 2024 09:59

dianabarsan requested changes Nov 29, 2024

View reviewed changes

sugat009 added 6 commits December 2, 2024 10:05

changes missed during copy-paste

30d058e

change title of tests

662598c

fix indentation

89eb816

fix max-line 120 issue

3ea31ef

rename getIdsAll functions to getIds

a7a9272

add tests in controller unit tests for when limit is undefined or null

ef8b93f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(9586): implement freetext search in cht datasource #9625

feat(9586): implement freetext search in cht datasource #9625

sugat009 commented Nov 7, 2024 •

edited by github-actions bot

Loading

sugat009 commented Nov 29, 2024

dianabarsan left a comment

dianabarsan Nov 29, 2024

sugat009 Dec 2, 2024

jkuester Dec 2, 2024

dianabarsan Dec 3, 2024

dianabarsan Nov 29, 2024

sugat009 Nov 29, 2024

dianabarsan Dec 3, 2024

sugat009 Dec 3, 2024

dianabarsan Dec 3, 2024

sugat009 Dec 3, 2024

dianabarsan Dec 3, 2024

sugat009 Dec 3, 2024 •

edited

Loading

dianabarsan Dec 3, 2024

dianabarsan Nov 29, 2024

sugat009 Nov 29, 2024

dianabarsan Nov 29, 2024

sugat009 Nov 29, 2024

jkuester Dec 2, 2024

dianabarsan Dec 3, 2024

dianabarsan Dec 3, 2024

jkuester Dec 3, 2024

dianabarsan Nov 29, 2024

dianabarsan Nov 29, 2024

jkuester Dec 2, 2024

dianabarsan Dec 3, 2024

jkuester Dec 3, 2024

jkuester Dec 3, 2024

sugat009 Dec 3, 2024

dianabarsan Nov 29, 2024

	const limit = req.query.limit ? Number(req.query.limit) : req.query.limit;
	const limit = req.query.limit ? Number(req.query.limit) : false;

		expect(getLineageDocsByIdOuter.calledOnceWithExactly(localContext.medicDb)).to.be.true;
		expect(getDocsByIdsOuter.calledOnceWithExactly(localContext.medicDb)).to.be.true;

feat(9586): implement freetext search in cht datasource #9625

Are you sure you want to change the base?

feat(9586): implement freetext search in cht datasource #9625

Conversation

sugat009 commented Nov 7, 2024 • edited by github-actions bot Loading

Description

Code review checklist

Compose URLs

License

sugat009 commented Nov 29, 2024

dianabarsan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sugat009 Dec 3, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sugat009 commented Nov 7, 2024 •

edited by github-actions bot

Loading

sugat009 Dec 3, 2024 •

edited

Loading