Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

start playing with hat tests #1815

Draft
wants to merge 10 commits into
base: main
Choose a base branch
from
Draft

start playing with hat tests #1815

wants to merge 10 commits into from

Conversation

josharian
Copy link
Collaborator

for voice discussion. definitely work in progress.

@josharian josharian force-pushed the josh/hat-stat-tests branch 2 times, most recently from 8610ad1 to f8770ab Compare August 20, 2023 01:46
@josharian
Copy link
Collaborator Author

josharian commented Aug 20, 2023

to discuss (notes to self):

  • general approach
  • allocation determinism
  • formatting etc, relation to other draft prs
  • performance
  • cross-package imports
  • better hat representation? (e.g. _ for default, single height for colors, double for both? mixes line meanings, need unambiguous color/shape letters, but better visually??)
  • which expansions are in scope?

todo (notes to self):

  • remove normalization routines
  • work through all todos in code
  • rename InMemory to Mock?
  • performance
  • add other language samples
  • try out on other hat experiments (including non-buggy first word experiment)
  • don't forget about logging :)
  • concise positions, even more concise

@josharian josharian added the to discuss Plan to discuss at meet-up label Aug 20, 2023
@josharian josharian force-pushed the josh/hat-stat-tests branch 2 times, most recently from 1190188 to 9179428 Compare August 20, 2023 05:18
Copy link
Member

@pokey pokey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just left quick comment; haven't really reviewed

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh wow ok. We were planning to just use https://github.com/microsoft/vscode-languageserver-node as we'll need that for lsp anyway 😅. Looks like your code goes a bit further, as they stop at text document and you go all the way to editor. But would be good to use as much of their code as possible as we'll be pulling that in anyway

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pokey I went to pull this in and hit a speed bump. Their TextDocument type implements a smaller interface than the cursorless TextDocument interface:

Type 'TextDocument' is missing the following properties from type 'TextDocument': range, eol, lineAt ts(2739)

Range and eol are trivial, but unfortunately, lineAt returns a non-trivial TextLine interface, so there's a meaningful amount of code missing.

I could try to (a) shrink our interface or (b) build our mock document as a wrapper around theirs, harder because of the TextLine thing, or (c) just continue with our own top-to-bottom mock. Or I guess (d) try to upstream code to make theirs match our needs.

Thoughts?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd lean towards a), falling back to b) as necessary

Copy link
Collaborator Author

@josharian josharian Sep 6, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(a) did not go well. We use the missing properties a lot. Half an hour in, and several false victories later...

Moving on to plan b.

Screenshot 2023-09-06 at 4 51 51 PM

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sigh. Plan B isn't promising either. We still need our implementation of TextLine, and re-implementing the rest in terms of their interface is janky. Not done, but the diff-stat isn't overwhelming:

 packages/common/src/testUtil/mockEditor.ts | 51 +++++++++++++++++----------------------------------

I think we should either (a) have you take a crack at this or (b) discuss again at a meet-up. Maybe there's something clever we can do that I'm missing...but these two interfaces don't line up nicely.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Moved this discussion to #1882

@josharian
Copy link
Collaborator Author

update from meeting: this is promising. i will continue to tinker with it.

@josharian josharian removed the to discuss Plan to discuss at meet-up label Aug 22, 2023
@josharian josharian force-pushed the josh/hat-stat-tests branch from 1857eb7 to 59206d4 Compare August 24, 2023 01:09
@josharian josharian marked this pull request as ready for review August 24, 2023 01:15
@josharian
Copy link
Collaborator Author

josharian commented Aug 24, 2023

this is the sort of thing that I could accidentally tinker with forever. there are lots of improvements that can be made and lots of directions to go. but the best possible direction is submitted, where it can be improved incrementally. :)

I know that this cannot go in as it is--it is lacking a fair bit of polish--but I would like to start the review process, because I think it will take several iterations. (btw, this currently includes #1825, because it is ~unusable without it.)

feel free to only do a high level review at this point if you would prefer. over voice would also be okay. up to you.

@josharian
Copy link
Collaborator Author

also, if you have any talon, python or typescript files of a medium-ish length, with no licensing issues, and that doesn't pertain directly to cursorless (so that it doesn't constantly appear when we grep the codebase), I love to add them...

@josharian
Copy link
Collaborator Author

josharian commented Aug 29, 2023

update from meetup:

  • we will use fake hat colors and shapes that have really obvious ascii representations. we don't need blue/fox/etc.
  • overall ascii representation looking promising. we explored a bunch of alternatives, but didn't find one we liked better than this.
  • josh to try to continue to speed up hat allocation so that the tests don't time out.
  • josh to hook up update fixture vs test fixture code.

@josharian
Copy link
Collaborator Author

josharian commented Sep 1, 2023

current status

early reviewer input requested

can happen concurrently with my next steps

  • what do we want to do about mocking the text editor? see start playing with hat tests #1815 (comment)
  • are you happy with the .golden and .stats files?
  • would love some sample files in other languages, particularly typescript, python, and a LISP-y language (or something else that is punctuation-heavy). see start playing with hat tests #1815 (comment)
  • are you happy with how this is triggered? (update fixtures to update, tests fixtures during unit tests)
  • is there a way to avoid bringing up the extension window during "update fixtures subset" if none of the tests in the subset needed?
  • finish reviewing the outstanding hat alloc perf PRs

my next steps

  • figure out the CRLF issue on windows
  • do a code cleanliness and documentation pass
  • incorporate any of the early reviewer input
  • merge the outstanding hat alloc perf PRs, pull them in

once all these are done, then we can move on to a regular, full, code-level review.

@pokey
Copy link
Member

pokey commented Sep 4, 2023

chatgpt? 😄 Happy to add some tho if you prefer

  • are you happy with how this is triggered? (update fixtures to update, tests fixtures during unit tests)

yep makes sense

  • is there a way to avoid bringing up the extension window during "update fixtures subset" if none of the tests in the subset needed?

not sure if this is what you're after, but we should have a launch config that updates fixtures only for the unit tests. should be trivial to add. and that won't pop up the extension window at all I believe

@@ -0,0 +1,159 @@
_ _ _ _ _ _
// Package intern interns strings.
[] [-----] [----] [-----] [-----]|
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I like this convention

func String(s string) string {
[--] [----]|| [----]| [----] |

B AA B 1 B CC2D B A3 A4 D
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do all these letters / numbers mean? I guess they're colors / shapes. And colored shape uses two rows?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added detail to the readme

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

where's the cursor? always at 0:0?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added to read me

Comment on lines 4 to 7
▁▁▁▁▁▁▃▁▃▁▁▁▃▃▁▁▁▃▁▁▃▁▁▃▁▁▃▃▃▁▁▃▃▁▃▁▃▃▃▃▃▁▃▃▁▃▃▃▃▃▃▃▁▃▃▃▁▃▁▃
▁▁▃▃▁▃▃▃▃▃▁▃▁▃▃▃▁▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃
▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▅▃▃▃▃▃▃▃▅▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▅▃▃▃▃▃
▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▃▅▃▃▃▃▃▃▃▅▃▃▃▃▃▃▃▃▅▃▃▅▃▅▃▅▃▅▃▅▃
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how am I supposed to read this chart? Is the x tokens, and the y penalty?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added to read me, please let me know if it is insufficient

Comment on lines +9 to +14
nHats:
mean: 100%
std: 0%
min: 100%
max: 100%
spark: 0 25 50 75 █ 100
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what do these statistics mean?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added to read me

std: 1%
min: 80%
max: 82%
spark: 0 25 50 75 ▂█▆ 100
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's this funny shape?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added to read me, might not be sufficient, please let me know if you still have questions

@josharian
Copy link
Collaborator Author

chatgpt? 😄 Happy to add some tho if you prefer

I'd rather use human written code. I'll grab some from a friend if you don't have some handy.

we should have a launch config that updates fixtures only for the unit tests

done

@josharian
Copy link
Collaborator Author

i have sinned...force pushed. sorry. will switch back to regular review mode now.

.gitattributes Outdated
@@ -0,0 +1,3 @@
# do not modify line endings of our hat test golden files
packages/cursorless-engine/src/test/fixtures/hat-stats/*.golden -crlf
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought the attribute is called eol=lf, or -text for "don't mangle this at all"

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh no not this again 😅. What's the issue you're trying to solve here @josharian?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It’s solved now, thanks to @auscompgeek .

The issue was that git was adding CR to the golden files on checkout, so the test that the golden files were unmodified was failing.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would prefer to avoid gitattributes if at all possible. If people want to check things out as CRLF, I don't see why we should get in the way of that. The last time we ran into this problem, we solved it by normalizing when we read the gold file. See

// We use this to ensure that the test works on Windows. Depending on user
// / CI git config, the file might be checked out with CRLF line endings
.replaceAll("\r\n", "\n");

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK. If you look at the commit history, I’ve also implemented that, when you gitattributes originally did not work. Although to avoid needless diffs, we also have to normalize at writing time as well…sigh.

@josharian
Copy link
Collaborator Author

marking as draft until #1882 is sorted

@josharian josharian marked this pull request as draft September 11, 2023 02:24
github-merge-queue bot pushed a commit that referenced this pull request Oct 17, 2023
fidgetingbits pushed a commit to fidgetingbits/cursorless that referenced this pull request Nov 3, 2023
@pokey pokey added the blocked Blocked on something; eg another PR being completed. Look in comments of issue / PR for reason label Jun 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
blocked Blocked on something; eg another PR being completed. Look in comments of issue / PR for reason
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants