-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use the ghc api to get imports #32
Conversation
This replaces our own hand rolled parser.
We no longer use it!
-- TODO[GL]: propagate error instead | ||
Left err -> do | ||
GHC.printBagOfErrors dynFlags err | ||
pure Nothing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe I should use error
here instead, as was previously done.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think an error would be preferable. If we don't assume bugs in our code, it would indicate something to fix in the source file.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
This will introduce more work over at #25 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @googleson78! Looks pretty good. I left some comments.
|
||
let | ||
-- [GL] The fact that the resulting strings here contain the "-X"s makes me a bit doubtful that this is the right approach, | ||
-- but this is what I found for now. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
usesTH
is an heuristic that offers the user a convenient way to show that TH is needed, not an exact detection solution. 🤷
-- TODO[GL]: propagate error instead | ||
Left err -> do | ||
GHC.printBagOfErrors dynFlags err | ||
pure Nothing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think an error would be preferable. If we don't assume bugs in our code, it would indicate something to fix in the source file.
map GHC.unLoc $ | ||
GHC.getOptions dynFlags sb filePath | ||
-- TODO: should both of the files here be filePath? | ||
GHC.getImports dynFlags sb filePath filePath >>= \case |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
getImports
can throw exceptions. There are functions to get those formatted too, but perhaps it is fine to have them kill the program without further ado.
We would crash later anyways, because the go part can't parse the message we produce there. This is justified because if the ghc parser fails something is very wrong.
It's fine to pass both filePath as both arguments there because they provide enough info and we aren't that worried about formatting of error messages.
haskell_toolchain_library(name = "directory") | ||
|
||
haskell_toolchain_library(name = "ghc") | ||
|
||
# Only necessary to be able to import GHC.Platform in HImportScan.GHC.Settings | ||
# TODO[GL]: we can probably remove this once we only support ghc >=9.2 | ||
haskell_toolchain_library(name = "ghc-boot") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe this is redundant now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately it isn't - it seems I introduced another dependency on this package without knowing. Updated the comment.
In some closed source file I get this error,
Probably we will need to enable every extension that affects the parsing of the import/export lists ( I tested with and without |
Also, note that when we get the error message, there is no indication of the file that produced the failure. We should have |
I contributed some commits unrelated to my last comments. Let me know if you'd like any of those changed. |
@@ -98,7 +99,7 @@ scanImports dynFlags filePath contents = do | |||
-- This is far from ideal, however handling this better would require being able to communicate errors better to go. | |||
Left err -> do | |||
GHC.printBagOfErrors dynFlagsWithExtensions err | |||
error "ghc parsing failed" | |||
throwIO (GHC.mkSrcErr err) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this commit fix the issue where users don't know what parsing failed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, it doesn't. I tested this change before writing about it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Right now, after my change which splits stderr and stdout, we get the parse error that ghc reports, e.g.
/home/googleson78/git/gazelle_haskell_modules/example/package-a/app/Main.hs:9:59: error:
parse error on input `PatTrue'
|
9 | import PackageA.Other.C(type DataC, DataC(DataC), pattern PatTrue)
| ^^^^^^^
himportscan: parse error on input `PatTrue'
gazelle: himportscan exited unsuccessfully exit status 1
Is this good enough, or should we explicitly report the file name separately?
How big is the 1.8 times difference considering all the other processing and generating going on. Is it 1.8 for the whole process, or only for the |
It is 1.8 for the whole process.
I think this would be preferable. |
This turns out to be much faster (close to 2 times on real world project).
I'm very surprised, but currently (or in general?) I need to investigate more. |
I have confirmed that it returns a Even if we have a typo, e.g. However, I also checked the behaviour with the current implementation, and it (sensibly) behaves the same way. So I think that this might not be a regression at all (unlike I was previously assuming) and we can safely "ignore" it. |
I added |
d941324
to
eaefe78
Compare
They don't. But I think they will be easy to add as we discover them. Thanks for your insights and your patches. |
This replaces our own hand rolled parser.
Fixes #31