A question about adding a custom dict. #675
Replies: 1 comment 3 replies
-
|
Beta Was this translation helpful? Give feedback.
3 replies
Answer selected by
bact
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
hello,
I want to receive a compound word as one token, and tag the token I want with the part-of-speech.
For example, when the film title is "ปกป้องโลก", "ปกป้องโลก" is taken as one token and tagged as a "noun".
I understand that I can use pythainlp's custom_dict to solve this.
But I have two questions.
=============[question]===============
1. I want to use "attacut" among the tokenizer engines,
but if custom_dict and engine=attacut are set together, the custom dictionary is not applied.
How do I apply a custom dict while using attacut?
2. If there is a movie title called "ทะเลาะกัน", It's a verb, but I want it to be tagged as a noun.
Can you add not only words but also POS to the custom dict so that it is reflected in the result?
Beta Was this translation helpful? Give feedback.
All reactions