Investigate vocab cooc irregularities #333

Open
Yomguithereal opened this issue Oct 30, 2024 · 1 comment
Labels
bug Something isn't working

Comments

Yomguithereal (Member) commented Oct 30, 2024

xan tokenize transcript --keep word -m 3 sotu.csv --sep ' ' -p | xan vocab cooc --sep ' ' tokens --doc president -w 2 | xan v -p
xan tokenize transcript --keep word -m 3 sotu.csv --sep ' ' -p | xan vocab cooc --sep ' ' tokens --doc president -w 2 --distrib --min-count 5 | xan top sdG2 -l 50 | xan v
Yomguithereal added the bug label Oct 30, 2024
Yomguithereal (Member, Author) commented

Windows are incorrect and don't follow intuition. The window size must be > 1, and w = 2 means the current word and the next one only.
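
To make the window semantics described above concrete, here is a minimal Python sketch. It is not xan's implementation: the function names `forward_window_pairs` and `symmetric_window_pairs` are hypothetical, and the symmetric variant is only an assumption about what a more intuitive window might look like.

```python
from collections import Counter


def forward_window_pairs(tokens, w):
    """Count ordered pairs where the window is the current token plus the next w - 1 tokens.

    This mirrors the behaviour described in the comment: w must be > 1,
    and w = 2 pairs each token with the next one only.
    """
    if w < 2:
        raise ValueError("window must be > 1 to contain at least one pair")
    counts = Counter()
    for i, source in enumerate(tokens):
        # Slice stops at i + w, so w = 2 yields exactly one following token.
        for target in tokens[i + 1 : i + w]:
            counts[(source, target)] += 1
    return counts


def symmetric_window_pairs(tokens, radius):
    """Count ordered pairs where each token sees up to `radius` neighbours on both sides.

    Hypothetical alternative, shown only for comparison.
    """
    if radius < 1:
        raise ValueError("radius must be >= 1")
    counts = Counter()
    for i, source in enumerate(tokens):
        start = max(0, i - radius)
        end = min(len(tokens), i + radius + 1)
        for j in range(start, end):
            if j != i:
                counts[(source, tokens[j])] += 1
    return counts


if __name__ == "__main__":
    tokens = ["state", "of", "the", "union"]
    print(forward_window_pairs(tokens, 2))
    print(symmetric_window_pairs(tokens, 1))
```

With the forward-only definition, `w = 1` can never produce a pair, which is one way to read the "must be > 1" constraint.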
