You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<p>Pyquantify is a powerful CLI tool for semantic analysis. It leverages natural language processing to unveil insights from text, files, or websites, empowering sophisticated data visualization and exploration.</p>
4
+
Pyquantify is a powerful CLI tool for semantic analysis. It leverages natural language processing to unveil insights from text, files, or websites, empowering sophisticated data visualization and exploration.
1.**Data Loading**: Load text data from raw input, files, or websites with interactive prompts for user input.
22
+
1.**Text Summarization**:
23
+
- Utilizes the BERT model for summarizing text.
24
+
- Provides caching functionality to speed up summarization for previously processed text.
25
+
- Supports exporting summaries to text files.
31
26
32
-
2.**Metrics Generation**: Calculate and display key metrics, including character count (with and without spaces), sentence count, word count, and paragraph count.
27
+
2.**Text Analysis**:
28
+
- Preprocesses text data including tokenization and part-of-speech tagging.
29
+
- Generates various metrics such as character count, word count, sentence count, etc.
30
+
- Analyzes morphological data including lemmatized forms, part-of-speech tags, and word frequencies.
31
+
- Performs sentiment analysis using the TextBlob library.
32
+
- Visualizes data through word clouds and word frequency charts.
33
33
34
-
3.**Morphological Analysis**: Generate a detailed table of word morphology, including word rank, original form, lemmatized form, part-of-speech (POS) tag, percentage occurrence, and count.
34
+
3.**Text Processing**:
35
+
- Offers functionality for cleaning and preprocessing text data.
36
+
- Implements functions for generating word clouds and word frequency charts.
37
+
- Calculates cosine similarity between two texts.
35
38
36
-
4.**Export Functionality**: Optionally export generated metrics, frequency tables, and visualizations to files.
39
+
4.**Data Loading and Exporting**:
40
+
- Supports loading text data from raw input, files, or websites.
41
+
- Provides export functionality for analyzed data, summaries, sentiment analysis results, and keywords extracted from text.
37
42
38
-
5.**Visualization**:
39
-
- Generate and visualize the frequency of the top 20 words in the text.
40
-
- Create and display a word cloud visualization of processed text data.
43
+
5.**CLI Interface**:
44
+
- Implements a command-line interface (CLI) using Click library.
45
+
- Offers commands for various text analysis and summarization tasks, including data visualization and sentiment analysis.
46
+
- Provides options for specifying data loading mode and exporting analysis results.
41
47
42
-
6.**Interactive Commands**: Utilize command-line interface commands for actions like displaying metrics, limiting results, searching for specific words, and generating visualizations.
48
+
6.**Parallel Processing**: Utilizes multiprocessing and concurrent.futures for parallel processing of tasks, improving performance for tasks like sentiment analysis and summarization.
43
49
44
-
7.**Summarize Text**: Summarize text using a BERT Extractive Summarizer.
50
+
7.**Unit Testing**:
51
+
- Includes unit tests for different modules and functionalities using the `unittest` framework.
52
+
- Uses mocking to isolate and test individual components such as data loading, summarization, and exporting.
45
53
46
-
8.**Sentiment Analysis**:
47
-
- Perform sentiment analysis on the text.
48
-
- Provides insights into sentiment polarity and subjectivity.
54
+
8.**Exception Handling and Error Reporting**:
55
+
- Handles exceptions gracefully and provides informative error messages.
56
+
- Reports errors such as unsupported operating systems, file not found, and invalid input modes.
-`--mode`: Specify the data loading mode for the first text (raw input, file, or website).
203
+
-`--other`: Specify the data loading mode for the second text (raw input, file, or website).
204
+
165
205
Feel free to explore additional options and functionalities by checking the help documentation for each command:
166
206
167
207
```bash
168
208
pyquantify [command] --help
169
209
```
170
-
</div>
171
210
172
211
173
-
<h2><aid="ques">:grey_question: FAQ</a></h2>
212
+
## FAQ
213
+
174
214
175
215
### Q: What is Pyquantify?
176
216
@@ -187,12 +227,15 @@ Originally conceived as a word frequency counter, Pyquantify's development took
187
227
### Q: Why the name change from NLPFreq to Pyquantify?
188
228
189
229
NLPFreq felt limiting and didn't capture the full scope of the project. Pyquantify more accurately reflects its capabilities as a Python-based tool for quantitative data analysis.
-[How to write a Good readme](https://bulldogjob.com/news/449-how-to-write-a-good-readme-for-your-github-project)
197
240
198
-
**Note:** Pyquantify has undergone thorough testing on Linux, and its functionality is confirmed to work seamlessly. However, it's important to note that when running on Windows Subsystem for Linux (WSL), certain features may have limited functionality due to the absence of the complete Linux toolset in the WSL environment.
241
+
-[NeuralNine's Publish Your Own Python Package](https://www.youtube.com/watch?v=tEFkHEKypLI)
0 commit comments