Skip to content

Keyword Extraction for Japanese Text using the RAKE Algorithm

License

Notifications You must be signed in to change notification settings

carol975/RAKE_JPN

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RAKE JPN

RAKE JPN is a Python tool for extracting keywords in Japanse.

This tool implements the RAKE (Rapid Automatic Keyword Extraction) algorithm proposed in: Rose, Stuart & Engel, Dave & Cramer, Nick & Cowley, Wendy. (2010). Automatic Keyword Extraction from Individual Documents. 10.1002/9780470689646.ch1.

The source code is released under the MIT License.

Installation

This tool requires python 3.5+

Clone the repository

$ git clone https://github.com/carol975/RAKE_JPN.git

Install the dependencies

$ cd RAKE_JPN
$ pip install -r requirements.txt

Usage Example

from rake import Rake
rake = Rake()
text = "杉山古墳(すぎやまこふん)は、奈良県奈良市大安寺にある古墳。形状は前方後円墳。大安寺古墳群を構成する古墳の1つ。国の史跡に指定されている(史跡「大安寺旧境内 附 石橋瓦窯跡」のうち)。" 
print(rake.get_keywords(text,3))  # get_keywords takes two params: 1. Text, 2. Max number of keywords to return, if zero then returns all keywords.

About

Keyword Extraction for Japanese Text using the RAKE Algorithm

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages