Rapid Automatic Keyword Extraction (RAKE)
- Python for Integrated Circuits -
- An Online Book -
Python for Integrated Circuits: http://www.globalsino.com/ICs/


=================================================================================

Rapid Automatic Keyword Extraction (RAKE) is an unsupervised, domain-independent, and language-independent method for extracting keywords from individual documents [1]. RAKE is based on the observation that keywords frequently contain multiple words but rarely contain standard punctuation or stop words, such as the function words "and", "the", and "of", or other words with minimal lexical meaning.

RAKE begins keyword extraction by parsing the document text into a set of candidate keywords, using the steps below [1]:
          i) The document text is split into an array of words at the specified word delimiters.
          ii) This array is split into sequences of contiguous words at phrase delimiters and stop word positions.
          iii) The words within a sequence are assigned the same position in the text and together are considered a candidate keyword.
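
The three steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the reference implementation from [1]: the stop word list, the phrase delimiters, the word pattern, and the function name candidate_keywords are all hypothetical choices made for this example. A practical stop word list would be much longer.

```python
import re

# Assumption: a deliberately tiny stop word list, for illustration only.
STOP_WORDS = {"a", "an", "and", "for", "from", "in", "is", "it",
              "of", "on", "or", "the", "to"}

# Assumption: these punctuation marks act as phrase delimiters.
PHRASE_DELIMITERS = re.compile(r"[.,;:!?()]")

# Assumption: a word is a run of letters/digits, allowing inner ' and -.
WORD_PATTERN = re.compile(r"[A-Za-z0-9][A-Za-z0-9'-]*")


def candidate_keywords(text, stop_words=STOP_WORDS):
    """Return candidate keywords in their order of appearance."""
    candidates = []
    # Step ii (phrase delimiters): break the text at punctuation.
    for fragment in PHRASE_DELIMITERS.split(text.lower()):
        current = []  # contiguous non-stop words collected so far
        # Step i: split the fragment into an array of words.
        for word in WORD_PATTERN.findall(fragment):
            if word in stop_words:
                # Step ii (stop words): a stop word closes the sequence.
                if current:
                    candidates.append(" ".join(current))
                    current = []
            else:
                current.append(word)
        # Step iii: the words left in the sequence form one candidate.
        if current:
            candidates.append(" ".join(current))
    return candidates


sample = ("RAKE is a method for extracting keywords from individual "
          "documents, and it is domain-independent.")
print(candidate_keywords(sample))
```

With the assumed stop word list, the sample sentence yields the candidates 'rake', 'method', 'extracting keywords', 'individual documents', and 'domain-independent'. Multi-word candidates survive only where no stop word or punctuation mark interrupts them, which is the observation RAKE relies on.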

============================================

[1] Michael W. Berry and Jacob Kogan, Text Mining: Applications and Theory, 2010.

=================================================================================