java.lang.Object edu.northwestern.at.utils.corpuslinguistics.stemmer.PorterStemmer
public class PorterStemmer
Stemmer implementing the Porter Stemming Algorithm. This class transforms a word into its root form. The input word can be provided one character at a time (by calling add()) and then stemmed with stem(), or supplied all at once by calling stem(java.lang.String word).
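The character-at-a-time protocol described above can be sketched as follows. This is a minimal illustration under stated assumptions, not code from the library: CharStemmer is a hypothetical interface that only mirrors the four documented public signatures so the snippet compiles without the library on the classpath, and ToyStemmer is a toy stand-in that drops one trailing 's' and does not implement the Porter algorithm. The sketch also assumes that the result buffer can be longer than the stemmed word, which is why it reads getResultLength() before copying.

```java
// Hypothetical interface mirroring the documented public signatures of
// PorterStemmer; it exists only so this sketch is self-contained.
interface CharStemmer {
    void add(char ch);
    void stem();
    char[] getResultBuffer();
    int getResultLength();
}

final class StemmerUsageSketch {
    /** Feeds a word one character at a time, stems it, and copies the result out of the buffer. */
    static String stemViaBuffer(CharStemmer stemmer, String word) {
        for (int n = 0; n < word.length(); n++) {
            stemmer.add(word.charAt(n));
        }
        stemmer.stem();
        // Assumption: the buffer may hold more characters than the result,
        // so only the first getResultLength() characters are copied.
        return new String(stemmer.getResultBuffer(), 0, stemmer.getResultLength());
    }

    /** Toy stand-in, NOT the Porter algorithm: it only drops one trailing 's'. */
    static final class ToyStemmer implements CharStemmer {
        private char[] buf = new char[16];
        private int len = 0;       // characters added so far
        private int resultLen = 0; // length of the last stemmed word

        public void add(char ch) {
            if (len == buf.length) {
                buf = java.util.Arrays.copyOf(buf, len * 2); // grow when full
            }
            buf[len++] = ch;
        }

        public void stem() {
            resultLen = (len > 1 && buf[len - 1] == 's') ? len - 1 : len;
            len = 0; // ready for the next word
        }

        public char[] getResultBuffer() { return buf; }

        public int getResultLength() { return resultLen; }
    }
}
```

With the library on the classpath, the same call sequence would apply directly to a PorterStemmer instance: add(char) per character, then stem(), then new String(getResultBuffer(), 0, getResultLength()). When a String result is all that is needed, stem(java.lang.String word) covers the whole sequence in one call.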
Field Summary | |
---|---|
protected char[] | b |
protected int | i |
protected int | i_end |
protected static int | INC |
protected int | j |
protected int | k |
Constructor Summary | |
---|---|
PorterStemmer()
|
Method Summary | |
---|---|
void | add(char ch): Add a character to the word being stemmed. |
void | add(char[] w, int wLen): Add wLen characters from a char[] array to the word being stemmed. |
protected boolean | cons(int i) |
protected boolean | cvc(int i) |
protected boolean | doublec(int j) |
protected boolean | ends(java.lang.String s) |
char[] | getResultBuffer(): Returns a reference to a character buffer containing the results of the stemming process. |
int | getResultLength(): Returns the length of the word resulting from the stemming process. |
protected int | m() |
protected void | r(java.lang.String s) |
protected void | setto(java.lang.String s) |
void | stem(): Stem the word placed into the Stemmer buffer through calls to add(). |
java.lang.String | stem(java.lang.String word): Find stem of a single word. |
protected void | step1() |
protected void | step2() |
protected void | step3() |
protected void | step4() |
protected void | step5() |
protected void | step6() |
java.lang.String | toString(): After a word has been stemmed, it can be retrieved with toString(). Alternatively, a reference to the internal buffer can be obtained with getResultBuffer() and getResultLength(), which is generally more efficient. |
protected boolean | vowelinstem() |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
protected char[] b
protected int i
protected int i_end
protected int j
protected int k
protected static final int INC
Constructor Detail |
---|
public PorterStemmer()
Method Detail |
---|
public void add(char ch)
public void add(char[] w, int wLen)
public java.lang.String toString()
Overrides: toString in class java.lang.Object
public int getResultLength()
public char[] getResultBuffer()
protected final boolean cons(int i)
protected final int m()
protected final boolean vowelinstem()
protected final boolean doublec(int j)
protected final boolean cvc(int i)
protected final boolean ends(java.lang.String s)
protected final void setto(java.lang.String s)
protected final void r(java.lang.String s)
protected final void step1()
protected final void step2()
protected final void step3()
protected final void step4()
protected final void step5()
protected final void step6()
public void stem()
public java.lang.String stem(java.lang.String word)
Specified by: stem in interface Stemmer
Parameters: word - The word whose stem is desired.
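The protected helpers cons(int) and m() carry no descriptions on this page. Their names match the reference implementation of Porter's algorithm, where a letter is a consonant unless it is a, e, i, o, u, or a y preceded by a consonant, and the measure m of a word written as [C](VC)^m[V] counts its vowel-to-consonant transitions. The sketch below illustrates those two published definitions only. It assumes, without confirmation from this page, that cons and m behave the same way here. PorterMeasureSketch, isConsonant, and measure are hypothetical names, and the code works on a lowercase String rather than on the internal buffer b.

```java
final class PorterMeasureSketch {
    /** True if the lowercase letter at index i counts as a consonant in Porter's definition. */
    static boolean isConsonant(String w, int i) {
        switch (w.charAt(i)) {
            case 'a': case 'e': case 'i': case 'o': case 'u':
                return false;
            case 'y':
                // 'y' is a consonant at the start of the word or after a vowel.
                return i == 0 || !isConsonant(w, i - 1);
            default:
                return true;
        }
    }

    /** Counts VC sequences: the m in the form [C](VC)^m[V]. */
    static int measure(String w) {
        int m = 0;
        boolean previousWasVowel = false;
        for (int i = 0; i < w.length(); i++) {
            boolean consonant = isConsonant(w, i);
            if (consonant && previousWasVowel) {
                m++; // a vowel run has just been closed by a consonant
            }
            previousWasVowel = !consonant;
        }
        return m;
    }
}
```

For example, measure("tree") is 0, measure("trouble") is 1, and measure("troubles") is 2, matching the examples in Porter's description of the algorithm. In the reference implementation the measure is taken over the stem that precedes a candidate suffix rather than over the whole word.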