- Download the snowball stemmer .jar from Weka website (link).
- Put it into your Weka root directory.
- Modify your RunWeka.ini file at that directory so it seems like this: cp=%CLASSPATH%;snowball.jar
Note that you can also use a stopword list to remove them when processing the text with the filter. To do so, click over the text field at the GUI. A filechooser window should arise letting you choose your stopword list. The format of this file is: one word per line, lines starting with "#" are considered comments. You should take this into account when creating your own stopword list. Anyway, there are some available on the net for you to use.
No comments:
Post a Comment