Stop Words Removal

Description

Removes words that carry little meaning on their own, such as di (en: in/at), saya (en: I), or dari (en: from), before further processing. Uses a web service provided by the Faculty of Computer Science, University of Indonesia.

Try it: http://fws.cs.ui.ac.id/StopwordRemoverSampleClient/index.jsp

This task also performs case folding and removes non-alphanumeric characters.
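
As a rough local illustration of these two preprocessing steps, the sketch below lowercases the text and drops every character that is not an ASCII letter, digit, or whitespace. It is not the web service's implementation: the function name normalize is hypothetical, and the exact character rules the service applies are an assumption.

```python
import re

def normalize(text):
    """Lowercase the text and drop non-alphanumeric characters (assumed rules)."""
    folded = text.lower()  # case folding
    # Keep ASCII letters, digits, and whitespace; remove everything else.
    return re.sub(r"[^a-z0-9\s]", "", folded)

print(normalize("Teman lamanya!"))  # prints: teman lamanya
```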

Be warned: the word tidak (en: not) is also removed. Depending on what you do next, losing this negation may affect the result.

Requirement

  • Internet connection.

Example

Sample input:

Pak kepala desa tidak tahu bahwa 3 pencuri
di rumah itu adalah teman lamanya!

Sample output:

pak kepala desa tahu 3 pencuri
rumah teman
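
The sample above can be approximated offline with the minimal sketch below. It does not call the web service: STOP_WORDS is a hypothetical list containing only the words the sample shows being removed, and remove_stop_words is an illustrative name. Line breaks are preserved, as in the sample output.

```python
import re

# Hypothetical stop word list: only the words removed in the sample above.
STOP_WORDS = {"tidak", "bahwa", "di", "itu", "adalah", "lamanya"}

def remove_stop_words(text, stop_words=STOP_WORDS):
    """Case-fold, strip non-alphanumerics, and drop stop words line by line."""
    cleaned_lines = []
    for line in text.splitlines():
        # Case folding plus removal of non-alphanumeric characters (assumed rules).
        line = re.sub(r"[^a-z0-9\s]", "", line.lower())
        kept = [word for word in line.split() if word not in stop_words]
        cleaned_lines.append(" ".join(kept))
    return "\n".join(cleaned_lines)

sample = (
    "Pak kepala desa tidak tahu bahwa 3 pencuri\n"
    "di rumah itu adalah teman lamanya!"
)
print(remove_stop_words(sample))
```

If a later step depends on negation, a list without tidak can be passed instead, for example remove_stop_words(sample, STOP_WORDS - {"tidak"}). This only applies to the local sketch and does not change what the web service removes.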