Hello,
I'm looking for a way to cluster words. I tried the following, but the results aren't very good. I imagine the problem is the way I represent the words for the Kohonen network.
For example, say there are 10 words. I convert these words to their ASCII representation. The problem is that the words have different lengths, so I take the length of the longest word and pad the other words with 0 up to that length. But the padding seems to influence the SOM, and the clusters aren't very good.
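To make the encoding concrete, a minimal sketch of it might look like this in Python. The function names and the three sample words are made up for illustration (they are not my real data), and I am assuming the SOM compares inputs by Euclidean distance.

```python
import math

def encode_padded_ascii(words):
    """Encode each word as a list of ASCII codes, right-padded with 0
    up to the length of the longest word."""
    max_len = max(len(w) for w in words)
    return [[ord(c) for c in w] + [0] * (max_len - len(w)) for w in words]

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Illustrative sample words only.
words = ["cat", "cats", "dog"]
vectors = encode_padded_ascii(words)

d_cat_cats = euclidean(vectors[0], vectors[1])
d_cat_dog = euclidean(vectors[0], vectors[2])
print(d_cat_cats, d_cat_dog)
```

With these sample words, "cat" ends up much closer to "dog" (about 19.1) than to "cats" (115), because the difference between the padding 0 and the code for "s" (115) dominates the distance. This may be the kind of effect that is hurting the clustering.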
Is there a better approach I can use for this?
Sorry for my poor English.
Thanks,
Filipe