Shingling is an elegant clustering algorithm which can compute an approximation of Jaccard similarity in linear time. It is one of my favorite text clustering algorithm.
Here you can find a C++, STL, Boost implementation.
Nokia Lumia 925: al via i preordini in Italia
-
NStore lancia i preordini sul Nokia Lumia 925 che sarĂ acquistabile al
prezzo di 599,90 euro
7 minutes ago
nice work
ReplyDeletethe use of connected component analysis has given me some ideas for my own project,
http://github.com/matpalm/resemblance/tree/master
mat