1. Find the top 5000 strings (sorted by frequency).
2. Find the duplicates
Yahoo's Unwatchable Live Stream Proves Its Next Acquisition Should Be A
Proper Video Platform
-
[image: Screen shot 2013-05-20 at 2.25.17 PM]It's easy to forget that Yahoo
has had a long, on-again-off-again love affair with online video. But you
might...
3 minutes ago
Is an approximate solution ok? If so, I would use a spectral Bloom filter.
ReplyDeleteno i wanted to have exact
ReplyDelete