How can you identify a query session? Smart Miner: A New Framework for Mining Large Scale Web Usage Data suggests using three major components: 1. temporal visit constrains; 2. the links among pages, and 3. maximal visit paths, computed using an a-priori like algorithm. I suggest reading the paper if you want to see reasonable ideas for identifying query sessions.
What I don't like is the experimental part. A site with 1,5K unique users and 5K pages cannot be considered a Large Web site...
No comments:
Post a Comment