Thursday, November 02, 2006

[listen] blog analysis

1. Defree of distribution
  • WWW: Power Law Distribution
  • Blog: Log-Normal distribution + Power Law distribution
  • Social network: log-normal distribution
2. Small world property -> blog linkage like the six degree of seperation
3. Blog have many property between WWW and social network
4. Some tech using in the blog search: PageRank[98], HITS[99]
5. Community discovery:
  • approach: a) mutual awarness. b) ranking-based clustering method
  • emerge through the sustained action of individual bloggers, NOT the navigation of casual web surface.
6. Trend Extraction:
  • sultion: statistic, SVD, HOSVD
  • limitaion: aggregation, single trend, unstuctured data
7. Spam Blog Deection: relate to web spam detection.
  • detection method: temporal coherence, link coherence
8. Some conference about this:

No comments: