Hikmat Ullah Khan, Ali Daud, Tahir Afzal Malik.
Abstract
Social networking has revolutionized the use of the conventional web and turned the World Wide Web into a social web in which users generate their own content. This change has been made possible by social web platforms such as forums, wikis, and blogs. Blogs are increasingly used as a form of virtual communication to express opinions about an event, product, or experience, and they can reach a large audience. Users can influence others to buy a product or to adopt certain political or social views. Identifying the most influential bloggers has therefore become important for commerce, advertising, and product knowledge search. Existing approaches consider some basic features but fail to consider others, such as the importance of the blog on which a post is created. This paper presents a new metric, MIIB (Metric for Identification of Influential Bloggers), based on various features of bloggers' productivity and popularity. Productivity refers to a blogger's blogging activity, and popularity measures a blogger's influence in the blogging community. A novel module, BlogRank, captures the importance of the blog sites where bloggers create their posts. MIIB has been evaluated against the standard model and existing metrics for finding influential bloggers, using a dataset from the real-world blogosphere. The results confirm that MIIB finds the most influential bloggers more effectively.
Year: 2015 PMID: 26414063 PMCID: PMC4587377 DOI: 10.1371/journal.pone.0138359
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of Symbols used in the paper.
| Symbol | Remarks |
|---|---|
| | Set of bloggers |
| | Set of blog posts |
| | Set of blog sites |
| | Number of blog posts posted by a blogger |
| | Number of days on which blog posts were posted by a blogger |
| | Score of regular posting of a blogger |
| | Length of blog posts posted by a blogger |
| | Score of average length of the blog posts posted by a blogger |
| | Number of comments received on blog posts posted by a blogger |
| | Number of inlinks received on blog posts posted by a blogger |
| | Number of outlinks in blog posts posted by a blogger |
| | Number of bloggers b who post in a blog site |
| | Number of posts posted in a blog site |
| | Number of inlinks received by posts in a blog site |
| | Number of comments received by posts in a blog site |
| | Computed Score of Blogger |
| | Computed Score of Blogger |
| | Computed Score of Blogger |
| | Computed Score of Weblog site |
| | Final Influence Score of Blogger |
List of Features and their purpose.
| Feature No. | Feature title | Remarks |
|---|---|---|
| f1 | Activity | To measure the post initiating capability of the blogger |
| f2 | Activeness | To measure the blogger's ability to remain active in the blog |
| f3 | Consistency | To measure the consistent posting behavior of the blogger |
| f4 | Recognition | To measure how much other bloggers recognize the blogger |
| f5 | Authority | To measure how much authority is given in the blog to the blogger |
| f6 | Novelty | To measure how much novel content is posted by the blogger |
| f7 | BlogRank | To measure the significance of the blog in which the blogger posts |
| f8 | PostLength | To measure the eloquence of the content posted by the blogger |
| f9 | NormalizedPostLength | To measure the normalized quality of content posted by the blogger |
TUAW Dataset Statistics.
| Bloggers | 51 |
| Posts | 17,831 |
| Inlinks | 53,575 |
| Comments | 267,949 |
| Weblogs | 6,655 |
| Inlinks per post | 3.004 |
| Comments per post | 15.027 |
| Posts per Blogger | 349.6 |
| Average Post Length | 1321.225 |
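The per-post averages in the table follow directly from the raw counts. The short check below is only a sketch of that arithmetic; the variable and function names are illustrative and do not come from the paper.

```python
# Raw counts taken from the TUAW statistics table.
posts = 17_831
inlinks = 53_575
comments = 267_949


def per_post(total: int, n_posts: int) -> float:
    """Return the average count per post."""
    if n_posts <= 0:
        raise ValueError("n_posts must be positive")
    return total / n_posts


inlinks_per_post = per_post(inlinks, posts)
comments_per_post = per_post(comments, posts)

# Both values agree with the table to within rounding.
print(f"Inlinks per post:  {inlinks_per_post:.4f}")
print(f"Comments per post: {comments_per_post:.4f}")
```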
List of top bloggers based on each single feature.
| Rank | F1-noofposts | F2-noofdays | F3-consistency | F4-com | F5-inlink | F6-outlink | F7-blogrank | F8-len | F9-avglength |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Scott McNulty | Scott McNulty | Barb Dybwad | Scott McNulty | Cory Bohon | Brad Hill | Scott McNulty | Erica Sadun | Weblogs, Inc. |
| 2 | Dave Caolo | Dave Caolo | David Chartier | Erica Sadun | Erica Sadun | C.K. Sample, III | Erica Sadun | David Chartier | Chris Ullrich |
| 3 | David Chartier | David Chartier | Sean Bonner | Dave Caolo | Robert Palmer | Michael Sciannamea | Dave Caolo | Scott McNulty | Pariah S. Burke |
| 4 | Erica Sadun | Erica Sadun | C.K. Sample, III | David Chartier | Dave Caolo | Greg Scher | David Chartier | Dave Caolo | Jason Clarke |
| 5 | C.K. Sample, III | Michael Rose | Erica Sadun | Victor Agreda, Jr. | Mike Schramm | Dori Smith | Cory Bohon | Mat Lu | Christina Warren |
| 6 | Mat Lu | Mat Lu | Scott McNulty | Mat Lu | Michael Rose | David Touve | Victor Agreda, Jr. | Michael Rose | Brett Terpstra |
| 7 | Laurie A. Duncan | Cory Bohon | Dave Caolo | Cory Bohon | Mat Lu | Marc Orchant | Mat Lu | C.K. Sample, III | Scott Granneman |
| 8 | Cory Bohon | Laurie A. Duncan | Robert Palmer | Michael Rose | Steven Sande | Damien Barrett | Michael Rose | Laurie A. Duncan | Joshua Ellis |
| 9 | Michael Rose | Mike Schramm | Mat Lu | Mike Schramm | Scott McNulty | Jan Kabili | Mike Schramm | Cory Bohon | Caryn Coleman |
Fig 1. The top influential bloggers based on single features.
A comparison of Top Results of modules, MIIB vs the baseline.
| Rank | Productivity | Popularity | Quality | Baseline | MIIB |
|---|---|---|---|---|---|
| 1 | Scott McNulty | Scott McNulty | Scott McNulty | Cory Bohon | Scott McNulty |
| 2 | Dave Caolo | Erica Sadun | Erica Sadun | Robert Palmer | Erica Sadun |
| 3 | David Chartier | Dave Caolo | Dave Caolo | Mat Lu | Dave Caolo |
| 4 | Erica Sadun | Cory Bohon | David Chartier | Christina Warren | David Chartier |
| 5 | C.K. Sample, III | David Chartier | Cory Bohon | Dave Caolo | Cory Bohon |
| 6 | Mat Lu | Victor Agreda, Jr. | Victor Agreda, Jr. | Chris Ullrich | Victor Agreda, Jr. |
| 7 | Laurie A. Duncan | Mat Lu | Mat Lu | Steven Sande | Mat Lu |
| 8 | Cory Bohon | Michael Rose | Michael Rose | Michael Rose | Michael Rose |
| 9 | Michael Rose | Mike Schramm | Mike Schramm | Victor Agreda, Jr. | Mike Schramm |
| 10 | Mike Schramm | Robert Palmer | Robert Palmer | Jason Clarke | Robert Palmer |
A comparison of Top results of MIIB vs Existing Metrics.
| Rank | MIBI | MIBIX | MIIB |
|---|---|---|---|
| 1 | Cory Bohon | Cory Bohon | Scott McNulty |
| 2 | Robert Palmer | Robert Palmer | Erica Sadun |
| 3 | Steven Sande | Steven Sande | Dave Caolo |
| 4 | Erica Sadun | Erica Sadun | David Chartier |
| 5 | Michael Rose | Christina Warren | Cory Bohon |
| 6 | Mike Schramm | Michael Rose | Victor Agreda, Jr. |
| 7 | Christina Warren | Mike Schramm | Mat Lu |
| 8 | Dave Caolo | Mat Lu | Michael Rose |
| 9 | Mat Lu | Dave Caolo | Mike Schramm |
| 10 | Brett Terpstra | Brett Terpstra | Robert Palmer |
A comparison of MIIB vs Existing Metrics using Evaluation Measures.
| | OSim | Spearman Correlation | Kendall Correlation |
|---|---|---|---|
| | 1 | 0.9515 | 0.8667 |
| | 0.8 | 0.2242 | 0.2 |
| | 0.8 | 0.22 | 0.16 |
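As an illustration of the Spearman measure named in the table, the sketch below computes Spearman's rho for two rankings of the same items using the standard no-ties formula, 1 - 6Σd²/(n(n² - 1)). The input lists are the Popularity and MIIB columns of the earlier module comparison table. The function name is hypothetical, and the source does not say whether the authors computed their values from rank lists or from raw scores, so the result is not expected to reproduce any figure reported here.

```python
def spearman_rho(ranking_a, ranking_b):
    """Spearman's rho for two rankings of the same items (no ties)."""
    if len(set(ranking_a)) != len(ranking_a) or set(ranking_a) != set(ranking_b):
        raise ValueError("rankings must contain the same distinct items")
    n = len(ranking_a)
    position_b = {item: i for i, item in enumerate(ranking_b)}
    # Sum of squared rank differences over all items.
    d_squared = sum((i - position_b[item]) ** 2 for i, item in enumerate(ranking_a))
    return 1 - 6 * d_squared / (n * (n * n - 1))


popularity = [
    "Scott McNulty", "Erica Sadun", "Dave Caolo", "Cory Bohon",
    "David Chartier", "Victor Agreda, Jr.", "Mat Lu", "Michael Rose",
    "Mike Schramm", "Robert Palmer",
]
miib = [
    "Scott McNulty", "Erica Sadun", "Dave Caolo", "David Chartier",
    "Cory Bohon", "Victor Agreda, Jr.", "Mat Lu", "Michael Rose",
    "Mike Schramm", "Robert Palmer",
]

rho = spearman_rho(popularity, miib)
print(rho)
```

The two lists differ only by one swap of adjacent positions, so the sum of squared differences is 2 and rho stays close to 1.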
Fig 2. Module-wise Comparative Analysis.
Fig 3. Comparative analysis of the Modules of MIIB to find Top Influential Bloggers.
Pearson Rank-order Correlation of the modules and the MIIB.
| Comparison between | Dataset | Top 30 | Top 20 | Top 10 |
|---|---|---|---|---|
| Productivity vs Popularity | 0.8842 | 0.8641 | 0.8316 | 0.8362 |
| Productivity vs BlogRank | 0.8962 | 0.8784 | 0.8502 | 0.8625 |
| Popularity vs BlogRank | 0.9997 | 0.9996 | 0.9994 | 0.9988 |
| Productivity vs MIIB | 0.8962 | 0.8784 | 0.8502 | 0.8625 |
| Popularity vs MIIB | 0.9997 | 0.9996 | 0.9994 | 0.9988 |
| BlogRank vs MIIB | 1 | 1 | 1 | 1 |
Kendall Correlation of the modules and the MIIB.
| Comparison between | Dataset | Top 30 | Top 20 | Top 10 |
|---|---|---|---|---|
| Productivity vs Popularity | 0.70732 | 0.61839 | 0.62105 | 0.64444 |
| Productivity vs BlogRank | 0.73415 | 0.65057 | 0.63158 | 0.68889 |
| Popularity vs BlogRank | 0.95406 | 0.96782 | 0.98947 | 0.95556 |
| Productivity vs MIIB | 0.73415 | 0.65057 | 0.63158 | 0.68889 |
| Popularity vs MIIB | 0.95406 | 0.96782 | 0.98947 | 0.95556 |
| BlogRank vs MIIB | 1 | 1 | 1 | 1 |
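A minimal sketch of the Kendall correlation between two rankings of the same items, counting concordant and discordant pairs (tau-a, no ties). It is applied to the top-10 Popularity and MIIB lists from the module comparison table. The function name is hypothetical, and the assumption that the measure is computed directly on the top-10 lists is ours rather than something the source states. Under that assumption the lists differ by one swapped pair, giving (44 - 1)/45 ≈ 0.95556, the same value as the Top 10 entry for Popularity vs MIIB in the table above.

```python
from itertools import combinations


def kendall_tau(ranking_a, ranking_b):
    """Kendall's tau-a for two rankings of the same items (no ties)."""
    if len(set(ranking_a)) != len(ranking_a) or set(ranking_a) != set(ranking_b):
        raise ValueError("rankings must contain the same distinct items")
    pos_a = {item: i for i, item in enumerate(ranking_a)}
    pos_b = {item: i for i, item in enumerate(ranking_b)}
    concordant = discordant = 0
    for x, y in combinations(ranking_a, 2):
        # A pair is concordant when both rankings order it the same way.
        if (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) > 0:
            concordant += 1
        else:
            discordant += 1
    n_pairs = len(ranking_a) * (len(ranking_a) - 1) // 2
    return (concordant - discordant) / n_pairs


popularity = [
    "Scott McNulty", "Erica Sadun", "Dave Caolo", "Cory Bohon",
    "David Chartier", "Victor Agreda, Jr.", "Mat Lu", "Michael Rose",
    "Mike Schramm", "Robert Palmer",
]
miib = [
    "Scott McNulty", "Erica Sadun", "Dave Caolo", "David Chartier",
    "Cory Bohon", "Victor Agreda, Jr.", "Mat Lu", "Michael Rose",
    "Mike Schramm", "Robert Palmer",
]

tau = kendall_tau(popularity, miib)
print(round(tau, 5))
```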
OSim of the modules and the MIIB.
| Comparison between | Dataset | Top 30 | Top 20 | Top 10 |
|---|---|---|---|---|
| Productivity vs Popularity | 1 | 0.93333 | 0.8 | 0.8 |
| Productivity vs BlogRank | 1 | 0.93333 | 0.85 | 0.8 |
| Popularity vs BlogRank | 1 | 1 | 0.95 | 1 |
| Productivity vs MIIB | 1 | 0.93333 | 0.85 | 0.8 |
| Popularity vs MIIB | 1 | 1 | 0.95 | 1 |
| BlogRank vs MIIB | 1 | 1 | 1 | 1 |
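The source does not define OSim, so the sketch below assumes the common overlap definition: the fraction of items shared by the top-k entries of two ranked lists, ignoring order. The function name is hypothetical. Applied to the top-10 Productivity and Popularity lists from the module comparison table, the two lists share 8 of 10 bloggers, which agrees with the Top 10 value of 0.8 reported above for Productivity vs Popularity.

```python
def osim(ranking_a, ranking_b, k):
    """Overlap similarity: share of common items in the two top-k lists."""
    if k <= 0 or k > min(len(ranking_a), len(ranking_b)):
        raise ValueError("k must be between 1 and the shorter list length")
    top_a = set(ranking_a[:k])
    top_b = set(ranking_b[:k])
    return len(top_a & top_b) / k


productivity = [
    "Scott McNulty", "Dave Caolo", "David Chartier", "Erica Sadun",
    "C.K. Sample, III", "Mat Lu", "Laurie A. Duncan", "Cory Bohon",
    "Michael Rose", "Mike Schramm",
]
popularity = [
    "Scott McNulty", "Erica Sadun", "Dave Caolo", "Cory Bohon",
    "David Chartier", "Victor Agreda, Jr.", "Mat Lu", "Michael Rose",
    "Mike Schramm", "Robert Palmer",
]

overlap_top10 = osim(productivity, popularity, 10)
print(overlap_top10)
```

Because OSim ignores order within the top k, it can equal 1 even when a rank correlation on the same lists is below 1.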