Andrea Fronzetti Colladon, Maurizio Naldi.
Abstract
TV series represent a growing sector of the entertainment industry. Being able to predict their performance allows a broadcasting network to better focus the high investment needed for their preparation. In this paper, we consider a well-known TV series, The Big Bang Theory, to identify factors leading to its success. The factors considered are mostly related to the script, such as the characteristics of dialogues (e.g., length, language complexity, sentiment), while performance is measured by the reviews submitted by viewers (namely, the number of reviews as a measure of popularity and the viewers' ratings as a measure of appreciation). Through correlation and regression analysis, two sets of predictors are identified, one for appreciation and one for popularity. In particular, the episode number, the percentage of male viewers, the language complexity, and the text length emerge as the best predictors for popularity, while the percentage of male viewers and the language complexity, together with the number of we-words and the concentration of dialogues, are the best choice for appreciation.
Year: 2019 PMID: 31751391 PMCID: PMC6874063 DOI: 10.1371/journal.pone.0225306
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Distribution of average ratings per episode.
Fig 2. Time evolution of average ratings.
Fig 3. Time evolution of voters.
Fig 4. Distribution of episode length in words.
Fig 5. Standard deviation of frequency under Zipf law.
Fig 6. Dialogue graph for Episode 1 of Season 1.
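The concentration of dialogues mentioned in the abstract appears in the tables as an HHI (Herfindahl-Hirschman) concentration index, which is the sum of squared shares. The sketch below is only an illustration: it assumes each character's share is their fraction of spoken lines, which may not be the unit the authors used, and the function name `dialogue_hhi` and the sample speaker list are hypothetical.

```python
from collections import Counter


def dialogue_hhi(speakers):
    """Return the HHI of dialogue shares: the sum of squared per-speaker shares.

    `speakers` holds one entry per spoken line (assumed unit of dialogue).
    The result is 1.0 when a single character speaks every line and
    approaches 1/k when k characters speak equally often.
    """
    counts = Counter(speakers)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("at least one dialogue line is required")
    return sum((n / total) ** 2 for n in counts.values())


# Hypothetical four-line exchange: shares are 0.5, 0.25 and 0.25.
sample_lines = ["Sheldon", "Leonard", "Sheldon", "Penny"]
print(dialogue_hhi(sample_lines))
```

A higher value means the dialogue is dominated by fewer characters.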
Spearman correlation coefficient for rating (†p < 0.001; ⊗p < 0.01; *p < 0.05).
| Predictor | Spearman |
|---|---|
| Voters | 0.783⊗ |
| Percentage of males | -0.577⊗ |
| Complexity | 0.316⊗ |
| I-words | -0.302⊗ |
| We-words | -0.277⊗ |
| HHI Concentration Index | 0.258⊗ |
| Sentiment | -0.165⊗ |
| No. of words | 0.158* |
| Episode | -0.011 |
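For readers who want to reproduce coefficients of this kind, Spearman's correlation can be sketched as the Pearson correlation of the ranks. The version below is a minimal standard-library sketch that assumes ties receive average ranks; the helper names and the sample data are hypothetical, and it does not compute the p-values behind the significance markers.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-episode values, not taken from the paper.
ratings = [8.1, 8.4, 7.9, 8.6, 8.2]
voters = [2100, 2500, 1900, 2400, 2300]
print(spearman(ratings, voters))
```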
Spearman correlation coefficient for voters (†p < 0.001; ⊗p < 0.01; *p < 0.05).
| Predictor | Spearman |
|---|---|
| Rating | 0.783⊗ |
| Percentage of males | -0.601⊗ |
| Complexity | 0.484⊗ |
| HHI Concentration Index | 0.432⊗ |
| Episode | -0.379⊗ |
| I-words | -0.338⊗ |
| No. of words | 0.308* |
| Sentiment | -0.254⊗ |
| We-words | -0.245⊗ |
Slopes and variances in the multilevel model for rating (†p < 0.001; ⊗p < 0.01; *p < 0.05).
| Predictor | Slopes (one per model that includes the predictor) |
|---|---|
| Episode | 0.003; 0.0042; 0.0028 |
| Male percent. | -14.0†; -15.1† |
| Complexity | 0.0036; 0.0057*; 0.0066 |
| Sentiment | -0.51; -1.43; -1.11 |
| No. of words | -0.00003; 0.00008; 0.00006 |
| I-words | -0.0012; -0.0015; -0.0018 |
| We-words | -0.0038; -0.0085†; -0.0065⊗ |
| HHI | -0.84; -1.47⊗; -1.48* |
| Amy | 0.0053; -0.001; 0.0043 |
| Bernadette | -0.0034; -0.0003; -0.0044 |
| Emily | -0.0005; 0.0037; -0.0005 |
| Howard | 0.0005; 0.0015; 0.0006 |
| Leonard | 0.001; 0.0019; 0.0014 |
| Leslie Winkle | -0.0066; -0.0061; -0.0099 |
| Penny | -0.0006; -0.0013; -0.0005 |
| Raj | -0.0028; 0.0028; -0.0015 |
| Sheldon | -0.0014; 0.001; 0.0004 |
| Stuart | -0.0103; -0.0148*; -0.016* |

| Model | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Intercept | 8.11† | 8.077† | 19.79† | 8.30† | 8.157† | 8.368† | 8.282† | 8.157† | 21.772† | 9.105† |
| L2 Variance | 0.0786 | 0.0793 | 0.0411 | 0.0702 | 0.0794 | 0.0617 | 0.0934 | 0.0825 | 0.0185 | 0.0544 |
| L1 Variance | 0.105 | 0.105 | 0.085 | 0.104 | 0.105 | 0.104 | 0.103 | 0.102 | 0.076 | 0.095 |
Fig 7. Cohen's f² for ratings.
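Cohen's f² is an effect-size measure derived from the proportion of variance explained, R². The sketch below shows the two standard textbook forms (global and local); how the authors obtained R² for their multilevel models is not stated in this record, so the function names and the R² inputs here are hypothetical.

```python
def cohens_f2(r_squared):
    """Global Cohen's f2 for a model explaining a share r_squared of variance."""
    if not 0 <= r_squared < 1:
        raise ValueError("r_squared must lie in [0, 1)")
    return r_squared / (1 - r_squared)


def local_cohens_f2(r2_full, r2_reduced):
    """Local f2: effect of the predictors present only in the full model.

    r2_full    -- R^2 of the model containing the predictor(s) of interest
    r2_reduced -- R^2 of the same model with those predictor(s) removed
    """
    if not 0 <= r2_reduced <= r2_full < 1:
        raise ValueError("expected 0 <= r2_reduced <= r2_full < 1")
    return (r2_full - r2_reduced) / (1 - r2_full)


# Hypothetical R^2 values, chosen only to exercise the formulas.
print(cohens_f2(0.5))
print(local_cohens_f2(0.5, 0.4))
```

By Cohen's commonly cited guidelines, values around 0.02, 0.15 and 0.35 are read as small, medium and large effects.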
Slopes and variances in the parsimonious model for rating (†p < 0.001; ⊗p < 0.01; *p < 0.05).
| Predictor | Weight |
|---|---|
| Male percentage | −14.49† |
| Complexity | 0.00579 |
| We-words | −0.0065⊗ |
| HHI | −1.463⊗ |
| Stuart | −0.014* |
| Intercept | 20.477† |
| L2 Variance | 0.030 |
| L1 Variance | 0.077 |
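To make the weights concrete, the fixed part of the parsimonious rating model can be evaluated as the intercept plus the weighted sum of the predictors. The sketch below is illustrative only: it ignores the random (group-level) component of the multilevel model, and it plugs in the bottom-90% averages from the t-test table as sample inputs, assuming those averages are on the same scale as the regression predictors. The dictionary keys and function name are hypothetical.

```python
# Weights copied from the parsimonious model for rating.
INTERCEPT = 20.477
WEIGHTS = {
    "male_percentage": -14.49,
    "complexity": 0.00579,
    "we_words": -0.0065,
    "hhi": -1.463,
    "stuart": -0.014,
}


def predict_rating(features):
    """Fixed-effects prediction: intercept plus weighted sum of predictors."""
    return INTERCEPT + sum(WEIGHTS[name] * features[name] for name in WEIGHTS)


# Illustrative inputs: bottom-90% averages from the t-test table
# (assumed, not confirmed, to share the scale of the model predictors).
example_episode = {
    "male_percentage": 0.833,
    "complexity": 26.02,
    "we_words": 26.35,
    "hhi": 0.196,
    "stuart": 1.52,
}
print(round(predict_rating(example_episode), 2))  # about 8.08
```

The large negative weight on the male percentage is offset by the large intercept, which is why the prediction lands in the usual rating range.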
T-test results for top 10% versus bottom 90% comparison.
| Predictor | Average (Top 10%) | Average (Bottom 90%) | p-value |
|---|---|---|---|
| Episode | 12.77 | 12.01 | 0.593 |
| Male percentage | 0.822 | 0.833 | < 0.001 |
| Sentiment | 0.550 | 0.553 | 0.328 |
| No. of words | 1546 | 1569 | 0.534 |
| Complexity | 31.23 | 26.02 | 0.015 |
| HHI | 0.222 | 0.196 | 0.030 |
| We-words | 19.08 | 26.35 | < 0.001 |
| I-words | 119.50 | 130.36 | 0.040 |
| Amy | 2.65 | 7.19 | 0.002 |
| Bernadette | 3.62 | 5.48 | 0.136 |
| Emily | 0 | 0.55 | 0.001 |
| Howard | 14.19 | 12.88 | 0.502 |
| Leonard | 24.46 | 20.85 | 0.157 |
| Leslie Winkle | 0.23 | 0.28 | 0.898 |
| Penny | 17.04 | 17.06 | 0.992 |
| Raj | 9.31 | 9.56 | 0.866 |
| Sheldon | 28.50 | 25.25 | 0.205 |
| Stuart | 0.42 | 1.52 | 0.001 |
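A two-group comparison like the one above can be sketched with a two-sample t-test. The record does not say which variant the authors used, so the code below assumes Welch's unequal-variance test, which suits groups of very different sizes. It returns only the t statistic and the Welch-Satterthwaite degrees of freedom, because the p-value needs the t-distribution CDF, which the Python standard library does not provide. The function name and sample data are hypothetical.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for Welch's t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error contributed by each group (sample variance / n).
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    t_stat = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(
        se2_a + se2_b
    )
    # Welch-Satterthwaite approximation of the degrees of freedom.
    dof = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t_stat, dof


# Hypothetical per-episode values for a small top group and a larger rest group.
top_group = [30.1, 33.4, 29.8, 31.6]
rest_group = [25.2, 27.9, 24.4, 26.8, 26.1, 25.7, 27.0, 24.9]
t_stat, dof = welch_t(top_group, rest_group)
print(t_stat, dof)
```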
Slopes and variances in the multilevel model for voters (†p < 0.001; ⊗p < 0.01; *p < 0.05).
| Predictor | Slopes (one per model that includes the predictor) |
|---|---|
| Episode | -16.277†; -16.743†; -17.813† |
| Male percent. | -13673†; -13430† |
| Complexity | 8.195†; 10.584†; 11.57† |
| Sentiment | -2131; 180.7; 384 |
| No. of words | 0.07; -0.331*; -0.346* |
| I-words | -0.485; 0.429; 0.03 |
| We-words | -1.537; -2.085; -0.803 |
| HHI | 529; -866; -863 |
| Amy | 2.454; -5.045; -0.297 |
| Bernadette | 0.257; 2.277; -1.192 |
| Emily | -12.593; -4.506; -8.422 |
| Howard | -0.072; -1.946; -2.632 |
| Leonard | 4.248*; 1.214; 0.676 |
| Leslie Winkle | -5.055; -12.967; -16.853 |
| Penny | 3.902; -0.335; 0.402 |
| Raj | -3.462; 1.039; -2.480 |
| Sheldon | 2.87; -0.206; -0.65 |
| Stuart | -7.68; -9.256; -10.0 |

| Model | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Intercept | 2008† | 2203† | 13378† | 2963† | 1898† | 2109† | 1901† | 1808† | 13723† | 2515⊗ |
| L2 Variance | 64304 | 57977 | 32462 | 43419 | 61448 | 59905 | 56154 | 44691 | 23582 | 52497 |
| L1 Variance | 78890 | 66730 | 59706 | 73750 | 78922 | 78804 | 78765 | 74747 | 41077 | 57204 |
Fig 8. Cohen's f² for voters.
Slopes and variances in the parsimonious model for voters (†p < 0.001; ⊗p < 0.01).
| Predictor | Weight |
|---|---|
| Episode number | −15.36† |
| Male percentage | −12441† |
| Complexity | 9.12† |
| No. of words | −0.338⊗ |
| Intercept | 12823† |
| L2 Variance | 27123 |
| L1 Variance | 44479 |
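The two variance rows can be combined into an intraclass correlation coefficient (ICC), the share of the unexplained variance that lies between groups. The sketch below assumes "L2 Variance" is the between-group (level-2) variance and "L1 Variance" is the within-group residual variance, following common multilevel terminology; the function name is hypothetical.

```python
def intraclass_correlation(l2_variance, l1_variance):
    """ICC: between-group variance as a share of total residual variance."""
    total = l2_variance + l1_variance
    if total <= 0:
        raise ValueError("total variance must be positive")
    return l2_variance / total


# Variances copied from the two parsimonious models.
icc_rating = intraclass_correlation(0.030, 0.077)
icc_voters = intraclass_correlation(27123, 44479)
print(round(icc_rating, 3), round(icc_voters, 3))
```

Under these assumptions, roughly 28% of the remaining variance in ratings and 38% of the remaining variance in voters sits at the group level.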