| Literature DB >> 35469124 |
Ala Mughaid1, Shadi Al-Zu'bi2, Ahmed Al Arjan1, Rula Al-Amrat1, Rathaa Alajmi1, Raed Abu Zitar3, Laith Abualigah4,5.
Abstract
People worldwide suffer from fake news in many life aspects, healthcare, transportation, education, economics, and many others. Therefore, many researchers have considered seeking techniques for automatically detecting fake news in the last decade. The most popular news agencies use e-publishing on their websites; even websites can publish any news they want. However, thus before quotation any news from a website, there should be a close look at news resource ranking by using a trusted websites classifier, such as the website world rank, which reflects the repute of these websites. This paper uses the world rank of news websites as the main factor of news accuracy by using two widespread and trusted websites ranking. Moreover, a secondary factor is proposed to compute the news accuracy similarity by comparing the current news with fakes news and getting the possible news accuracy. Experiments results are conducted on several benchmark datasets. The results showed that the proposed method got promising results compared to other comparative methods in defining the news accuracy.Entities:
Keywords: Artificial intelligence; Cyber privacy; Cybersecurity; Fake news; Social media accuracy; Text processing
Year: 2022 PMID: 35469124 PMCID: PMC9021563 DOI: 10.1007/s00500-022-07080-1
Source DB: PubMed Journal: Soft comput ISSN: 1432-7643 Impact factor: 3.732
Fig. 1Fake news dataset features
Ratio Rank API of news website
| Rank of 10 ( R1 ) | Ratio % |
|---|---|
| 10 | 25 |
| 9 | 21 |
| 8 | 19 |
| 7 | 17 |
| 6 | 15 |
| 5 | 13 |
| 4 | 9 |
| 3 | 7 |
| 2 | 5 |
| 1 | 1 |
| 0 | 0 |
Ratio Alexa Rank of news website
| Alexa rank (R2) | Ratio% |
|---|---|
| From >= 1 k To <= 100 k | 25 |
| From >= 200 k To < 100 k | 21 |
| From >= 300 k To < 200 k | 19 |
| From >= 400 k To < 300 k | 17 |
| From >= 500 k To < 400 k | 15 |
| From >= 600 k To < 700 k | 13 |
| From >= 700 k To < 800 k | 9 |
| From >= 800 k To < 900 k | 7 |
| From >= 900 k To <= 1 m | 5 |
| More than 1 m | 1 |
| None | 0 |
Fig. 2CNN.com website rank (API)
Fig. 3CNN.com website rank (Alexa)
RankAPI and Alexa classification criteria
| Rank API factor | Rank alexa factor |
|---|---|
| Computes the quality of content and backlinks | Using web classification by dynamic data inside the website to classify the foremost popularity websites |
| Rank score in this website (0–10) | |
| 10 highest page rank | |
| 0 lowest page rank |
TF matrix
| Term1 | Term2 | ... | Term-N | |
|---|---|---|---|---|
| Text-1 | A1 | A2 | ... | AN |
| Text-2 | B1 | B2 | ... | BN |
Aggregated News info
| News title | News Source | News URL |
|---|---|---|
| Israeli emergency services responding to a synagogue bleacher collapse described as a ’mass casualty event’ | CNN | |
| Samsung’s leaked Galaxy A22 may be its most affordable 5G phone to date | Engadget | |
| Vaccination centre ponders cutting days to ease nurses’ workload | Stuff.co.nz | |
| Starship SN15 patiently awaits a decision | NASA Spaceflight | |
| John Kiely launches attack on GAA over rules and accuses Galway of simulation | The42 | |
| Imperialism and national liberation | Socialist worker | |
| Zimbabwe says it has enough cash to buy record maize crop | Insiderzim | |
| MC GHIE COFFEE SHOP | Pattaya people | |
| HCMC to further promote caretaking for poor, vulnerable people: City Party Chief | sggpnews | |
| Navi Pillay explains ‘human rights’ limitations in Geneva on Tamil genocide | Tamil net |
News websites domain names ranking
| News# | Domain | Alexa-rank | Other-rank |
|---|---|---|---|
| 1 | 96 | 10/10 | |
| 2 | 1216 | 10/10 | |
| 3 | 4094 | 10/10 | |
| 4 | 46803 | 10/10 | |
| 5 | 70248 | 10/10 | |
| 6 | 1102118 | 1/10 | |
| 7 | 8974861 | 1/10 | |
| 8 | 6287109 | 1/10 | |
| 9 | 1261037 | 1/10 | |
| 10 | 1243830 | 1/10 |
News websites-ranking R (based on Equation 1)
| News# | R1-ratio (Alexa) 25% | R2-ratio (Rankapi) 25% | R=R1+R2 50% |
|---|---|---|---|
| 1 | 25 | 25 | 50 |
| 2 | 25 | 25 | 50 |
| 3 | 25 | 25 | 50 |
| 4 | 25 | 25 | 50 |
| 5 | 25 | 25 | 50 |
| 6 | 1 | 1 | 2 |
| 7 | 1 | 1 | 2 |
| 8 | 1 | 1 | 2 |
| 9 | 1 | 1 | 2 |
| 10 | 1 | 1 | 2 |
Cosine-Similarity score
| News# | Similarity Count of 5000 | TP 100% | FP 100% | FP/2 50% |
|---|---|---|---|---|
| 1 | 2929 | 9.3 | 90.7 | 45.4 |
| 2 | 5000 | 8.5 | 91.5 | 45.7 |
| 3 | 5000 | 17.8 | 82.2 | 41.1 |
| 4 | 5000 | 7 | 93 | 46.5 |
| 5 | 5000 | 7.3 | 92.7 | 46.3 |
| 6 | 5000 | 18 | 82 | 41.0 |
| 7 | 5000 | 20 | 80 | 40.0 |
| 8 | 5000 | 24.3 | 75.7 | 37.8 |
| 9 | 5000 | 22.4 | 77.6 | 38.8 |
| 10 | 5000 | 20 | 80 | 40 |
Fig. 4Distribute TP and FP
Time of similarity
| News# | Duration time/Seconds |
|---|---|
| 1 | 24 Seconds |
| 2 | 26 Seconds |
| 3 | 30 Seconds |
| 4 | 39 Seconds |
| 5 | 31 Seconds |
| 6 | 35 Seconds |
| 7 | 30 Seconds |
| 8 | 33 Seconds |
| 9 | 38 Seconds |
| 10 | 32 Seconds |
Fig. 5Similarity duration time
The final news accuracy score
| News# | R-Score 50% | CS-Score 50% | Accuracy Score= R+CS 100% | Accuracy Label |
|---|---|---|---|---|
| 1 | 50 | 45.4 | 95.4 | Accurate |
| 2 | 50 | 45.7 | 95.7 | Accurate |
| 3 | 50 | 41.1 | 91.1 | Accurate |
| 4 | 50 | 46.5 | 96.5 | Accurate |
| 5 | 50 | 46.3 | 96.3 | Accurate |
| 6 | 2 | 41.0 | 43 | Inaccurate |
| 7 | 2 | 40.0 | 42 | Inaccurate |
| 8 | 2 | 37.8 | 39.8 | Inaccurate |
| 9 | 2 | 38.8 | 40.8 | Inaccurate |
| 10 | 2 | 40 | 42 | Inaccurate |
Fig. 6The final news accuracy score
Comparison between Machine Learning and Proposed system
| News title | News source | Machine-learning prediction | Similarity with fake news | TP | News Accuracy |
|---|---|---|---|---|---|
| 1- Asteroids could be approaching Earth undetected thanks to quirk of the planet’s rotation | Pharmacy times.com | Fake News | 0.107605 | 0.892395 | 95% |
| 2- AT &T Stadium design plays role with sun, Jumbotron affecting multiple plays | Philippine Star | Real News | 0.169867 | 0.830133 | 93% |
| 3- Brother confirms death of animal lover in Tonga tsunami | Independent | Real News | 0.162772 | 0.837228 | 96% |
| 4- Building the off-Earth economy | The Indian Express | Real News | 0.0 93613 | 0.9 06387 | 93% |
| 5- COVID-19 Daily Bulletin | The Globe And Mail | Fake News | 0.129035 | 0.870965 | 94% |
| 6- Dutch TV suspends show over sexual misconduct claims | Reuters | Real News | 0.142656 | 0 .857344 | 92% |
| 7- Everton want Belgium manager and contact Belgian Football Association | BBC News | Real News | 0.120857 | 0.879143 | 93% |
| 8- India Fighting Another Wave While Maintaining Economic Growth, Says PM At World Economic Forum | Associated Press | Real News | 0.142416 | 0.857584 | 93% |
| 9- Nightclub helping police over missing woman | The Punch | Fake News | 0.047605 | 0.952395 | 93% |
| 10- Police did not receive any rally application | Bring Me the News | Fake News | 0.057764 | 0.942236 | 93% |
Comparing previous some works with the method used in the proposed search
| Source number | Use cosine similarity | Use the news site rating | Compare the news with fake news | Dealing with live news | Determine the source of news |
|---|---|---|---|---|---|
| [11] | No | No | Yes | No | Yes |
| [12] | Yes | No | Yes | No | Yes |
| [13] | Yes | No | Yes | No | Yes |
| [14] | No | No | Yes | No | Yes |
| [15] | No | No | Yes | No | Yes |
| [16] | Yes | No | Yes | No | Yes |