Comparing compression models for authorship attribution.

W Oliveira, E Justino, L S Oliveira.

Abstract

In this paper we compare different compression models for authorship attribution. To this end, our experiments considered three types of compressors, Lempel-Ziv (GZip), block-sorting (BZip), and statistical (PPM), along with two similarity measures. We also analyze two attribution methods. Through a series of experiments on two databases, we show that all the compressors behave similarly, but the similarity measures can vary considerably depending on the strategy used for authorship attribution. Our results corroborate the literature in the sense that compression models are a good alternative for authorship attribution, surpassing traditional pattern recognition systems based on classifiers and feature extraction.
Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
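
As a rough illustration of the kind of compression-based attribution the abstract describes, the sketch below scores an unknown text against one reference text per candidate author and picks the closest. It rests on several assumptions: the abstract does not name its two similarity measures, so the normalized compression distance (NCD) is used here only as a common example of such a measure; Python's `zlib` (DEFLATE, the algorithm behind GZip) and `bz2` stand in for the Lempel-Ziv and block-sorting compressors; PPM is left out because the standard library has no PPM implementation; and the function names `compressed_size`, `ncd`, and `attribute` are hypothetical.

```python
import bz2
import zlib


def compressed_size(data: bytes, compressor=zlib.compress) -> int:
    """Length in bytes of `data` after compression."""
    return len(compressor(data))


def ncd(x: bytes, y: bytes, compressor=zlib.compress) -> float:
    """Normalized compression distance between x and y.

    Assumption: NCD is used as an example similarity measure; it is not
    confirmed to be one of the two measures evaluated in the paper.
    Lower values mean the two texts share more compressible structure.
    """
    cx = compressed_size(x, compressor)
    cy = compressed_size(y, compressor)
    cxy = compressed_size(x + y, compressor)
    return (cxy - min(cx, cy)) / max(cx, cy)


def attribute(unknown: bytes, profiles: dict, compressor=zlib.compress) -> str:
    """Return the author whose reference text is closest to `unknown`.

    `profiles` maps an author name to the concatenation of that author's
    known texts (a hypothetical "one profile per author" strategy).
    """
    return min(profiles, key=lambda author: ncd(unknown, profiles[author], compressor))
```

Passing `compressor=bz2.compress` swaps in the block-sorting compressor without changing the rest of the sketch, which makes it easy to compare compressors on the same data.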

Year:  2013        PMID: 23597746     DOI: 10.1016/j.forsciint.2013.02.025

Source DB:  PubMed          Journal:  Forensic Sci Int        ISSN: 0379-0738            Impact factor:   2.395


  2 in total

Review 1.  Toward understanding the communication in sperm whales.

Authors:  Jacob Andreas; Gašper Beguš; Michael M Bronstein; Roee Diamant; Denley Delaney; Shane Gero; Shafi Goldwasser; David F Gruber; Sarah de Haas; Peter Malkin; Nikolay Pavlov; Roger Payne; Giovanni Petri; Daniela Rus; Pratyusha Sharma; Dan Tchernov; Pernille Tønnesen; Antonio Torralba; Daniel Vogt; Robert J Wood
Journal:  iScience       Date:  2022-05-13

2.  Visual Analysis of Research Paper Collections Using Normalized Relative Compression.

Authors:  Pere-Pau Vázquez
Journal:  Entropy (Basel)       Date:  2019-06-21       Impact factor: 2.524
