Literature DB >> 21868845

n-Gram Statistics for Natural Language Understanding and Text Processing.

C Y Suen1.   

Abstract

n-gram (n = 1 to 5) statistics and other properties of the English language were derived for applications in natural language understanding and text processing. They were computed from a well-known corpus composed of 1 million word samples. Similar properties were also derived from the most frequent 1000 words of three other corpuses. The positional distributions of n-grams obtained in the present study are discussed. Statistical studies on word length and trends of n-gram frequencies versus vocabulary are presented. In addition to a survey of n-gram statistics found in the literature, a collection of n-gram statistics obtained by other researchers is reviewed and compared.

Entities:  

Year:  1979        PMID: 21868845     DOI: 10.1109/tpami.1979.4766902

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  4 in total

1.  Sentiment Analysis on COVID-19 Twitter Data Streams Using Deep Belief Neural Networks.

Authors:  Jatla Srikanth; Avula Damodaram; Yuvaraja Teekaraman; Ramya Kuppusamy; Amruth Ramesh Thelkar
Journal:  Comput Intell Neurosci       Date:  2022-05-06

2.  New Insight on the Safety of Erenumab: An Analysis of Spontaneous Reports of Adverse Events Recorded in the US Food and Drug Administration Adverse Event Reporting System Database.

Authors:  Maurizio Sessa; Morten Andersen
Journal:  BioDrugs       Date:  2021-02-20       Impact factor: 5.807

3.  Generation of functional oligopeptides that promote osteogenesis based on unsupervised deep learning of protein IDRs.

Authors:  Mingxiang Cai; Baichuan Xiao; Fujun Jin; Xiaopeng Xu; Yuwei Hua; Junhui Li; Pingping Niu; Meijing Liu; Jiaqi Wu; Rui Yue; Yong Zhang; Zuolin Wang; Yongbiao Zhang; Xiaogang Wang; Yao Sun
Journal:  Bone Res       Date:  2022-03-01       Impact factor: 13.567

4.  Automatically clustering large-scale miRNA sequences: methods and experiments.

Authors:  Linxia Wan; Jiandong Ding; Ting Jin; Jihong Guan; Shuigeng Zhou
Journal:  BMC Genomics       Date:  2012-12-17       Impact factor: 3.969

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.