Literature DB >> 31127715

The Deep Learning-Based Recommender System "Pubmender" for Choosing a Biomedical Publication Venue: Development and Validation Study.

Renchu Guan^1,2, Dong Xu^1,3,4, Xiaoyue Feng^1,3, Hao Zhang¹, Yijie Ren³, Penghui Shang³, Yi Zhu³, Yanchun Liang^1,2.

Abstract

BACKGROUND: It is of great importance for researchers to publish research results in high-quality journals. However, it is often challenging to choose the most suitable publication venue, given the exponential growth of journals and conferences. Although recommender systems have achieved success in promoting movies, music, and products, very few studies have explored recommendation of publication venues, especially for biomedical research. No recommender system exists that can specifically recommend journals in PubMed, the largest collection of biomedical literature.
OBJECTIVE: We aimed to propose a publication recommender system, named Pubmender, to suggest suitable PubMed journals based on a paper's abstract.
METHODS: In Pubmender, pretrained word2vec was first used to construct the start-up feature space. Subsequently, a deep convolutional neural network was constructed to achieve a high-level representation of abstracts, and a fully connected softmax model was adopted to recommend the best journals.
RESULTS: We collected 880,165 papers from 1130 journals in PubMed Central and extracted abstracts from these papers as an empirical dataset. We compared different recommendation models such as Cavnar-Trenkle on the Microsoft Academic Search (MAS) engine, a collaborative filtering-based recommender system for the digital library of the Association for Computing Machinery (ACM) and CiteSeer. We found the accuracy of our system for the top 10 recommendations to be 87.0%, 22.9%, and 196.0% higher than that of MAS, ACM, and CiteSeer, respectively. In addition, we compared our system with Journal Finder and Journal Suggester, which are tools of Elsevier and Springer, respectively, that help authors find suitable journals in their series. The results revealed that the accuracy of our system was 329% higher than that of Journal Finder and 406% higher than that of Journal Suggester for the top 10 recommendations. Our web service is freely available at https://www.keaml.cn:8081/.
CONCLUSIONS: Our deep learning-based recommender system can suggest an appropriate journal list to help biomedical scientists and clinicians choose suitable venues for their papers. ©Xiaoyue Feng, Hao Zhang, Yijie Ren, Penghui Shang, Yi Zhu, Yanchun Liang, Renchu Guan, Dong Xu. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 24.05.2019.

Entities: Chemical Disease Gene

Keywords: PubMed; biomedical literature; convolutional neural network; deep learning; recommender system

Mesh：

Year: 2019 PMID： 31127715 PMCID： PMC6555124 DOI： 10.2196/12957

Source DB: PubMed Journal: J Med Internet Res ISSN： 1438-8871 Impact factor: 5.428

Introduction

Background

With the fast-growing research activities, more biomedical papers are being published in thousands of journals worldwide. For example, PubMed Central (PMC) has 5.2 million papers and 7409 journals covering biomedical and life sciences [1]. Although these publications play a major role in disseminating research outcome, the growth of journal publications imposes a challenge for selections of appropriate publication venues. It is vital that authors submit to the right journal that meets the journal scope and provides sound reviews. It is equally important that they reach their intended audience and obtain a large number of citations [2]. However, researchers are unfamiliar with all the journals related to their work for choosing the most suitable one for submitting a paper. Moreover, different publication scopes of journals and research interests of reviewers and editors may affect the decision of a submitted manuscript. If the submitted paper cannot meet the interests of a publication venue and its editors and reviewers, it may lead to rejection, delay, or less readership. An appropriate recommender system can help solve this problem. Recommender systems have been proven to serve as an effective method for decision making in many areas such as music, movies, and information media choices [3-6]. The well-known techniques of recommender systems are content-based recommendation [7,8], collaborative filtering recommendation [4,9], and hybrid recommendation [6,10]. Content-based recommender systems recommend an item to a user based on a description of the item. Collaborative filtering methods and hybrid methods may outperform the content-based recommendations by applying user data, if available. However, after the user privacy issue of Facebook in 2018 and the introduction of European Union General Data Protection Regulation, user data are no longer easy to obtain. Moreover, in many domains, especially in material recommendation, there are no user data available for collaborative filtering methods at the beginning [11], which is regarded as a cold-start problem. Content-based recommendations do not need any user information and are more suitable for solving these problems [12]. Based on the content-based recommendation strategy, several attempts have been made to create recommender systems for medical applications and scientific literature. Using geotagged mobile search logs, Agarwal et al [13] adopted a Random Forest model to predict medical visits. Using topic, writing style, author information, citation information, abstract, and title as information items, latent Dirichlet allocation [14] and k-nearest neighbor [15] were used to classify the scientific literature for recommendation [2,12,16,17]. Luong et al [18] used the coauthors’ network as advanced information to recommend a publication venue. Beel et al [19] conducted a literature survey on recommender systems, exploring their methods, evaluation measurements, and datasets. For most of these recommender systems, the high-dimensional and sparse matrix computation is a critical problem [20]. Because of the mismatches caused by ambiguity in text comparisons, the content-based recommendation approach may cause a high error rate [21]. Recently, due to the ability of discovering intricate structures and deep semantics in high-dimensional data, deep learning methods have succeeded in many areas and recently been proposed to build recommender systems for both collaborative filtering and content-based approaches. Hinton et al [22] proposed restricted Boltzmann machines for modeling tabular or count data as a collaborative filtering model on the Netflix data set. McAuley et al [23] proposed an image-based recommendation, which adopted a deep learning model to extract image features. Van den Oord et al [24] applied a deep convolutional neural network (CNN) to predict latent factors from music audios for music recommendation. Wang et al [11] proposed a collaborative deep learning model to jointly perform deep representation learning for the content information and collaborative filtering of a rating matrix. However, to the best of our knowledge, these deep learning techniques have not been used in any biomedical literature recommender system. Most current venue recommendation studies focus on computer science and technology, but not on the biomedical field. Biomedical sciences are highly interdisciplinary and often link to engineering, medicine, biology, physics, psychology, etc, thereby serving more journals and more diverse topics than any other field. Hence, the development of a recommender system is more essential and challenging for the biomedical sciences than any other discipline. Furthermore, previous recommender systems were based on shallow machine learning methods and social networks. They were generally keyword-based methods and did not take semantics into account. In addition, the few existing systems only focus on journals under a certain organization, such as Elsevier, IEEE, and Springer, instead of PubMed.

Aim

In contrast to our previous study on computer science publication recommendations using conventional machine learning approaches [12], we proposed a deep learning–based recommender system for biomedical publication venues, named Pubmender. Due to the copious vocabulary of biomedical literature, the traditional vector space model can lead to high-dimensional and sparse problems. To address this issue, dimensionality reduction methods are needed before learning the pattern. Moreover, initializing text matrix by pretrained word embedding is more beneficial for training neural networks than random initialized embedding [25]. Accordingly, we applied a word2vec model for our study instead of using the conventional vector space model employed in our previous publication venue recommender system. In addition, deep learning models are able to learn multiple-level abstract representations of data with syntactic and semantic information, since more abstract concepts can be constructed with multiple processing layers [26]. We applied the deep learning approach to provide recommendations of journals for biomedical researchers. Unlike shallow learning, the state-of-the-art embedding method and deep CNN in Pubmender were trained from 837,882 papers in 1130 biomedical journals. This method can help researchers find a variety of choices, without being limited to their own knowledge of journals.

Methods

Pubmender System

Figure 1 shows the architecture and workflow of our Pubmender system. It consists of user interface, data preprocessing, abstract representation, classification, and ranking phase.

Figure 1

Architecture of our Pubmender system. CNN: convolutional neural network; ISSN: International Standard Serial Number.

The user interface obtains the input data (an abstract submitted by a user) and presents the recommendation results to the user. The data acquirement is followed by data preprocessing and information extraction. At the start of our deep learning model, the abstract representation phase converts an abstract to a vector. The original abstract vector is a concatenation of pretrained word vectors. Subsequently, deep CNN is applied to train the model to achieve high-level abstract representation. A three-layer fully connected network with a softmax operation is applied to classify papers based on the obtained abstract vectors. The recommendation list of the top N journals obtained from the ranking phase is presented to the user.

Data Preprocessing Methods

The data were downloaded from the File Transfer Protocol service of PubMed Central (PMC) [27], containing 1,534,649 papers. Based on the journal list of PMC, we selected normal journals deposited under full participation or the US National Institutes of Health portfolio mode, excluding records labeled “Predecessor,” “No New Content,” and “Now Select.” Papers from Jan 2007 to Apr 2017 were selected. Papers with no abstracts or with fewer than 200 characters in abstracts were deleted. Journals containing fewer than 100 papers were also removed. Finally, 880,165 papers in the XML format from 1130 journals were used in our study. Each PMC file is a semistructured XML document and contains various tags, such as , <abstract>, and <issn>. We extracted the content in <abstract>, <ISSN>, and <pub-date> fields from the raw XML files. Then, pissn and eissn in the ISSN field were replaced by “LocatorPlus ID,” which is the unique identification for a journal in the US National Library of Medicine catalog. After extraction, each abstract was stored in a corresponding file. Natural Language Toolkit was adopted to operate <span class="Disease">word</span> segmentation [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="28. . . <i>Natural Language Toolkit</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">28</a>].</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Abstract Representation </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>In Pubmender, the recommendation task is formulated into a multilabel classification problem, where the text representation and classification methods are critical. For abstracts, we originally embedded abstracts with pretrained <span class="Disease">word</span> vectors. Thereafter, the original embeddings were fed into CNN to achieve more abstract representation as explained below.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Architecture of our Pubmender system. CNN: convolutional neural network; ISSN: International Standard Serial Number.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Let v∈R be the k-dimensional <span class="Disease">word</span> vector corresponding to the i-th <span class="Disease">word</span> in the abstract A. An original representation of A is represented as a matrix V={v1,…,v}T, which is the concatenation of the words’ vectors. Due to the different sizes of abstracts, we set m as the maximum count of words in an abstract. A padding operation with zeros was adopted for input with fewer than m words in an abstract and a tail truncation operation for more than m words. The vectors of words adopt pretrained vectors using <span class="Disease">word</span> embedding and are induced from the PubMed abstracts and PubMed Central full text. The word2vec tool [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="29. . . <i>EVEX</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">29</a>] was adopted for <span class="Disease">word</span> embedding using the skip-gram model with a window size of h, hierarchical softmax training, and a frequent <span class="Disease">word</span> subsampling threshold to create k-dimensional vectors. <span class="Disease">Word</span> vectors are initialized by zeros if they are not in the pretrained vocabulary. Finally, the representation of an abstract is matrix V with a dimensionality of m*k. It was used as the input to feed to the next step. To achieve more abstract and semantic features, we adopted CNN to extract semantic information.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Figure 2 shows the structure of our deep CNN model. There are three convolutional and max-pooling layers in CNN, one fully connected layer, one hidden layer, and one softmax layer for classification. For an abstract, A(w1,w2,...,w) with w represents the i-th <span class="Disease">word</span> and v∈R is the k-dimensional <span class="Disease">word</span> vector corresponding to <span class="Disease">word</span> w. The abstract is represented as v1:=v1⨁v2⨁...⨁v (1), where ⨁ is the concatenation operator, m is the maximum length of abstracts (a scalar), and v refers to the vector of concatenation of the words w,w+1,…,w+. The first convolutional layer performs as a one-dimensional convolution operation on sliding windows of h1 words to produce a phrase feature. For example, a feature c is generated from a window of words v by c=g(f▪v+b1) (2). Here, b1∈R is a bias term and g is a nonlinear function such as rectified linear unit (ReLu). f∈R is the j-th convolutional kernel, whose shape is k × h1, where k is the dimension of <span class="Disease">word</span> vectors and h1 is the window size. This kernel is applied to each possible window of words in the abstract {v1:,v2:,...,v} to produce a feature map C=[c, c...,c] (3) with C∈R.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="fig"><img src="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555124/bin/jmir_v21i5e12957_fig2.jpg" style="width:99%;" /><b>Figure 2</b><p><span>The structure of our deep convolutional neural network model.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p></div><pxy><span>If there are r1 convolutional kernels, then is the result of the first convolution operation on V. The pooling operation is then carried out on C. Its function is to progressively reduce the spatial size of the representation to extract the key features and reduce the number of dimensions in the network. The pooling layer operates independently on every depth slice of the input and resizes it spatially, using the max-pooling operation [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="30. Hu B, Lu Z, Li H, Chen Q. Convolutional Neural Network Architectures for Matching Natural Language Sentences. <i>Proceeding of 27th Conference on Neural Information Processing Systems</i>. 2014 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">30</a>] in every two-unit window for each C. P, described below, is the result of the max-pooling operation: (4), where j is the j-th filter of the convolutional operation (5).</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>The second and third convolutional and pooling layers work the same as Equations (2) and (5). Following the three convolutional and pooling operations is the fully connected layer. Here, the input is represented with a more abstract feature , where r3 is the number of third-layer convolutional filters. The three convolutional and pooling operations indicate a phrase-level feature, a sentence-level feature, and an abstract-level feature. The algorithm of abstract embedding is listed in Textbox 1.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>The structure of our deep convolutional neural network model.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Input:</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Embedding the abstract A to matrix V={v1,…,v}T</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>r is the number of convolutional filters of layer t, where t=1, 2, 3</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>h is the convolutional window size of layer t, where t=1, 2, 3</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Output:</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Procedure:</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>for t=1, 2, 3</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>for j=1, 2,…r</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>for i=1, 2,…m–ht+1</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>End for</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>for i=1, 2,… (m–h+1)/2</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>End for</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>End for</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>End for</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Softmax Classification </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>A fully connected softmax layer is the last layer of Pubmender. Given the training sample, A, where T is the number of possible labels, z is the class score for the sample, and the estimated probabilities S∈[0,1) for each label j∈{1,2…T} the softmax formula is:</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>We trained the entire model by minimizing the <span class="Disease">cross-entropy error</span> defined as (7), where Y is the true classification output. This is a one-hot encoding of size T, where all elements except one are 0, and one element is 1. This element marks the correct class for the data classified. We employed the optimizer Adam to learn the model parameters, which is a variant of stochastic gradient descent [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="31. Kingma D, Ba J. . <i>ArXiv.org</i>. 2014 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">31</a>].</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h1>Results</h1> <h2><span>Datasets </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>After data preprocessing, there were 880,165 preprocessed papers from PMC in 1130 open-access journals from Jan 2007 to <span class="Gene">Apr</span> 2017. The “LocatorPlus ID” assigned to each journal by PMC is regarded as the classification label of a paper. We generated four data sets based on these papers. The first data set included all papers from 2007 to 2016, which was used to choose the feature representation method and train the prediction models. Papers in 2017 formed the second data set, which was used as the test set to verify Pubmender’s performance. The last two datasets chose papers from publications in Elsevier and Springer from 2017, which were used to compare our Pubmender with Journal Finder and Suggester. The statistics of the first dataset are described in Table 1.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 1</b><p><span>Details of the first dataset (Jan 2007 to Dec 2016).</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="30" span="1"><col width="550" span="1"><col width="280" span="1"><col width="140" span="1"><thead><tr valign="bottom"><td colspan="2" rowspan="1">Statistic</td><td rowspan="1" colspan="1">Number of journals<sup>a</sup></td><td rowspan="1" colspan="1">Number of papers<sup>b</sup></td></tr></thead><tbody><tr valign="top"><td colspan="2" rowspan="1"><bold>Size</bold></td><td rowspan="1" colspan="1"> </td><td rowspan="1" colspan="1"> </td></tr><tr valign="top"><td rowspan="1" colspan="1"> </td><td rowspan="1" colspan="1">100≤x<sup>c</sup>≤400</td><td rowspan="1" colspan="1">740</td><td rowspan="1" colspan="1">157,038</td></tr><tr valign="top"><td rowspan="1" colspan="1"> </td><td rowspan="1" colspan="1">400<x≤2000</td><td rowspan="1" colspan="1">330</td><td rowspan="1" colspan="1">259,676</td></tr><tr valign="top"><td rowspan="1" colspan="1"> </td><td rowspan="1" colspan="1">2000<x≤10,000</td><td rowspan="1" colspan="1">55</td><td rowspan="1" colspan="1">195,426</td></tr><tr valign="top"><td rowspan="1" colspan="1"> </td><td rowspan="1" colspan="1">>10,000</td><td rowspan="1" colspan="1">5</td><td rowspan="1" colspan="1">225,742</td></tr><tr valign="top"><td rowspan="1" colspan="1"> </td><td rowspan="1" colspan="1">Total</td><td rowspan="1" colspan="1">1130</td><td rowspan="1" colspan="1">837,882</td></tr><tr valign="top"><td colspan="2" rowspan="1">Maximum class size</td><td rowspan="1" colspan="1">1</td><td rowspan="1" colspan="1">153,608</td></tr><tr valign="top"><td colspan="2" rowspan="1">Minimum class size</td><td rowspan="1" colspan="1">4</td><td rowspan="1" colspan="1">100</td></tr><tr valign="top"><td colspan="2" rowspan="1">Average class size</td><td rowspan="1" colspan="1">N/A<sup>d</sup></td><td rowspan="1" colspan="1">741</td></tr></tbody></table><p><span>aThis represents the total number of journals in this range.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><p><span>bThis represents the total number of papers published in all journals in this range.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><p><span>cx represents the number of papers published in one journal.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><p><span>dN/A: not applicable.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p></div></div><pxy><span>One of the biggest challenges of these datasets is that the data distribution is highly imbalanced. In the first dataset, 60 journals published more than 2000 papers, while 740 journals published fewer than 400 papers. The number of papers in “PLOS One” was 153,608, which is larger than the number in other journals, based on its extensive and comprehensive scope. “Scientific Reports” ranked second, with 37,864 papers published, while “Horticulture Research” only published 100 papers. The average paper count was 741, and 934 journals had fewer than that number.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Details of the first dataset (Jan 2007 to Dec 2016).</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>aThis represents the total number of journals in this range.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>bThis represents the total number of papers published in all journals in this range.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>cx represents the number of papers published in one journal.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>dN/A: not applicable.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Parameters and Measurements </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>For CNN, three convolutional and three pooling operations were adopted. The pretrained <span class="Disease">word</span> vectors generated by word2vec, available from Evex [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="29. . . <i>EVEX</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">29</a>], were used. The window size h was 5 and threshold was 0.001. The dimension of a pretrained vector was 200, that is, k=200. The length of abstract had a fixed size m, which is the maximum number of the words that most abstracts contain. Table 2 shows the <span class="Disease">word</span> statistic details of the papers. With the statistics, only 43,328 of 837,882 papers (5%) contained abstracts with more than 350 words. Therefore, we chose m=350. A zero-padding operation was applied for abstracts with fewer than 350 words, together with a tail-truncation operation for abstracts containing more than 350 words.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 2</b><p><span>Word statistics of abstracts.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="720" span="1"><col width="280" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Size</td><td rowspan="1" colspan="1">Number of abstracts</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">20≤x<sup>a</sup><50</td><td rowspan="1" colspan="1">25,499</td></tr><tr valign="top"><td rowspan="1" colspan="1">50≤x<100</td><td rowspan="1" colspan="1">76,614</td></tr><tr valign="top"><td rowspan="1" colspan="1">100≤x<150</td><td rowspan="1" colspan="1">139,420</td></tr><tr valign="top"><td rowspan="1" colspan="1">150≤x<200</td><td rowspan="1" colspan="1">227,993</td></tr><tr valign="top"><td rowspan="1" colspan="1">200≤x<250</td><td rowspan="1" colspan="1">191,156</td></tr><tr valign="top"><td rowspan="1" colspan="1">250≤x<300</td><td rowspan="1" colspan="1">87,597</td></tr><tr valign="top"><td rowspan="1" colspan="1">300≤x<350</td><td rowspan="1" colspan="1">46,275</td></tr><tr valign="top"><td rowspan="1" colspan="1">x>350</td><td rowspan="1" colspan="1">43,328</td></tr></tbody></table><p><span>ax denotes the number of words in the abstract.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p></div></div><pxy><span>The convolutional operation parameters are listed in Table 3. The activation function adopted the ReLU function. In the pooling layer, the size of max-pooling filters was two, applied with a stride of two down samples. The parameters of the following layers (pooling layers) had the same parameter settings. Normalization and dropout strategies were used in the fully connected layer. Rate of dropout was 0.2 and L2 normalization was adopted.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 3</b><p><span>Hyperparameters of convolutional operation.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="580" span="1"><col width="280" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Convolutional layer</td><td rowspan="1" colspan="1">Convolution kernel count</td><td rowspan="1" colspan="1">Window size</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">First</td><td rowspan="1" colspan="1">256</td><td rowspan="1" colspan="1">3</td></tr><tr valign="top"><td rowspan="1" colspan="1">Second</td><td rowspan="1" colspan="1">128</td><td rowspan="1" colspan="1">4</td></tr><tr valign="top"><td rowspan="1" colspan="1">Third</td><td rowspan="1" colspan="1">96</td><td rowspan="1" colspan="1">5</td></tr></tbody></table></div></div><h2><span>Evaluation of Recommendation Results </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <h3><span>Toy Experiment </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h3> <pxy><span>We designed a toy experiment to validate the deep learning method. In the first dataset, 421,168 papers were chosen from 60 journals with more than 2000 papers. The training set and test set contained 37,951 (90%) and 4,217 (10%) papers, respectively. We selected bi-directional long short-term memory (Bi-LSTM) and fastText [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="32. . . <i>GitHub</i>. 2018 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">32</a>] as comparison models for Pubmender. Bi-LSTM represents the recurrent neural network model with the max-pooling operation from a previous study [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="33. Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data. <i>Proceedings of 2017 Conference on Empirical Methods in Natural Language Processing</i>. 2017 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">33</a>]. Pretrained <span class="Disease">word</span> vectors, generated by word2vec from a previous study [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="29. . . <i>EVEX</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">29</a>], were used as the input original <span class="Disease">word</span> vectors for fastText and Pubmender.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>To evaluate the performance of our system, top-N accuracy was adopted as a measurement, which is defined as the probability that the expected label is in the top N predicted classes. For top-N, if the journal containing the abstract is among the top N ranked journals, the classification is correct. The symbol acc@N represents the accuracy of top-N, N=1, N=3, and N=5. The comparison of accuracy is listed in Table 4, which shows that both deep learning approaches outperformed fastText in all the three measurements. The accuracy of Bi-LSTM is nearly the same as that of Pubmender. However, the running time of Pubmender was 2660 abstracts per second, which is 78% faster than Bi-LSTM (1495 abstracts/second), and Bi-LSTM needs more memory.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 4</b><p><span>Accuracy of Bi-LSTM, fastText, and Pubmender. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="580" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Methods</td><td rowspan="1" colspan="1">acc@1</td><td rowspan="1" colspan="1">acc@3</td><td rowspan="1" colspan="1">acc@5</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">fastText</td><td rowspan="1" colspan="1">0.66</td><td rowspan="1" colspan="1">0.86</td><td rowspan="1" colspan="1">0.92</td></tr><tr valign="top"><td rowspan="1" colspan="1">Bi-LSTM<sup>a</sup> (max-pooling)</td><td rowspan="1" colspan="1">0.71</td><td rowspan="1" colspan="1">0.90</td><td rowspan="1" colspan="1">0.95</td></tr><tr valign="top"><td rowspan="1" colspan="1">Pubmender</td><td rowspan="1" colspan="1"><italic>0.72</italic></td><td rowspan="1" colspan="1"><italic>0.92</italic></td><td rowspan="1" colspan="1"><italic>0.96</italic></td></tr></tbody></table><p><span>aBi-LSTM: bi-directional long short-term memory.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p></div></div><pxy><span><span class="Disease">Word</span> statistics of abstracts.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>ax denotes the number of words in the abstract.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Hyperparameters of convolutional operation.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Accuracy of Bi-LSTM, fastText, and Pubmender. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>aBi-LSTM: bi-directional long short-term memory.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h3><span>First Dataset Result </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h3> <pxy><span>For the first dataset, there were 837,882 papers in 1130 journals from 2007 to 2016. The training, validation, and test sets contained 670,306 (80%), 83,788 (10%), and 83,788 (10%) randomly selected papers, respectively. The results of and comparisons with previous work are provided in Table 5.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 5</b><p><span>Accuracy of the classification by Pubmender and other systems. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="160" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Methods</td><td rowspan="1" colspan="1">Paper count</td><td rowspan="1" colspan="1">Journal count</td><td rowspan="1" colspan="1">acc@1</td><td rowspan="1" colspan="1">acc@3</td><td rowspan="1" colspan="1">acc@5</td><td rowspan="1" colspan="1">acc@10</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Pubmender</td><td rowspan="1" colspan="1">837,882</td><td rowspan="1" colspan="1">1130</td><td rowspan="1" colspan="1"><italic>0.50</italic></td><td rowspan="1" colspan="1"><italic>0.71</italic></td><td rowspan="1" colspan="1"><italic>0.78</italic></td><td rowspan="1" colspan="1"><italic>0.86</italic></td></tr><tr valign="top"><td rowspan="1" colspan="1">MAS<sup>a</sup> [<xref rid="ref2" ref-type="bibr">2</xref>]</td><td rowspan="1" colspan="1">58,466</td><td rowspan="1" colspan="1">300</td><td rowspan="1" colspan="1">—<sup>b</sup></td><td rowspan="1" colspan="1">—</td><td rowspan="1" colspan="1">0.24</td><td rowspan="1" colspan="1">0.46</td></tr><tr valign="top"><td rowspan="1" colspan="1">ACM<sup>c</sup> [<xref rid="ref16" ref-type="bibr">16</xref>]</td><td rowspan="1" colspan="1">172,890</td><td rowspan="1" colspan="1">2197</td><td rowspan="1" colspan="1">—</td><td rowspan="1" colspan="1">—</td><td rowspan="1" colspan="1">0.56</td><td rowspan="1" colspan="1">0.70</td></tr><tr valign="top"><td rowspan="1" colspan="1">CiteSeer [<xref rid="ref16" ref-type="bibr">16</xref>]</td><td rowspan="1" colspan="1">35,020</td><td rowspan="1" colspan="1">739</td><td rowspan="1" colspan="1">—</td><td rowspan="1" colspan="1">—</td><td rowspan="1" colspan="1">0.24</td><td rowspan="1" colspan="1">0.29</td></tr></tbody></table><p><span>aMAS: Microsoft Academic Search.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><p><span>bExperimental evaluation is not available.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><p><span>cACM: Association for Computing Machinery.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p></div></div><pxy><span>The other two systems were from three widely used digital libraries: Association for Computing Machinery (ACM) [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="16. Yang ZH, Davison BD. Venue Recommendation: Submitting Your Paper with Style. <i>Proceedings of the 11th International Conference on Machine Learning and Applications</i>. 2012 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">16</a>], CiteSeer [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="16. Yang ZH, Davison BD. Venue Recommendation: Submitting Your Paper with Style. <i>Proceedings of the 11th International Conference on Machine Learning and Applications</i>. 2012 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">16</a>], and Microsoft Academic Search (<span class="Disease">MAS</span>) [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="2. Medvet E, Bartoli A, Piccinin G. Publication Venue Recommendation Based on Paper Abstract. <i>Proceedings of the IEEE 26th International Conference on Tools with Artificial Intelligence</i>. 2014 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">2</a>]. The results from Table 5 show that Pubmender achieved the best performance. The proposed system can achieve 0.50 on <span class="Gene">acc@1</span> and 0.86 on acc@10. Our system improved performance by 225% over CiteSeer and <span class="Disease">MAS</span> in terms of <span class="Gene">acc@5</span>, and by 87% over <span class="Disease">MAS</span> and 196% over CiteSeer in terms of acc@10. The system described by Yang and Davidson [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="16. Yang ZH, Davison BD. Venue Recommendation: Submitting Your Paper with Style. <i>Proceedings of the 11th International Conference on Machine Learning and Applications</i>. 2012 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">16</a>] used topic and writing-style information, and the system described by Medvet et al [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="2. Medvet E, Bartoli A, Piccinin G. Publication Venue Recommendation Based on Paper Abstract. <i>Proceedings of the IEEE 26th International Conference on Tools with Artificial Intelligence</i>. 2014 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">2</a>] used the abstracts and titles. However, our Pubmender obtained the best accuracy by using abstracts only.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>To present the ability of handling imbalanced data of Pubmender, we divided the test set into four classes (tiny, small, medium, and large) according to the paper counts of different journals. From Table 6, for the tiny set, Pubmender achieved 0.27 accuracy on the <span class="Gene">acc@1</span> and 0.54 on <span class="Gene">acc@5</span>, which are greater than the accuracy on <span class="Gene">acc@5</span> and acc@10 (in Table 5) from <span class="Disease">MAS</span> and CiteSeer, respectively. The accuracy of the top-10 (acc@10) of a large set reached 0.98. In the paper by Medvet et al [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="2. Medvet E, Bartoli A, Piccinin G. Publication Venue Recommendation Based on Paper Abstract. <i>Proceedings of the IEEE 26th International Conference on Tools with Artificial Intelligence</i>. 2014 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">2</a>], 58,466 papers were partitioned almost uniformly into 300 conferences from the <span class="Disease">MAS</span>. In CiteSeer [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="16. Yang ZH, Davison BD. Venue Recommendation: Submitting Your Paper with Style. <i>Proceedings of the 11th International Conference on Machine Learning and Applications</i>. 2012 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">16</a>], 35,020 selected papers were published across 739 venues, each of which had at least 20 papers. The average number of papers for each venue was 47. Therefore, the CiteSeer dataset is almost balanced. In contrast, the imbalance of our data, as shown in Table 1, was very critical. The sizes of classes in our dataset ranged from 100 to 153,608 papers; for example, the number of papers in “PLOS One” was 153,608, which is 270 times the average number of papers in all journals. Compared with balanced data, the classification of critically imbalanced data was a complex problem to tackle. For this problem, our model achieved satisfactory results.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 6</b><p><span>Pubmender accuracy at top N(@N) of imbalance class data. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="300" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Paper count range</td><td rowspan="1" colspan="1">acc@1</td><td rowspan="1" colspan="1">acc@3</td><td rowspan="1" colspan="1">acc@5</td><td rowspan="1" colspan="1">acc@10</td><td rowspan="1" colspan="1">Paper count</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Tiny</td><td rowspan="1" colspan="1">0.27</td><td rowspan="1" colspan="1">0.44</td><td rowspan="1" colspan="1">0.54</td><td rowspan="1" colspan="1">0.66</td><td rowspan="1" colspan="1">16,337</td></tr><tr valign="top"><td rowspan="1" colspan="1">Small</td><td rowspan="1" colspan="1">0.43</td><td rowspan="1" colspan="1">0.63</td><td rowspan="1" colspan="1">0.72</td><td rowspan="1" colspan="1">0.82</td><td rowspan="1" colspan="1">26,259</td></tr><tr valign="top"><td rowspan="1" colspan="1">medium</td><td rowspan="1" colspan="1">0.62</td><td rowspan="1" colspan="1">0.81</td><td rowspan="1" colspan="1">0.88</td><td rowspan="1" colspan="1">0.94</td><td rowspan="1" colspan="1">19,588</td></tr><tr valign="top"><td rowspan="1" colspan="1">Large</td><td rowspan="1" colspan="1">0.66</td><td rowspan="1" colspan="1">0.91</td><td rowspan="1" colspan="1">0.96</td><td rowspan="1" colspan="1">0.98</td><td rowspan="1" colspan="1">22,579</td></tr><tr valign="top"><td rowspan="1" colspan="1">All</td><td rowspan="1" colspan="1">0.50</td><td rowspan="1" colspan="1">0.71</td><td rowspan="1" colspan="1">0.78</td><td rowspan="1" colspan="1">0.86</td><td rowspan="1" colspan="1">84,763</td></tr></tbody></table></div></div><pxy><span>Accuracy of the classification by Pubmender and other systems. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>aMAS: Microsoft Academic Search.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>bExperimental evaluation is not available.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>cACM: Association for Computing Machinery.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Pubmender accuracy at top N(@N) of imbalance class data. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Moreover, excluding accuracy, we choose precision, recall, and the F1-score as measurements. For an individual class C, the assessment is defined by tp (true positives), fp (false positives), fn (false negatives), and tn (true negatives). Accuracy, precision, and recall are calculated from the counts for C. Quality of the overall classification is evaluated in two ways: macro-averaging and micro-averaging. The macro-average is the average of the same measures calculated for all classes. With the sum of counts to obtain cumulative tp, fp, tn, and fn, micro-average metrics are calculated [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="34. Sokolova M, Lapalme G. A systematic analysis of performance measures for classification tasks. <i>Information Processing & Management</i>. 2009 <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">34</a>]. The following equations show how the desired results are individually achieved:</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>We listed macro-average and micro-average metrics in Table 7. Macro-averaging treats all classes equally, while micro-averaging favors bigger classes. From Table 7, it can be seen that when the number of recommended journals increases, the probability of capturing the real journal also increases. Therefore, the recall is increased step by step from top-1 to top-10. With the growth in the number of recommended journals, the number of falsely selected journals is also growing, which results in a decrease in precision. The F1-score favors a balanced view.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 7</b><p><span>Macro-average and Micro-average metrics for recommendation results.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="160" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Metrics</td><td colspan="3" rowspan="1">Macro-average</td><td colspan="3" rowspan="1">Micro-average</td></tr><tr valign="top"><td rowspan="1" colspan="1"><break></break></td><td rowspan="1" colspan="1">Precision</td><td rowspan="1" colspan="1">Recall</td><td rowspan="1" colspan="1">F1</td><td rowspan="1" colspan="1">Precision</td><td rowspan="1" colspan="1">Recall</td><td rowspan="1" colspan="1">F1</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Top-1</td><td rowspan="1" colspan="1">0.38</td><td rowspan="1" colspan="1">0.32</td><td rowspan="1" colspan="1">0.33</td><td rowspan="1" colspan="1">0.50</td><td rowspan="1" colspan="1">0.50</td><td rowspan="1" colspan="1">0.50</td></tr><tr valign="top"><td rowspan="1" colspan="1">Top-3</td><td rowspan="1" colspan="1">0.37</td><td rowspan="1" colspan="1">0.50</td><td rowspan="1" colspan="1">0.41</td><td rowspan="1" colspan="1">0.45</td><td rowspan="1" colspan="1">0.71</td><td rowspan="1" colspan="1">0.55</td></tr><tr valign="top"><td rowspan="1" colspan="1">Top-5</td><td rowspan="1" colspan="1">0.35</td><td rowspan="1" colspan="1">0.59</td><td rowspan="1" colspan="1">0.42</td><td rowspan="1" colspan="1">0.42</td><td rowspan="1" colspan="1">0.78</td><td rowspan="1" colspan="1">0.55</td></tr><tr valign="top"><td rowspan="1" colspan="1">Top-10</td><td rowspan="1" colspan="1">0.32</td><td rowspan="1" colspan="1">0.70</td><td rowspan="1" colspan="1">0.42</td><td rowspan="1" colspan="1">0.38</td><td rowspan="1" colspan="1">0.86</td><td rowspan="1" colspan="1">0.53</td></tr></tbody></table></div></div><h3><span>New Data Verification </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h3> <pxy><span>To show the performance on new data, 42,283 papers from January 2017 to April 2017 were extracted to make further predictions. This comprised 1321 journals, some of which did not appear in the first dataset. These unseen journals increased the difficulty of prediction. The accuracy of Pubmender on the top-1, 3, 5, and 10 was 0.39, 0.61, 0.68, and 0.76, respectively. The accuracies on <span class="Gene">acc@5</span> and acc@10 were 183% and 162% higher than those of CiteSeer, respectively. From these results, we conclude that our proposed recommender system achieves a satisfactory result, even for new data that may not belong to the same data distribution.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Comparison with Journal Finder and Journal Suggester </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>Journal Finder is provided by Elsevier for recommending Elsevier journals [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="35. . . <i>Elsevier</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">35</a>]. There are 45 Elsevier journals in our dataset. Five of them with more papers were selected. The paper counts were 582 for Medicine, 193 for Data in Brief, 124 for NeuroImage: Clinical, 117 for Redox Biology, and 87 for Preventive Medicine Reports.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Elsevier’s Journal Finder requires input of the title and abstract of submitted paper, and fields of research. The titles and abstracts were extracted from XML files and then fed into Journal Finder. We chose fields of research in “Engineering,” “GeoSciences,” “Life and Health Science,” and “Chemistry.” The results are listed in Table 8 and show that our system is much better than Journal Finder.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 8</b><p><span>Comparison between Pubmender and Journal Finder. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="440" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Systems</td><td rowspan="1" colspan="1">acc@1</td><td rowspan="1" colspan="1">acc@3</td><td rowspan="1" colspan="1">acc@5</td><td rowspan="1" colspan="1">acc@10</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Pubmender</td><td rowspan="1" colspan="1"><italic>0.62</italic></td><td rowspan="1" colspan="1"><italic>0.75</italic></td><td rowspan="1" colspan="1"><italic>0.84</italic></td><td rowspan="1" colspan="1"><italic>0.90</italic></td></tr><tr valign="top"><td rowspan="1" colspan="1">Journal Finder</td><td rowspan="1" colspan="1">0.05</td><td rowspan="1" colspan="1">0.12</td><td rowspan="1" colspan="1">0.13</td><td rowspan="1" colspan="1">0.21</td></tr><tr valign="top"><td rowspan="1" colspan="1">Improvement (%)</td><td rowspan="1" colspan="1">1140</td><td rowspan="1" colspan="1">525</td><td rowspan="1" colspan="1">546</td><td rowspan="1" colspan="1">329</td></tr></tbody></table></div></div><pxy><span>Macro-average and Micro-average metrics for recommendation results.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Comparison between Pubmender and Journal Finder. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Comparison between Pubmender and Journal Suggester. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>Journal Suggester [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="36. . . <i>Springer Nature</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">36</a>], recommends journals published by Springer. Journal Suggester also requires input of the title and abstract, and field of research. We chose “<span class="Chemical">Biomedicine</span>” as the field of research. There are 14 journals from Springer in our dataset, and seven of them were chosen for comparison based on a significant number of papers in each journal — Cell Death & Disease, <span class="Disease">Malaria Journal</span>, Nanoscale Research Letters, Nature Communications, Parasites & Vectors, Scientific Reports, and Trials. Each journal chose the top 100 papers according to the size of the XML files. The results are listed in Table 9. Again, our system was much better than Journal Suggester.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <div class="xtable"><div class="fig"><b>Table 9</b><p><span>Comparison between Pubmender and Journal Suggester. Italicized values indicate the best results. acc@N represents the accuracy for top-N selection.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></p><table xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" frame="hsides" rules="groups" width="1000" cellpadding="5" cellspacing="0" border="1"><col width="440" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><col width="140" span="1"><thead><tr valign="top"><td rowspan="1" colspan="1">Systems</td><td rowspan="1" colspan="1">acc@1</td><td rowspan="1" colspan="1">acc@3</td><td rowspan="1" colspan="1">acc@5</td><td rowspan="1" colspan="1">acc@10</td></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Pubmender</td><td rowspan="1" colspan="1"><italic>0.57</italic></td><td rowspan="1" colspan="1"><italic>0.81</italic></td><td rowspan="1" colspan="1"><italic>0.87</italic></td><td rowspan="1" colspan="1"><italic>0.91</italic></td></tr><tr valign="top"><td rowspan="1" colspan="1">Journal Suggester</td><td rowspan="1" colspan="1">0.11</td><td rowspan="1" colspan="1">0.15</td><td rowspan="1" colspan="1">0.17</td><td rowspan="1" colspan="1">0.18</td></tr><tr valign="top"><td rowspan="1" colspan="1">Improvement (%)</td><td rowspan="1" colspan="1">418</td><td rowspan="1" colspan="1">440</td><td rowspan="1" colspan="1">412</td><td rowspan="1" colspan="1">406</td></tr></tbody></table></div></div><h1>Discussion</h1> <pxy><span>In this study, Pubmender was proposed to recommend a biomedical publishing venue to user. CNN was used to obtain the abstract representation. Our results show the performance of the system.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Principal Results </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>For biomedical publications, our Pubmender system is the first recommender system with <span class="Disease">word</span> embedding and deep learning models. It achieves 87.0%, 22.9%, and 196.0% higher accuracy than recommender systems on <span class="Disease">MAS</span>, ACM, and CiteSeer, respectively. In addition, the experiment results also revealed that the accuracy of our system was superior to that of Journal Finder and Journal Suggester. Our web service is freely available online [<a title="" data-container="body" data-toggle="popover" data-placement="right" data-html="true" data-trigger="hover click" data-content="37. . . <i>Pubmender: Deep Learning Based Recommender System for Biomedical Publication Venue</i>. <a target='_blank' style='cursor:pointer;' href='si.php?db=pubmed&id='><span class='glyphicon glyphicon-share-alt'></span></a>">37</a>].</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Comparison with Prior Work </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>Because no paper has been published about biomedical venue recommendations, we cannot perform any exact comparison with previous work. However, some publishers provide tools to help authors choose suitable journals. We chose two tools provided by Elsevier and Springer for comparison. The first one is Journal Finder provided by Elsevier for recommending journals of Elsevier. Pubmender achieved a much higher accuracy than Journal Finder on four metrics. For example, on <span class="Gene">acc@1</span>, the accuracy of our system reached 0.62 and Journal Finder was given an accuracy rating of 0.05, with 1140% improvement; on acc@10, the accuracy of our system reached 0.84, which is 546% higher than that of Journal Finder. Pubmender also significantly outperformed another tool, Journal Suggester.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <h2><span>Conclusions </span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button> </h2> <pxy><span>In this study, we proposed a biomedical publishing venue recommender system—Pubmender. In this system, an abstract is first represented by a vector using the composition of pretrained <span class="Disease">word</span> vectors. Subsequently, a deep CNN architecture is designed to represent and classify the submitted abstract. The original vectors are converted into more abstract feature vectors containing semantic information using deep CNNs, which overcome the sparse high-dimensional problem.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> <pxy><span>The experimental results showed that our proposed system achieves more successful performance than that of <span class="Disease">MAS</span>, ACM, CiteSeer, Journal Finder, and Journal Suggester. Even for journals containing a small number of abstracts, the performance of Pubmender was satisfactory, because Pubmender’s high-level representation method catches more semantic and structural information from the abstract.</span> <button onclick="translate_abc(this)" style="border:none;outline:none;color:#5577AA;font-size:10px;margin-bottom:0px;" title="Translate into Chinese"> <span class="glyphicon glyphicon-transfer"></span> </button></pxy> </div> <div class="tab-pane fade" id="refx"> <div style="padding-top:15px;"> <span style="font-size:13px;"> <span class="glyphicon glyphicon-stats"> </span> 3 in total</span></div> <div style="padding-top:4px;"> </div> <span class="literature_info"></span> <h2><span class="s2"></span> <span class="review">Review</span> <a href="si.php?db=pubmed&id=23787338" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">1. Representation learning: a review and new perspectives.</span></a></h2> <span class="author">Authors: Yoshua Bengio; Aaron Courville; Pascal Vincent </span><br> <span class="journal">Journal: IEEE Trans Pattern Anal Mach Intell </span> <span class="year">Date: 2013-08 </span> <span class="year">Impact factor: 6.226 </span><br><hr style="padding:0px;margin:10px;margin-left:0px;" /><h2><span class="s2"></span> <a href="si.php?db=pubmed&id=31127715" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">2. The Deep Learning-Based Recommender System "Pubmender" for Choosing a Biomedical Publication Venue: Development and Validation Study.</span></a></h2> <span class="author">Authors: Renchu Guan; Dong Xu; Xiaoyue Feng; Hao Zhang; Yijie Ren; Penghui Shang; Yi Zhu; Yanchun Liang </span><br> <span class="journal">Journal: J Med Internet Res </span> <span class="year">Date: 2019-05-24 </span> <span class="year">Impact factor: 5.428 </span><br><hr style="padding:0px;margin:10px;margin-left:0px;" /><h2><span class="s2"></span> <a href="si.php?db=pubmed&id=27655225" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">3. Impact of Predicting Health Care Utilization Via Web Search Behavior: A Data-Driven Analysis.</span></a></h2> <span class="author">Authors: Vibhu Agarwal; Liangliang Zhang; Josh Zhu; Shiyuan Fang; Tim Cheng; Chloe Hong; Nigam H Shah </span><br> <span class="journal">Journal: J Med Internet Res </span> <span class="year">Date: 2016-09-21 </span> <span class="year">Impact factor: 5.428 </span><br><hr style="padding:0px;margin:10px;margin-left:0px;" /> <div style="padding:0px;"> </div> <div style="padding-top:5px;"> <span style="font-size:13px;"> <span class="glyphicon glyphicon-stats"> </span> 3 in total</span> </div> </div> <div class="tab-pane fade" id="citex"> <div style="padding-top:15px;"> <span style="font-size:13px;"> <span class="glyphicon glyphicon-stats"> </span> 4 in total</span></div> <div style="padding-top:4px;"> </div> <h2><span class="s2"></span> <a href="si.php?db=pubmed&id=35495546" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">1. Recommender-as-a-Service with Chatbot Guided Domain-science Knowledge Discovery in a Science Gateway.</span></a></h2> <span class="author">Authors: Komal Vekaria; Prasad Calyam; Sai Swathi Sivarathri; Songjie Wang; Yuanxun Zhang; Ashish Pandey; Cong Chen; Dong Xu; Trupti Joshi; Satish Nair </span><br> <span class="journal">Journal: Concurr Comput </span> <span class="year">Date: 2020-11-11 </span> <span class="year">Impact factor: 1.831 </span><br><hr style="padding:0px;margin:10px;margin-left:0px;" /><h2><span class="s2"></span> <a href="si.php?db=pubmed&id=31127715" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">2. The Deep Learning-Based Recommender System "Pubmender" for Choosing a Biomedical Publication Venue: Development and Validation Study.</span></a></h2> <span class="author">Authors: Renchu Guan; Dong Xu; Xiaoyue Feng; Hao Zhang; Yijie Ren; Penghui Shang; Yi Zhu; Yanchun Liang </span><br> <span class="journal">Journal: J Med Internet Res </span> <span class="year">Date: 2019-05-24 </span> <span class="year">Impact factor: 5.428 </span><br><hr style="padding:0px;margin:10px;margin-left:0px;" /><h2><span class="s2"></span> <a href="si.php?db=pubmed&id=31592230" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">3. Trends in Alzheimer's Disease Research Based upon Machine Learning Analysis of PubMed Abstracts.</span></a></h2> <span class="author">Authors: Renchu Guan; Xiaojing Wen; Yanchun Liang; Dong Xu; Baorun He; Xiaoyue Feng </span><br> <span class="journal">Journal: Int J Biol Sci </span> <span class="year">Date: 2019-08-06 </span> <span class="year">Impact factor: 6.580 </span><br><hr style="padding:0px;margin:10px;margin-left:0px;" /><h2><span class="s2"></span> <span class="review">Review</span> <a href="si.php?db=pubmed&id=36004369" target="_blank" style="cursor:pointer;"><span style="font-weight:500;font-size:15px;color:#337AB7;">4. Applications of natural language processing in ophthalmology: present and future.</span></a></h2> <span class="author">Authors: Jimmy S Chen; Sally L Baxter </span><br> <span class="journal">Journal: Front Med (Lausanne) </span> <span class="year">Date: 2022-08-08 </span><hr style="padding:0px;margin:10px;margin-left:0px;" /> <div style="padding-top:15px;"> <span style="font-size:13px;"> <span class="glyphicon glyphicon-stats"> </span> 4 in total</span></div> </div> </div> </div> <script type="text/javascript"> $('.more_ref_info a').click(function() { $(".more_ref_info").html('<div class="alert alert-info" style="padding:8px;padding-left:0px;width:90%"> <div class="three-bounce"> Loading <div class="bounce1"></div> <div class="bounce2"></div> <div class="bounce3"></div> </div> </div>'); $.post('codes/reference/ref.php',{pn:'2',idx:'31127715'},function(data) { $(".more_ref_info").html(data); }) }); $('.more_cite_info a').click(function() { $(".more_cite_info").html('<div class="alert alert-info" style="padding:8px;padding-left:0px;width:90%"> <div class="three-bounce"> Loading <div class="bounce1"></div> <div class="bounce2"></div> <div class="bounce3"></div> </div> </div>'); $.post('codes/reference/cite.php',{pn:'2',idx:'31127715'},function(data) { $(".more_cite_info").html(data); }) }); </script> <script type="text/javascript"> $(document).ready(function(){ $(".con").html('<br><br><div style="width:280px;"><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div></div>'); $.post('codes/translate/IF.php',{db:'pubmed',id:'1438-8871',lang:'en'},function(data) { $(".con").html(data); }); }); $('.search_IF a').click(function(e) { $(".con2").html(''); $(".con").html('<br><div style="width:380px;"><center><font color="#87CEEB"><b>Please waiting ...</b></font></center><br><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div></div>'); $.post('codes/translate/IF.php',{db:'pubmed',id:'1438-8871',lang:'en'},function(data) { $(".con").html(data); }); }); </script> <script type="text/javascript"> $('.dx_button').click(function() { loading.showLoading({ type:1, tip:"Loading" }) $.post('codes/translate/download_dx.php',{pmid:'31127715'},function(data) { eval('var data='+data); if(data.ti==1){ loading.hideLoading(); window.open('tmpe/31127715.pdf') } }) }) </script> <script type="text/javascript"> $(document).ready(function(){ var t=new Date().getTime(); var id=getCookie('w_id'); if(window.XMLHttpRequest){ var xhr=new XMLHttpRequest(); }else{ var xhr=new ActiveXObject('Microsoft.XMLHTTP'); } xhr.open('GET','src/php/index.php?p='+t); xhr.send(); xhr.onreadystatechange=function(){ if(xhr.readyState==4){ if(xhr.status==200){ if(!(xhr.responseText=='' && id!='' && id!=0)){ var n_val=getCookie('name_val'); //alert(n_val); $.post('codes/translate/download.php',{doi:'10.2196/12957',user:n_val},function(data) { $(".d_button").html(data); }); } } } } }); </script> <script> $(document).ready(function(){ $("#Chemical_id").change(function() { if($("#Chemical_id").is(":checked")) { $(".Chemical").addClass("Chemical_desc"); $(".Chemical_desc4").addClass("Chemical_desc3"); $(".Chemical").bind('click',function(e){ e.preventDefault(); var namea = $(this).text(); $(document).ready(function(){ $("#myModal_annotation").modal("show") }); $(".annotation_alert").html('<br><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div>'); $.get('codes/geo/annotation.php',{pmid:'31127715',namea:namea,typea:'Chemical',query:'',db:'pubmed'},function(data) { $(".annotation_alert").html(data); }) }) }else{ $(".Chemical_desc4").removeClass("Chemical_desc3"); $(".Chemical").removeClass("Chemical_desc"); $(".Chemical").unbind(); } }) $("#Disease_id").change(function() { if($("#Disease_id").is(":checked")) { $(".Disease").addClass("Disease_desc"); $(".Disease_desc4").addClass("Disease_desc3"); $(".Disease").bind('click',function(e){ e.preventDefault(); var namea = $(this).text(); $(document).ready(function(){ $("#myModal_annotation").modal("show") }); $(".annotation_alert").html('<br><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div>'); $.get('codes/geo/annotation.php',{pmid:'31127715',namea:namea,typea:'Disease',query:'',db:'pubmed'},function(data) { $(".annotation_alert").html(data); }) }) }else{ $(".Disease_desc4").removeClass("Disease_desc3"); $(".Disease").removeClass("Disease_desc"); $(".Disease").unbind(); } }) $("#Gene_id").change(function() { if($("#Gene_id").is(":checked")) { $(".Gene").addClass("Gene_desc"); $(".Gene_desc4").addClass("Gene_desc3"); $(".Gene").bind('click',function(e){ e.preventDefault(); var namea = $(this).text(); $(document).ready(function(){ $("#myModal_annotation").modal("show") }); $(".annotation_alert").html('<br><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div>'); $.get('codes/geo/annotation.php',{pmid:'31127715',namea:namea,typea:'Gene',query:'',db:'pubmed'},function(data) { $(".annotation_alert").html(data); }) }) }else{ $(".Gene_desc4").removeClass("Gene_desc3"); $(".Gene").removeClass("Gene_desc"); $(".Gene").unbind(); } }) $(".population").addClass("population_desc"); $("#population_id").change(function() { if($("#population_id").is(":checked")) { $(".population_desc4").addClass("population_desc3"); $(".population").addClass("population_desc"); }else{ $(".population_desc4").removeClass("population_desc3"); $(".population").removeClass("population_desc"); } }) $(".interventions").addClass("interventions_desc"); $("#interventions_id").change(function() { if($("#interventions_id").is(":checked")) { $(".interventions_desc4").addClass("interventions_desc3"); $(".interventions").addClass("interventions_desc"); }else{ $(".interventions_desc4").removeClass("interventions_desc3"); $(".interventions").removeClass("interventions_desc"); } }) $(".outcomes").addClass("outcomes_desc"); $("#outcomes_id").change(function() { if($("#outcomes_id").is(":checked")) { $(".outcomes_desc4").addClass("outcomes_desc3"); $(".outcomes").addClass("outcomes_desc"); }else{ $(".outcomes_desc4").removeClass("outcomes_desc3"); $(".outcomes").removeClass("outcomes_desc"); } }) }) </script> <div class="col-sm-4" style=""> <div id="myNav"> <span class="con"> </span> </div> <span class="con2"></span> <span class="con3"></span> </div> </div> </div> <script type="text/javascript"> function translate_xyz(btnObj){ var x = btnObj.previousElementSibling.innerHTML; $(".con2").html(''); $(".con").html('<br><div style="width:380px;"><center><font color="#87CEEB"><b>正在翻译中 ...</b></font></center><br><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div></div>'); $.post('codes/translate/translate_content.php',{content:x,to_lang:'en'},function(data) { $(".con").html(data); }); } function translate_abc(btnObj){ var x = btnObj.previousElementSibling.innerHTML; $(".con2").html(''); $(".con").html('<br><div style="width:380px;"><center><font color="#87CEEB"><b>正在翻译中 ...</b></font></center><br><div class="spinner"><div class="double-bounce1"></div><div class="double-bounce2"></div></div></div>'); $.post('codes/translate/translate_content.php',{content:x,to_lang:'zh'},function(data) { $(".con").html(data); }); } </script> <script type="text/javascript"> $(document).ready(function(){ loading.hideLoading(); }); $('.tab_b a').click(function() { initial_url_paras = window.location.href.split("?"); initial_url = initial_url_paras[0]; paras = initial_url_paras[1]; paras_array = paras.split("&"); for(let ii=0;ii<paras_array.length;ii++){ current_para_array = paras_array[ii].split("="); if(current_para_array[0]=="db"){dbx=current_para_array[1]} if(current_para_array[0]=="id"){idx=current_para_array[1]} } $(".ax2").html(' <div style="background-color:#d9edf7;padding:1px;padding-left:6px;margin-left:4px;font-size:12px;"><table><tr> <td>跳转中 ... </td> <td> <div class="three-bounce" style="min-height:22px;"> <div class="bounce1"></div> <div class="bounce2"></div> <div class="bounce3"></div> </div></td></tr></table> </div>'); window.location.href = 'si.php?db=' + dbx + '&id=' + idx; }) </script> <div class="modal fade" id="myModal_annotation" tabindex="-1" role="dialog" aria-labelledby="myModalLabel" aria-hidden="true"> <div class="modal-dialog" style="width:300px;"> <div class="modal-content"> <div class="modal-body"> <button type="button" class="close" data-dismiss="modal" aria-hidden="true">× </button> <span class="annotation_alert"></span> </div> </div> </div> </div> <br> <script type="text/javascript" src="src/js/child_nav.js"></script> <div id="autoHeightDiv"></div> <div class="footLineGray" style="border:none;"></div> <div class="lineWhite" style="border:none;"></div> <div class="webFoot"> <div class="foot middle" style="text-align:center;padding-right:10px;;padding-top:19px;background:white;border:none;"> 北京卡尤迪生物科技股份有限公司 © 2022-2023. </div> </div> <script> $(function () { $("[data-toggle='tooltip']").tooltip({html : true }); }); $(function() { $('#rct_show_id').click(function() { $('.rct_class').show() $('.entity_class').hide() $('#rct_show_id').hide() $('#rct_hide_id').show() }) $('#rct_hide_id').click(function() { $('.rct_class').hide() $('.entity_class').show() $('#rct_show_id').show() $('#rct_hide_id').hide() }) }) </script> <script> $(function () { $("[data-toggle='popover']").popover({html:true,trigger:'hover click'}); }); </script> <script type="text/javascript" src="src/js/child_nav.js"></script> <script type="text/javascript" src="src/js/clickx.js"></script> <script src="end.js"></script> </body> </html>