Literature DB >> 33620460

Active Learning to Classify Macromolecular Structures in situ for Less Supervision in Cryo-Electron Tomography.

Xuefeng Du1, Haohan Wang2, Zhenxi Zhu3, Xiangrui Zeng4, Yi-Wei Chang5, Jing Zhang6, Eric Xing7, Min Xu4.   

Abstract

MOTIVATION: Cryo-Electron Tomography (cryo-ET) is a 3D bioimaging tool that visualizes the structural and spatial organization of macromolecules at a near-native state in single cells, which has broad applications in life science. However, the systematic structural recognition and recovery of macromolecules captured by cryo-ET are difficult due to high structural complexity and imaging limits. Deep learning based subtomogram classification have played critical roles for such tasks. As supervised approaches, however, their performance relies on sufficient and laborious annotation on a large training dataset.
RESULTS: To alleviate this major labeling burden, we proposed a Hybrid Active Learning (HAL) framework for querying subtomograms for labelling from a large unlabeled subtomogram pool. Firstly, HAL adopts uncertainty sampling to select the subtomograms that have the most uncertain predictions. This strategy enforces the model to be aware of the inductive bias during classification and subtomogram selection, which satisfies the discriminativeness principle in AL literature. Moreover, to mitigate the sampling bias caused by such strategy, a discriminator is introduced to judge if a certain subtomogram is labeled or unlabeled and subsequently the model queries the subtomogram that have higher probabilities to be unlabeled. Such query strategy encourages to match the data distribution between the labeled and unlabeled subtomogram samples, which essentially encodes the representativeness criterion into the subtomogram selection process. Additionally, HAL introduces a subset sampling strategy to improve the diversity of the query set, so that the information overlap is decreased between the queried batches and the algorithmic efficiency is improved. Our experiments on subtomogram classification tasks using both simulated and real data demonstrate that we can achieve comparable testing performance (on average only 3% accuracy drop) by using less than 30% of the labeled subtomograms, which shows a very promising result for subtomogram classification task with limited labeling resources. AVAILABILITY: https://github.com/xulabs/aitom.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Year:  2021        PMID: 33620460     DOI: 10.1093/bioinformatics/btab123

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  Deep-Precognitive Diagnosis: Preventing Future Pandemics by Novel Disease Detection With Biologically-Inspired Conv-Fuzzy Network.

Authors:  Aviral Chharia; Rahul Upadhyay; Vinay Kumar; Chao Cheng; Jing Zhang; Tianyang Wang; Min Xu
Journal:  IEEE Access       Date:  2022-02-21       Impact factor: 3.476

2.  Self-supervised learning for macromolecular structure classification based on cryo-electron tomograms.

Authors:  Tarun Gupta; Xuehai He; Mostofa Rafid Uddin; Xiangrui Zeng; Andrew Zhou; Jing Zhang; Zachary Freyberg; Min Xu
Journal:  Front Physiol       Date:  2022-08-30       Impact factor: 4.755

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.