| Literature DB >> 35135830 |
Abstract
Entities:
Keywords: arthritis; health care; outcome and process assessment; patient care team; rheumatoid
Mesh:
Year: 2022 PMID: 35135830 PMCID: PMC9279752 DOI: 10.1136/annrheumdis-2022-222141
Source DB: PubMed Journal: Ann Rheum Dis ISSN: 0003-4967 Impact factor: 27.973
Figure 1State-action pairs in the reinforcement learning concept using the example of chess (A) with transfer to rheumatology (B). An agent recognises the current situation (state) and independently takes an action. A reward function evaluates the respective decisions with regard to a certain goal, for example, remission. By this loop control, the system constantly improves its decisions. This could be a closed loop in the case of a drug pump and reliable biosensors and digital biomarkers, respectively. PRO, patient-reported outcomes.
Explanation of terms and concepts
| Artificial intelligence (AI) | General term when computer systems take over tasks that are typically assigned to human attributes such as learning, recognising, planning and so on. Can also be robots or cars that move independently in their environment. |
| Algorithm | Set of steps for a computer program to accomplish a task or to solve a problem. |
| Machine learning (ML) | Subform of AI. Computer systems that learn and adapt independently from |
| Supervised learning (SL) | Subform of ML. Models are trained and validated in existing, labelled data sets. These are typically used for classification tasks, for example, to predict future disease states or to detect pathologies on images. |
| Unsupervised learning (UL) | Subform of ML. Models are created from unlabelled data, for example, for clustering or outlier detection in electronic medical records. |
| Reinforcement learning (RL) | Subform of ML. Models that can make prospective decisions on their own and constantly improve them depending on the results. |
| Q-learning | Subform of RL. A model-free, flexible RL algorithm to learn the value of a certain action. Random actions outside a specific system can be learnt, for example, by imitating and improving expert actions. |
| Artificial neural networks | A set of algorithms, modelled loosely after the human brain, in the form of different layers similar to neurons. A powerful tool which can be used for supervised, unsupervised or RL. |
Figure 2Multimodal decision making by reinforcement learning algorithms at different time points. Adapted from Ref. 15.