| Literature DB >> 34204065 |
Fatima Hameed Khan1, Muhammad Adeel Pasha1, Shahid Masud1.
Abstract
Artificial intelligence (AI) has successfully made its way into contemporary industrial sectors such as automobiles, defense, industrial automation 4.0, healthcare technologies, agriculture, and many other domains because of its ability to act autonomously without continuous human interventions. However, this capability requires processing huge amounts of learning data to extract useful information in real time. The buzz around AI is not new, as this term has been widely known for the past half century. In the 1960s, scientists began to think about machines acting more like humans, which resulted in the development of the first natural language processing computers. It laid the foundation of AI, but there were only a handful of applications until the 1990s due to limitations in processing speed, memory, and computational power available. Since the 1990s, advancements in computer architecture and memory organization have enabled microprocessors to deliver much higher performance. Simultaneously, improvements in the understanding and mathematical representation of AI gave birth to its subset, referred to as machine learning (ML). ML includes different algorithms for independent learning, and the most promising ones are based on brain-inspired techniques classified as artificial neural networks (ANNs). ANNs have subsequently evolved to have deeper and larger structures and are often characterized as deep neural networks (DNN) and convolution neural networks (CNN). In tandem with the emergence of multicore processors, ML techniques started to be embedded in a range of scenarios and applications. Recently, application-specific instruction-set architecture for AI applications has also been supported in different microprocessors. Thus, continuous improvement in microprocessor capabilities has reached a stage where it is now possible to implement complex real-time intelligent applications like computer vision, object identification, speech recognition, data security, spectrum sensing, etc. This paper presents an overview on the evolution of AI and how the increasing capabilities of microprocessors have fueled the adoption of AI in a plethora of application domains. The paper also discusses the upcoming trends in microprocessor architectures and how they will further propel the assimilation of AI in our daily lives.Entities:
Keywords: application-specific integrated circuits; artificial intelligence; automation; instruction set architecture; intelligent systems; machine learning; microprocessors; multicores; real-time processing
Year: 2021 PMID: 34204065 PMCID: PMC8227299 DOI: 10.3390/mi12060665
Source DB: PubMed Journal: Micromachines (Basel) ISSN: 2072-666X Impact factor: 2.891
Figure 1Taxonomy of AI and its sub-fields.
Figure 2Generic operation of an Expert System.
Figure 3Artificial intelligence over the years.
Figure 4Five stages of pipelining in microprocessors.
Basic features of popular superscalar microprocessors (1990s–2000s).
| High-Performance (Superscalar) Microprocessors | |||||
|---|---|---|---|---|---|
| Microprocessor | Year | Clock Speed (MHz) | Transistor Size (microns) | Cache Size (KB) | Pipe Stages |
| Intel 486 (Intel, Santa Clara, CA, USA) | 1989 | 25 to 50 | 0.8–1 | 8 | 5 |
| Intel Pentium Pro (Intel, Santa Clara, CA, USA) | 1995 | 200 | 0.35–0.6 | 8/8 | 12–14 |
| DEC Alpha 21164 (DEC, Maynard, MA, USA) | 1996 | 500 | 0.5 | 8/8/96 | 7 |
| Power PC 604e | 1997 | 233 | 0.25 | 32/32 | 6 |
| AMD K5 (AMD, Santa Clara, CA, USA) | 1996 | 75–133 | 0.35–0.5 | 8/16 | 5 |
| MIPS R10000 (MIPS Technologies, Sunnyvale, CA, USA) | 1996 | 200 | 0.35 | 32/32 | 5 |
| Intel Pentium IV (Intel, Santa Clara, CA, USA ) | 2000 | 1400–2000 | 0.18 | 256 | 20 |
Figure 5Trend of heat dissipation with the increase in power density of Intel chips.
Basic features of popular multicore microprocessors (2005 onwards).
| Multicore Microprocessors | |||||
|---|---|---|---|---|---|
| Microprocessor | Year | Clock Speed (GHz) | Transistor Size (nm) | Caches (MB) | Cores |
| AMD Athlon 64 X2 | 2005 | 2 | 90–65 | 0.5 | 2 |
| Intel Core 2 Duo | 2006 | 2.66 | 65 | 4 | 2 |
| Intel Core 2 Quad Q6600 | 2007 | 2.4 | 65 | 8 | 4 |
| Intel Core i7-3770 | 2012 | 3.4 | 22 | 8 | 4 |
| AMD Ryzen 7 1700x | 2017 | 3-3.6 | 14 | 4/16 | 8 |
| Intel Core i9 10900 | 2020 | 5.20 | 14 | 20 | 10 |
| Intel Xeon Platinum 9282 | 2019 | 3.8 | 14 | 77 | 56 |
| AMD Ryzen Threadripper 3990X (AMD, Santa Clara, CA, USA) | 2020 | 4.3 | 7 | 32/256 | 64 |
Figure 6Performance evolution of general-purpose microprocessors.
Figure 7Comparison between CPU operations per second for single core and dual core.
Figure 8Single-instruction multiple-data (SIMD) operation.
Figure 9Mapping of fully connected (FC) layers onto matrix multiplication.
Figure 10Mapping of convolution (Conv) layers onto matrix multiplication.
Comparison of the number of CPU and GPU cores.
| CPU | GPU | ||||
|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 4 | 8 |
| - | 4352 |
|
| 4 | 16 | |||
|
| 8 | 28 |
| 640 | 5120 |
|
| 4 | 28 | |||
|
| 4 | 56 |
| 432 | 6912 |
|
| 24 | 64 | |||
Figure 11Hardware block diagram showing the generic structure of a spatial architecture.