As a leading machine learning tool, how will deep learning change the field of medical imaging?

Deep learning is a rapidly growing trend in data analysis and was named one of the 10 breakthrough technologies of 2013 [1]. It extends the classical neural network by adding more computational layers, which allows higher levels of abstraction and better predictions from data [2]. It has already become the leading machine learning tool in general imaging and computer vision.

In particular, convolutional neural networks (CNNs) have proven to be powerful tools for many computer vision tasks. Deep CNNs automatically learn mid-level and high-level abstractions from raw data (e.g., images). Recent results show that generic descriptors extracted from CNNs are highly effective for object recognition and localization in natural images. Medical image analysis groups around the world are rapidly entering the field, applying CNNs and other deep learning methods to a wide range of applications, and many promising results are emerging.

In medical imaging, accurate diagnosis or assessment of disease depends on both image acquisition and image interpretation. In recent years, devices have been able to acquire data faster and at higher resolution, greatly improving image acquisition. Improvements in image interpretation through computer technology, however, have only just begun. Most medical images are still interpreted by physicians, yet human interpretation is limited by subjectivity, large variation across interpreters, and fatigue. Many diagnostic tasks require an initial search to detect abnormalities and to quantify changes in measurements and over time. Computerized tools, particularly image analysis and machine learning, play a key role in improving diagnosis: they support the expert workflow by helping to identify the areas that need attention. Among these tools, deep learning has quickly established itself as a strong foundation, improving accuracy, opening new areas of data analysis, and continuing to develop at an unprecedented rate.

A. Historical Networks

The basic ideas behind neural networks and deep learning have existed for decades [3]. Early networks usually had only a few layers. The appearance of the back-propagation algorithm significantly improved the performance of neural networks, but performance was still not sufficient. Other classifiers were gradually developed, including decision trees, boosting, and support vector machines. Each of them has been applied to medical image analysis, especially for detecting abnormalities, as well as to related tasks such as segmentation. Despite this progress, relatively high false positive rates remained common.

As early as 1996, in the work of Sahiner et al., a CNN was applied to medical image processing [4]. In that work, regions of interest (ROIs) containing biopsy-proven masses or normal tissue were extracted from mammograms. The CNN contained one input layer, two hidden layers, and one output layer, and back-propagation was used for training. In this pre-GPU era, training time was described as "computationally intensive," but no specific figure was given. Already in 1993, a CNN had been used for lung nodule detection [5], and in 1995 for detecting microcalcifications on mammograms [6].

A typical CNN for image processing consists of a series of convolution filter layers interspersed with data reduction or pooling layers. Each convolution filter processes a small block of the input image. Analogous to low-level pixel processing in the human visual system, the convolution filters detect highly relevant image features: first simple ones, such as lines that may represent sharp edges (e.g., for organ detection) or circular objects (e.g., colon polyps), and then higher-order features such as local and global shape and texture. The output of the CNN is typically one or more probabilities or class labels for the image. The convolution filters are learned directly from the training data. This is exactly what is needed, because it reduces the need for time-consuming hand-crafted features. It stands in contrast to earlier approaches, in which filters designed for a specific application had to be constructed by hand and the corresponding features computed as part of image preprocessing.
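
To make the structure concrete, here is a minimal sketch of such a patch classifier written in PyTorch. It is purely illustrative and not taken from any of the cited papers; the layer sizes, the 32 × 32 grayscale input, and the two-class output are assumptions.

```python
import torch
import torch.nn as nn

class SmallPatchCNN(nn.Module):
    """Minimal CNN: convolution filter layers interleaved with pooling, then a classifier."""
    def __init__(self, in_channels=1, num_classes=2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=5, padding=2),  # low-level filters (edges, blobs)
            nn.ReLU(),
            nn.MaxPool2d(2),                                       # data reduction / pooling
            nn.Conv2d(16, 32, kernel_size=5, padding=2),           # higher-level shape/texture filters
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 8 * 8, num_classes)       # assumes 32x32 input patches

    def forward(self, x):
        x = self.features(x)
        x = torch.flatten(x, 1)
        return torch.softmax(self.classifier(x), dim=1)            # per-class probabilities

# Example: a batch of 4 grayscale 32x32 patches -> 4 rows of class probabilities
probs = SmallPatchCNN()(torch.randn(4, 1, 32, 32))
```

The filter weights in both convolutional layers are the quantities learned from the training data, which is what removes the need for hand-crafted features.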

CNNs are highly parallelizable algorithms. Much of their practical value comes from the large speed-up (roughly 40-fold) provided by graphics processing units (GPUs) compared with CPU-only processing. Early papers describing the value of GPUs for training CNNs and other machine learning techniques were published in 2006 [8]. In medical image processing, GPUs were first introduced for segmentation, reconstruction, and registration, and only later for machine learning [9], [10]. Interestingly, although Eklund et al. [10] discussed convolutions extensively in their 2013 paper, convolutional neural networks and deep learning were not mentioned at all, which highlights how rapidly the deep learning revolution has reshaped medical image processing research.

B. Today's Networks

Due to the development of new CNN variants and the emergence of efficient parallel solvers optimized for modern GPUs, deep neural networks have recently attracted considerable commercial interest. The power of a CNN lies in its deep architecture, which allows it to extract a set of discriminative features at multiple levels of abstraction. Training a deep CNN from scratch, however, is a major challenge. First, CNNs require a large amount of labeled data, which is difficult to obtain in the medical domain: expert annotation is expensive and samples of disease (e.g., lesions) are scarce. Second, training a deep CNN requires large computational and memory resources, without which the training process becomes very time consuming. Third, training a deep CNN is often complicated by overfitting and convergence problems, so the learning parameters or the network architecture often have to be adjusted repeatedly to ensure that all layers learn at comparable speeds. Given these difficulties, new learning schemes known as "transfer learning" and "fine-tuning" have been proposed; they provide practical solutions and are being adopted by more and more groups. These are discussed further in Section II-C.

C. Networks in the Medical Field

Deep learning methods are most effective when applied to large training sets, but in the medical domain such large datasets are not always available. We therefore face a number of key questions, including: (a) Can deep neural networks be used effectively for medical tasks? (b) Is transfer learning from general imagery to the medical domain relevant? (c) Can we rely solely on learned features, or should we combine them with hand-crafted features? The special issue of IEEE Transactions on Medical Imaging (IEEE-TMI) on deep learning in medical imaging focuses on these advances of the new machine learning era and their role in medical image processing. It describes recent achievements of CNNs and other deep learning applications in medical tasks and contains 18 articles selected from 50 submissions by investigators from around the world, a high number for an IEEE special issue given the short time between the call for papers and the submission deadline. The papers address a wide range of traditional tasks, from detection to categorization (e.g., lesion detection, image segmentation, shape modeling, image registration), as well as some open and novel application areas. They also include work focused on network exploration, giving insight into how architectures, parameters, and training sets should be chosen for different tasks.

Overview of journal articles and topics

A. Lesion detection

Computer-aided detection (CAD) is a well-established area of medical image analysis and is well suited to deep learning. In the standard CAD approach [11], lesions are detected by supervised methods or by classical image processing techniques such as filtering and mathematical morphology. Candidate lesions are usually segmented and described by a large number of hand-designed features, and a classifier maps the feature vector to the probability that the candidate is an actual lesion. A straightforward way to use deep learning instead of hand-designed features is to train a CNN that operates directly on image patches centered on the candidate lesions. Several articles in this issue use this approach. Setio et al. [12] combined three previously developed candidate detectors to obtain pulmonary nodule candidates in 3D chest CT scans, extracted 2D patches in nine different orientations centered on each candidate location, and combined different CNNs to classify each candidate. They report a modest improvement over previously published classical CAD results for the same task.
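
A hedged sketch of this generic patch-based pipeline is given below (illustrative only, not the code of Setio et al. [12]); the patch size, the padding strategy, and the assumption that the model returns class probabilities are choices made for the example.

```python
import numpy as np
import torch

def classify_candidates(ct_slice, candidates, model, half=16):
    """Crop a patch around each candidate (row, col) and score it with a trained CNN.

    ct_slice:   2D numpy array (one CT slice).
    candidates: list of (row, col) candidate locations from a separate detector.
    model:      a trained patch classifier returning per-class probabilities.
    """
    padded = np.pad(ct_slice, half, mode="edge")             # avoid problems at the image border
    scores = []
    for r, c in candidates:
        patch = padded[r:r + 2 * half, c:c + 2 * half]        # 32x32 patch centered on the candidate
        x = torch.from_numpy(patch).float()[None, None]       # -> shape (1, 1, 32, 32)
        with torch.no_grad():
            scores.append(model(x)[0, 1].item())              # probability of the "lesion" class
    return scores
```

In practice, each candidate would be scored in all nine orientations and the outputs of the different CNNs combined, as described above.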

Roth et al. [13] used CNNs to improve three existing CAD systems: detection of colon polyps in CT colonography, and detection of sclerotic spinal metastases and enlarged lymph nodes in body CT. They also used previously developed candidate detectors, three orthogonal 2D patches, and up to 100 randomly rotated views. These randomly rotated "2.5D" views are a way of decomposing the original 3D data into 2D representations. Additional accuracy is then gained by aggregating the CNN predictions over the 2.5D views. For all three CAD systems, the sensitivity of lesion detection improved by 13 to 34%, suggesting that the approach generalizes across applications. With non-deep-learning classifiers (such as support vector machines), improvements of this magnitude are almost impossible to achieve.
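
The aggregation step can be sketched as follows (an illustration of the idea, not Roth et al.'s implementation): given a stack of precomputed 2.5D view patches for one candidate, the CNN is evaluated on every view and the lesion probabilities are averaged.

```python
import torch
import torch.nn as nn

def score_candidate_over_views(view_patches, model):
    """Average a classifier's predictions over many (e.g. randomly rotated 2.5D) views
    of one candidate; view_patches has shape (num_views, channels, height, width)."""
    with torch.no_grad():
        probs = torch.softmax(model(view_patches), dim=1)[:, 1]  # lesion probability per view
    return probs.mean().item()

# Example: 100 random-rotation views of 3-channel 32x32 patches, scored with a toy classifier
toy_model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 2))
score = score_candidate_over_views(torch.randn(100, 3, 32, 32), toy_model)
```

Averaging over views acts as a cheap ensemble and makes the final score less sensitive to any single orientation.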

Dou et al. [14] detected cerebral microbleeds in susceptibility-weighted magnetic resonance imaging scans. They proposed a two-stage approach using 3D CNNs, replacing the original candidate-detection stage with a CNN as well. They reimplemented competing methods and trained and tested everything on the same data set; their 3D CNN improved on both classical methods and 2D CNN approaches from the literature.

Sirinukunwattana et al. [15] detected and classified nuclei in histopathology images. Their CNN takes small patches as input but, rather than simply predicting whether the center pixel of a patch is a nucleus, it models the output as a surface with a peak at each nucleus center that is flat elsewhere. This spatially constrained CNN, combined with overlapping patches in the testing phase, gives better results than previously proposed techniques based on CNNs and on classical hand-crafted features.
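
The regression target described above can be illustrated with a short sketch (a simple stand-in, not the authors' exact formulation): a map that rises to a peak in a small neighborhood around each annotated nucleus center and stays flat at zero elsewhere, which a patch-based CNN is then trained to predict.

```python
import numpy as np

def make_peak_target(shape, centers, radius=4.0):
    """Build a training target with a smooth peak at each nucleus center, flat elsewhere."""
    rows, cols = np.indices(shape)
    target = np.zeros(shape, dtype=np.float32)
    for r, c in centers:
        d2 = (rows - r) ** 2 + (cols - c) ** 2
        peak = np.exp(-d2 / (2.0 * radius ** 2))      # Gaussian-shaped peak around the center
        target = np.maximum(target, peak)             # keep the strongest value where nuclei overlap
    return target

# Example: a 64x64 target map with two annotated nuclei
t = make_peak_target((64, 64), [(20, 20), (40, 45)])
```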

Anthimopoulos et al. [16] focused on detecting patterns of interstitial lung disease in 2D chest CT scans. They are one of three groups studying this problem with the public dataset from [19] (the other two being Shin et al. [17] and van Tulder et al. [18]). They trained a CNN to classify 32 × 32 pixel patches into one of seven classes and report higher accuracy than three previous methods based on hand-designed features.

Lesion detection is also addressed in several other articles, but those papers focus on broader topics or on specific methodological issues; they are discussed briefly below.

B. Segmentation and Shape Modeling

For a large data set of 2,891 echocardiograms, Ghesu et al. combined deep learning with marginal space learning for object detection and segmentation. The combination of efficient exploration of large parameter spaces with a method for enforcing sparsity in deep networks increases computational efficiency, and the approach reduced the average segmentation error by 13.5% compared with a reference method previously published by the same group.

Three groups of researchers focused on brain structures or brain lesions. Brosch et al. addressed segmentation of multiple sclerosis lesions in brain magnetic resonance imaging (MRI). They developed a 3D deep convolutional encoder network that combines interconnected convolutional and deconvolutional pathways: the convolutional pathway learns higher-level features, while the deconvolutional pathway predicts the voxel-level segmentation. They applied the network to two public datasets and one clinical trial dataset and compared their method with five publicly available methods, reporting performance "comparable to the current state-of-the-art methods."
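
A minimal sketch of such an encoder-decoder, written in PyTorch, is shown below. It only illustrates the idea of a convolutional (feature-learning) pathway followed by a deconvolutional (voxel-wise prediction) pathway; the channel counts and patch size are assumptions, and this is not the architecture of Brosch et al.

```python
import torch
import torch.nn as nn

class TinyConvEncoderDecoder3D(nn.Module):
    """Convolutional pathway learns features; deconvolutional pathway upsamples
    back to a voxel-wise lesion probability map."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv3d(1, 8, kernel_size=3, stride=2, padding=1),   # downsample by 2
            nn.ReLU(),
            nn.Conv3d(8, 16, kernel_size=3, stride=2, padding=1),  # downsample by 4
            nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose3d(16, 8, kernel_size=2, stride=2),    # upsample by 2
            nn.ReLU(),
            nn.ConvTranspose3d(8, 1, kernel_size=2, stride=2),     # back to input resolution
            nn.Sigmoid(),                                          # voxel-wise probability
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

# Example: one 32x32x32 MRI patch -> a 32x32x32 lesion probability map
seg = TinyConvEncoderDecoder3D()(torch.randn(1, 1, 32, 32, 32))
```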

Pereira et al. [22] studied brain tumor segmentation in MRI. They use small kernels, a deeper architecture, intensity normalization, and data augmentation. Different CNN architectures are used for low-grade and high-grade tumors, and the method segments the enhancing and core parts of the tumor. They ranked first on the 2013 public challenge dataset and second in the 2015 on-site challenge.

For brain structure segmentation, a study by Moeskops et al. shows that a CNN performs well on datasets covering five different age groups, from preterm infants to older adults. A multi-scale approach provides the required robustness. The method achieved good results for eight tissue classes, with average Dice similarity coefficients between 0.82 and 0.91 across the five datasets.

C. Network Exploration

1) Data dimensionality - 2D versus 3D: Most of the studies we have seen analyze data in 2D. Whether the transition from 2D to 3D will be key to substantial performance gains is frequently questioned. Several intermediate variants of data handling exist, including 2.5D representations. For example, in the study of Roth et al., axial, coronal, and sagittal images centered on a candidate colon polyp or lymph node voxel were fed into a cuda-convnet CNN through the red, green, and blue channels normally used to represent natural color images. 3D CNNs were used explicitly by Brosch et al. and Dou et al.
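
A sketch of this 2.5D idea (illustrative, with an assumed patch size) is to extract the three orthogonal slices through a candidate location and stack them as the three input channels that a natural-image CNN would normally use for red, green, and blue:

```python
import numpy as np

def orthogonal_slices_as_channels(volume, center, half=16):
    """Extract the axial, coronal and sagittal slices through `center` (z, y, x)
    and stack them as a 3-channel patch, analogous to the R, G and B channels
    of a natural color image."""
    z, y, x = center
    axial    = volume[z, y - half:y + half, x - half:x + half]
    coronal  = volume[z - half:z + half, y, x - half:x + half]
    sagittal = volume[z - half:z + half, y - half:y + half, x]
    return np.stack([axial, coronal, sagittal], axis=0)       # shape (3, 2*half, 2*half)

# Example: a 3-channel 32x32 patch from a synthetic 64x64x64 volume
patch = orthogonal_slices_as_channels(np.random.rand(64, 64, 64), (32, 32, 32))
```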

2) Learning methodology - unsupervised versus supervised: Looking at the literature, it is clear that most studies focus on supervised CNNs to achieve classification. Such networks are important for many applications, including detection, segmentation, and labeling. Some studies, however, focus on unsupervised schemes, which have mainly proven useful for image encoding, efficient image representation, and as a pre-training step for supervised deep learning. Unsupervised representation learning methods such as restricted Boltzmann machines (RBMs) may outperform standard filter banks because they learn a representation directly from the training data. RBMs are trained with a generative learning objective; this allows the network to learn representations from unlabeled data, but it does not necessarily produce features that are optimal for classification. Van Tulder et al. investigated a combination of generative and discriminative learning objectives in convolutional classification RBMs and showed that the combined learning objective outperforms purely discriminative or purely generative learning.

3) Training data considerations: CNNs enable the learning of data-driven, highly representative, hierarchical image features, and in many application areas (see this issue) these features have proven powerful and reliable. Providing such a rich representation and successful classification, however, requires sufficient training data. How much data is needed remains a key question. Related questions include: How can we use the available training data most effectively? What can we do when little data is available? And is there an alternative to collecting data and obtaining medical annotations?

Some of these questions are addressed by papers in this issue. Van Grinsven et al. attempted to improve and accelerate CNN training for medical image analysis tasks by dynamically selecting misclassified negative samples during training. CNN training is an iterative process that requires many epochs to optimize the network parameters; in each epoch, a subset of samples is drawn from the training data and presented to the network, which updates its parameters by back-propagation to minimize a cost function. Classification tasks in the medical domain are often normal-versus-pathological discrimination tasks, in which the normal class is heavily over-represented; moreover, because of the repetitive patterns of normal tissue in each image, most normal training samples are highly correlated and only a small fraction carry useful information. Treating all samples equally during learning wastes many training iterations on uninformative normal samples and makes CNN training unnecessarily slow. As this study shows, a method that identifies the informative normal samples improves the efficiency of the CNN learning process and reduces training time.
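
A minimal sketch of such selective sampling is given below (an illustration of the idea, not van Grinsven et al.'s procedure); the keep fraction and the assumption that the model outputs class probabilities are choices made for the example.

```python
import torch

def select_informative_negatives(model, neg_patches, keep_fraction=0.25):
    """Score all negative (normal) patches and keep only the hardest ones,
    i.e. those the current model most confidently mistakes for lesions."""
    with torch.no_grad():
        scores = model(neg_patches)[:, 1]                   # predicted lesion probability per patch
    k = max(1, int(keep_fraction * len(neg_patches)))
    hard_idx = torch.argsort(scores, descending=True)[:k]   # the most "lesion-like" normals
    return neg_patches[hard_idx]

def make_epoch_set(pos_patches, neg_patches, model):
    """Build one epoch's training set: all positives plus the selected hard negatives."""
    hard_negs = select_informative_negatives(model, neg_patches)
    x = torch.cat([pos_patches, hard_negs])
    y = torch.cat([torch.ones(len(pos_patches)), torch.zeros(len(hard_negs))]).long()
    perm = torch.randperm(len(x))
    return x[perm], y[perm]
```

Repeating this selection every few epochs concentrates training on the small fraction of normal samples that actually carry information.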

4) Transfer learning and fine-tuning: Obtaining datasets in medical imaging that are as comprehensively annotated as ImageNet remains a challenge. When not enough data are available, several strategies can help. 1) Transfer learning: a CNN pre-trained (with supervision) on a natural image dataset or on a different medical domain is used for a new medical task at hand. In one scenario, a pre-trained CNN is applied to an input image and the outputs of intermediate network layers are extracted; these outputs are used as features to train a separate pattern classifier. For instance, in the study of Bar et al., a pre-trained CNN was used as a feature generator for identifying chest pathologies, and in the study of van Ginneken et al., CNN-based features were integrated with hand-crafted features to improve a nodule detection system. 2) Fine-tuning: when a medium-sized dataset does exist for the task at hand, a reasonable approach is to use a pre-trained CNN as the initialization of the network, after which some (or all) of the network layers are further trained with supervision using the new data of the task at hand.
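
Both strategies can be sketched in a few lines with a pre-trained torchvision model. The ResNet-18 backbone, the 224 × 224 input size, and the two-class head are illustrative assumptions, not choices made by the cited papers.

```python
import torch
import torch.nn as nn
from torchvision import models

# 1) Transfer learning as feature extraction: run a network pre-trained on natural images,
#    take an intermediate representation, and train a separate classifier on it.
backbone = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
feature_extractor = nn.Sequential(*list(backbone.children())[:-1])     # drop the final FC layer
feature_extractor.eval()
with torch.no_grad():
    feats = feature_extractor(torch.randn(8, 3, 224, 224)).flatten(1)  # (8, 512) feature vectors
# `feats` would now be fed to a separate classifier (e.g. an SVM or logistic regression).

# 2) Fine-tuning: initialize from the pre-trained weights, replace the last layer for the
#    new task, and continue supervised training on the medical data at hand.
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 2)             # new two-class head for the medical task
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)  # small learning rate preserves pre-trained weights
```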

Transfer learning and fine-tuning are key components of using deep CNNs in medical imaging applications, and the studies of Shin et al. and Tajbakhsh et al. examine these issues. Their experimental results consistently show that using pre-trained networks with fine-tuning achieves the best results, both across application areas (Tajbakhsh et al.) and across network architectures (Shin et al.). Further analysis by Tajbakhsh et al. shows that deep fine-tuning outperforms shallow fine-tuning in terms of performance gain, and that fine-tuning becomes more important as the size of the training set decreases. In the study of Shin et al., the GoogLeNet architecture achieved state-of-the-art mediastinal lymph node detection compared with shallower architectures.

5) Ground truth from experts and non-experts: The lack of publicly available annotated data, and the difficulty, cost, and time involved in collecting such data for every medical task, are prohibitive limiting factors. Although crowdsourcing has enabled annotation of large databases of natural images, its application to biomedical data requires a deeper understanding of, and a more precise definition of, the actual annotation task (Nguyen et al., McKenna et al.). Outsourcing expert tasks to non-expert users may lead to noisy annotations and disagreement among users. Combining the knowledge of medical experts and non-professionals raises many questions, such as how to fuse the information sources and how to weight the inputs according to their previously demonstrated accuracy. These questions were addressed by Albarqouni et al., who propose an aggregation layer that is integrated into the CNN, so that learning from crowd annotations becomes part of the network's learning process. The results shown give valuable insight into what a deep CNN learns. Perhaps the most striking finding of crowdsourcing research in the medical domain is that a group of non-professional, inexperienced users can perform as well as medical experts; Nguyen et al. and McKenna et al. also observed this for radiology images.

D. Innovative Applications and Novel Use Cases

The work of Kallenberg et al. [32] takes mammographic X-ray images as input and uses unsupervised feature learning to score breast density and disease risk. They show how hierarchical features can be learned from unlabeled data and then fed directly into a simple classifier, which performs two different tasks: 1) segmentation of breast density and 2) scoring of mammographic texture. The classifier performs very well on both tasks. To control the capacity of the model, sparsity of the learned representation is enforced through a sparsity regularizer in the optimization. In the unsupervised phase, the convolutional layers are effectively trained as an autoencoder; in the supervised phase, the (pre-trained) weights and bias values are further fine-tuned with a softmax classifier.
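
A minimal sketch of this two-phase scheme is given below (illustrative only, not Kallenberg et al.'s model): a small convolutional autoencoder is pre-trained on unlabeled patches with an L1 penalty on the code as a simple stand-in for the sparsity regularization, and the pre-trained encoder is then fine-tuned with a softmax (cross-entropy) classifier on labeled patches. All sizes and the synthetic data are assumptions.

```python
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
decoder = nn.Sequential(nn.Upsample(scale_factor=2), nn.Conv2d(16, 1, 3, padding=1))

# 1) Unsupervised pre-training: reconstruct unlabeled patches; the L1 term encourages
#    sparse activations in the learned code.
unlabeled = torch.randn(64, 1, 32, 32)                    # stand-in for unlabeled mammographic patches
opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
for _ in range(10):
    code = encoder(unlabeled)
    loss = nn.functional.mse_loss(decoder(code), unlabeled) + 1e-4 * code.abs().mean()
    opt.zero_grad(); loss.backward(); opt.step()

# 2) Supervised fine-tuning: reuse the pre-trained encoder weights and train a
#    softmax (cross-entropy) classifier on labeled patches.
labeled, labels = torch.randn(64, 1, 32, 32), torch.randint(0, 2, (64,))
classifier = nn.Sequential(encoder, nn.Flatten(), nn.Linear(16 * 16 * 16, 2))
opt = torch.optim.Adam(classifier.parameters(), lr=1e-4)
for _ in range(10):
    loss = nn.functional.cross_entropy(classifier(labeled), labels)
    opt.zero_grad(); loss.backward(); opt.step()
```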

The work of Yan et al. [33] designs a multi-stage deep learning framework for image classification and applies it to body-part recognition. In the pre-training stage, a CNN is trained with multi-instance learning to discover the most discriminative local patches and the uninformative ones in the training slices. In the boosting stage, the pre-trained CNN is further trained on the selected local patches to strengthen the image classifier. The highlight of this multi-instance deep learning method is that the discriminative and the uninformative local patches are identified automatically, so no manual annotation is required in advance.

The use of regression networks in medical imaging is still uncommon. Miao et al. proposed a CNN-based regression approach to achieve real-time 2D/3D registration. They introduced three algorithmic strategies to simplify the underlying mapping to be regressed and to strengthen the nonlinear modeling capability of the CNN regression model. Their results show that the deep learning approach is more accurate and more robust than the previous state-of-the-art method and greatly accelerates intensity-based 2D/3D registration.
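
As a loose illustration of CNN regression (not Miao et al.'s hierarchical scheme), the network's final layer can directly output the six rigid transformation parameters and be trained with a mean-squared-error loss rather than a softmax. The 2-channel input (e.g., the X-ray and a rendering of the 3D model) and the layer sizes are assumptions.

```python
import torch
import torch.nn as nn

# Minimal CNN regressor: the input is assumed to be a 2-channel image pair, the output a
# 6-vector of rigid transform parameters (3 rotations, 3 translations).
regressor = nn.Sequential(
    nn.Conv2d(2, 16, 5, stride=2, padding=2), nn.ReLU(),
    nn.Conv2d(16, 32, 5, stride=2, padding=2), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(32, 6),                          # regression output, no softmax
)

x = torch.randn(4, 2, 128, 128)                # a batch of image pairs (placeholder data)
target = torch.randn(4, 6)                     # ground-truth transform parameters (placeholder)
loss = nn.functional.mse_loss(regressor(x), target)
```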

We are still exploring which areas neural networks can be applied to and where they will have a lasting impact on applications and task definitions. In a groundbreaking study, Golkov et al. [35] present preliminary results suggesting that deep learning can reduce diffusion MRI (magnetic resonance imaging) processing to a single optimized step. Their work shows that this allows scalar measures from advanced diffusion models to be obtained with a twelvefold reduction in scan time and that abnormalities can be detected without using diffusion models. Revealing the relationship between the diffusion-weighted signal and microstructural tissue properties is worth investigating, and Golkov et al. [35] argue that deep neural networks may reveal such a relationship: the diffusion-weighted images (DWIs) can be used directly as input instead of scalar measures obtained by model fitting. The study shows that voxel-wise microstructure prediction and automatic, model-free image segmentation from DWI values are possible when the model is trained on healthy tissue and on MS lesions. Diffusion kurtosis measures were estimated from only 12 data points and neurite orientation dispersion and density measures from only 8 data points. This promises a fast and robust approach for clinical studies and shows that standard data processing pipelines can be simplified with deep learning.

Discussion: Key issues and outlook

Much existing work shows that deep networks advance the state of the art, and these improvements are consistent across many areas. In most cases the progress provided by a deep learning solution is relatively straightforward to obtain, and we can see this clear progress in medical image computing. In the overview article "Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New Technique," several questions were raised: the 2012 large-scale visual recognition challenge saw an improvement of roughly 10 percentage points; can a similarly substantial leap be achieved here? Are we asking the right questions and exploring in the right directions? Is the imagery used appropriate (e.g., 2D or 3D)? Do we need more data per medical case, or is switching to deep learning already enough? More related questions were raised in Section II of this article, and most of them remain open.

In this issue it can be seen that, although both supervised and unsupervised learning are possible with deep networks, most of the work uses supervised learning. What does this mean for medicine? The amount of available data is a key factor in deciding how to combine the advantages of supervised and unsupervised learning. Because large, manually annotated datasets are difficult to obtain in the medical field, the field is likely to need more semi-supervised and unsupervised learning.

This issue includes many network architectures, and the variability across the published papers is large. Variability arises from choosing a known architecture, designing a dedicated architecture for a task, ensembling across architectures, and so on. An interesting question in this regard: if a very deep residual network of 152 layers performed best on the ILSVRC 2015 classification task, can it also be applied to medical problems with good results?

A very important aspect of deep learning is that it benefits from large amounts of training data. The great breakthrough in computer vision came with the ILSVRC competitions based on the ImageNet dataset. The datasets used there are far larger (on the order of a million training images) than the training and test sets used in the papers in this issue. If we could build similarly large collections of public medical image data, our community would benefit greatly.

Why is this so challenging? First, it is difficult to obtain funding for the construction of such a dataset. Second, high-quality annotation of medical image data requires medical expertise, which is scarce and expensive. Third, privacy issues make medical data more difficult to share than natural images. Fourth, the breadth of medical imaging requires many different datasets to be collected. Despite these obstacles, rapid progress is being made in data collection and data sharing, and many public datasets have been released and are now in use, for example VISCERAL and The Cancer Imaging Archive. The CT scans with enlarged lymph nodes analyzed by Roth et al. [13] and Shin et al. [17] have been published in The Cancer Imaging Archive, and the same group has also released a pancreas dataset online.

Since 2007 it has become customary to hold challenge workshops at medical imaging conferences such as MICCAI, ISBI, and SPIE Medical Imaging, and a large number of datasets and ongoing benchmark studies are available online. Using these common benchmark datasets has clear advantages over merely using public data: a challenge provides a precise definition of the task to be solved and one or more evaluation metrics, giving a fair basis for comparing different algorithms. Without such evaluation criteria it is difficult to compare different methods for the same problem, even when every algorithm uses the same data. For example, three of the studies in this issue (Anthimopoulos et al. [16], Shin et al. [17], and van Tulder et al. [18]) used the same annotated chest CT dataset of interstitial lung disease, but they reported their results differently.

In this regard, one study in this issue (Setio et al. [12]) has already seen initial adoption in a challenge on pulmonary nodule detection, organized in conjunction with the IEEE ISBI conference and using the public LIDC/IDRI dataset, so the system proposed in that article can be compared directly with alternative approaches.

Last year saw the first medical image analysis competitions with substantial prize money. Kaggle organized a competition on detecting diabetic retinopathy in color fundus images with $100,000 in prize money; 661 teams submitted results, and a total of 8,000 images were provided. These data were also used in one of the studies in this issue (van Grinsven et al. [24]). Recently, a second competition, on measuring cardiac volumes and ejection fraction from MRI, concluded; 192 teams competed for $200,000 in prize money. In both competitions the top entries used convolutional neural networks, and among the better-performing algorithms, contestants combining large datasets with deep learning showed a clear advantage. We hope this trend continues; in that case, follow-up worldwide competitions aimed at improving the accuracy of various imaging-based cancer screenings may attract wide attention.

Research by Albarqouni et al. shows that online platforms, such as those used for competitions, can serve multiple purposes: they foster new collaborations, help crystallize solutions, and make it possible to obtain large amounts of annotated data through crowdsourcing.
