Objective To develop a deep learning system for CT images to assist in the diagnosis of thoracolumbar fractures and analyze the feasibility of its clinical application. Methods Collected from West China Hospital of Sichuan University from January 2019 to March 2020, a total of 1256 CT images of thoracolumbar fractures were annotated with a unified standard through the Imaging LabelImg system. All CT images were classified according to the AO Spine thoracolumbar spine injury classification. The deep learning system in diagnosing ABC fracture types was optimized using 1039 CT images for training and validation, of which 1004 were used as the training set and 35 as the validation set; the rest 217 CT images were used as the test set to compare the deep learning system with the clinician’s diagnosis. The deep learning system in subtyping A was optimized using 581 CT images for training and validation, of which 556 were used as the training set and 25 as the validation set; the rest 104 CT images were used as the test set to compare the deep learning system with the clinician’s diagnosis. Results The accuracy and Kappa coefficient of the deep learning system in diagnosing ABC fracture types were 89.4% and 0.849 (P<0.001), respectively. The accuracy and Kappa coefficient of subtyping A were 87.5% and 0.817 (P<0.001), respectively. Conclusions The classification accuracy of the deep learning system for thoracolumbar fractures is high. This approach can be used to assist in the intelligent diagnosis of CT images of thoracolumbar fractures and improve the current manual and complex diagnostic process.
The diagnosis of hypertrophic cardiomyopathy (HCM) is of great significance for the early risk classification of sudden cardiac death and the screening of family genetic diseases. This research proposed a HCM automatic detection method based on convolution neural network (CNN) model, using single-lead electrocardiogram (ECG) signal as the research object. Firstly, the R-wave peak locations of single-lead ECG signal were determined, followed by the ECG signal segmentation and resample in units of heart beats, then a CNN model was built to automatically extract the deep features in the ECG signal and perform automatic classification and HCM detection. The experimental data is derived from 108 ECG records extracted from three public databases provided by PhysioNet, the database established in this research consists of 14,459 heartbeats, and each heartbeat contains 128 sampling points. The results revealed that the optimized CNN model could effectively detect HCM, the accuracy, sensitivity and specificity were 95.98%, 98.03% and 95.79% respectively. In this research, the deep learning method was introduced for the analysis of single-lead ECG of HCM patients, which could not only overcome the technical limitations of conventional detection methods based on multi-lead ECG, but also has important application value for assisting doctor in fast and convenient large-scale HCM preliminary screening.
Colorectal cancer (CRC) is a common malignant tumor that seriously threatens human health. CRC presents a formidable challenge in terms of accurate identification due to its indistinct boundaries. With the widespread adoption of convolutional neural networks (CNNs) in image processing, leveraging CNNs for automatic classification and segmentation holds immense potential for enhancing the efficiency of colorectal cancer recognition and reducing treatment costs. This paper explores the imperative necessity for applying CNNs in clinical diagnosis of CRC. It provides an elaborate overview on research advancements pertaining to CNNs and their improved models in CRC classification and segmentation. Furthermore, this work summarizes the ideas and common methods for optimizing network performance and discusses the challenges faced by CNNs as well as future development trends in their application towards CRC classification and segmentation, thereby promoting their utilization within clinical diagnosis.
Cardiovascular disease (CVD) is one of the leading causes of death worldwide. Heart sound classification plays a key role in the early detection of CVD. The difference between normal and abnormal heart sounds is not obvious. In this paper, in order to improve the accuracy of the heart sound classification model, we propose a heart sound feature extraction method based on bispectral analysis and combine it with convolutional neural network (CNN) to classify heart sounds. The model can effectively suppress Gaussian noise by using bispectral analysis and can effectively extract the features of heart sound signals without relying on the accurate segmentation of heart sound signals. At the same time, the model combines with the strong classification performance of convolutional neural network and finally achieves the accurate classification of heart sound. According to the experimental results, the proposed algorithm achieves 0.910, 0.884 and 0.940 in terms of accuracy, sensitivity and specificity under the same data and experimental conditions, respectively. Compared with other heart sound classification algorithms, the proposed algorithm shows a significant improvement and strong robustness and generalization ability, so it is expected to be applied to the auxiliary detection of congenital heart disease.
Pneumoconiosis ranks first among the newly-emerged occupational diseases reported annually in China, and imaging diagnosis is still one of the main clinical diagnostic methods. However, manual reading of films requires high level of doctors, and it is difficult to discriminate the staged diagnosis of pneumoconiosis imaging, and due to the influence of uneven distribution of medical resources and other factors, it is easy to lead to misdiagnosis and omission of diagnosis in primary healthcare institutions. Computer-aided diagnosis system can realize rapid screening of pneumoconiosis in order to assist clinicians in identification and diagnosis, and improve diagnostic efficacy. As an important branch of deep learning, convolutional neural network (CNN) is good at dealing with various visual tasks such as image segmentation, image classification, target detection and so on because of its characteristics of local association and weight sharing, and has been widely used in the field of computer-aided diagnosis of pneumoconiosis in recent years. This paper was categorized into three parts according to the main applications of CNNs (VGG, U-Net, ResNet, DenseNet, CheXNet, Inception-V3, and ShuffleNet) in the imaging diagnosis of pneumoconiosis, including CNNs in pneumoconiosis screening diagnosis, CNNs in staging diagnosis of pneumoconiosis, and CNNs in segmentation of pneumoconiosis foci to conduct a literature review. It aims to summarize the methods, advantages and disadvantages, and optimization ideas of CNN applied to the images of pneumoconiosis, and to provide a reference for the research direction of further development of computer-aided diagnosis of pneumoconiosis.
The processing mechanism of the human brain for speech information is a significant source of inspiration for the study of speech enhancement technology. Attention and lateral inhibition are key mechanisms in auditory information processing that can selectively enhance specific information. Building on this, the study introduces a dual-branch U-Net that integrates lateral inhibition and feedback-driven attention mechanisms. Noisy speech signals input into the first branch of the U-Net led to the selective feedback of time-frequency units with high confidence. The generated activation layer gradients, in conjunction with the lateral inhibition mechanism, were utilized to calculate attention maps. These maps were then concatenated to the second branch of the U-Net, directing the network’s focus and achieving selective enhancement of auditory speech signals. The evaluation of the speech enhancement effect was conducted by utilising five metrics, including perceptual evaluation of speech quality. This method was compared horizontally with five other methods: Wiener, SEGAN, PHASEN, Demucs and GRN. The experimental results demonstrated that the proposed method improved speech signal enhancement capabilities in various noise scenarios by 18% to 21% compared to the baseline network across multiple performance metrics. This improvement was particularly notable in low signal-to-noise ratio conditions, where the proposed method exhibited a significant performance advantage over other methods. The speech enhancement technique based on lateral inhibition and feedback-driven attention mechanisms holds significant potential in auditory speech enhancement, making it suitable for clinical practices related to artificial cochleae and hearing aids.
Objective To establish a machine learning-based risk prediction model of combined chronic obstructive pulmonary disease (COPD) with lung cancer, so as to explore the high risk factors for COPD patients with lung cancer and to lay the foundation for early detection of lung cancer risk in COPD patients. Methods A total of 154 patients from the Second Hospital of Dalian Medical University from 2010 to 2021 were retrospectively analyzed, including 99 patients in the COPD group and 55 patients in the COPD with lung cancer group. the chest high resolution computed tomography (HRCT) scans and pulmonary function test of each patient were acquired. The main analyses were as follow: (1) to valid the statistically differences of the basic information (such as age, body mass index, smoking index), laboratory test results, pulmonary function parameters and quantitative parameters of chest HRCT between the two groups; (2) to analyze the indicators of high risk factors for lung cancer in COPD patients using univariate and binary logistic regression (LR) methods; and (3) to establish the machine learning model (such as LR and Gaussian process) for COPD with lung cancer patients. Results Based on the statistical analysis and LR methods, decreased BMI, increased whole lung emphysema index, increased whole lung mean density, and increased percentage activity of exertional spirometry and prothrombin time were risk factors for COPD with lung cancer patients. Based on the machine learning prediction model for COPD with lung cancer patients, the area under the receiver operating characteristic curve for LR and Gaussian process were obtained as 0.88 using the soluble fragments of prothrombin time percentage activity, whole lung emphysema index, whole lung mean density, and forced vital capacity combined with neuron-specific enolase and cytokeratin 19 as features. Conclusion The prediction model of COPD with lung cancer patients using a machine learning approach can be used for early detection of lung cancer risk in COPD patients.
The effective classification of multi-task motor imagery electroencephalogram (EEG) is helpful to achieve accurate multi-dimensional human-computer interaction, and the high frequency domain specificity between subjects can improve the classification accuracy and robustness. Therefore, this paper proposed a multi-task EEG signal classification method based on adaptive time-frequency common spatial pattern (CSP) combined with convolutional neural network (CNN). The characteristics of subjects' personalized rhythm were extracted by adaptive spectrum awareness, and the spatial characteristics were calculated by using the one-versus-rest CSP, and then the composite time-domain characteristics were characterized to construct the spatial-temporal frequency multi-level fusion features. Finally, the CNN was used to perform high-precision and high-robust four-task classification. The algorithm in this paper was verified by the self-test dataset containing 10 subjects (33 ± 3 years old, inexperienced) and the dataset of the 4th 2018 Brain-Computer Interface Competition (BCI competition Ⅳ-2a). The average accuracy of the proposed algorithm for the four-task classification reached 93.96% and 84.04%, respectively. Compared with other advanced algorithms, the average classification accuracy of the proposed algorithm was significantly improved, and the accuracy range error between subjects was significantly reduced in the public dataset. The results show that the proposed algorithm has good performance in multi-task classification, and can effectively improve the classification accuracy and robustness.
Deep learning method can be used to automatically analyze electrocardiogram (ECG) data and rapidly implement arrhythmia classification, which provides significant clinical value for the early screening of arrhythmias. How to select arrhythmia features effectively under limited abnormal sample supervision is an urgent issue to address. This paper proposed an arrhythmia classification algorithm based on an adaptive multi-feature fusion network. The algorithm extracted RR interval features from ECG signals, employed one-dimensional convolutional neural network (1D-CNN) to extract time-domain deep features, employed Mel frequency cepstral coefficients (MFCC) and two-dimensional convolutional neural network (2D-CNN) to extract frequency-domain deep features. The features were fused using adaptive weighting strategy for arrhythmia classification. The paper used the arrhythmia database jointly developed by the Massachusetts Institute of Technology and Beth Israel Hospital (MIT-BIH) and evaluated the algorithm under the inter-patient paradigm. Experimental results demonstrated that the proposed algorithm achieved an average precision of 75.2%, an average recall of 70.1% and an average F1-score of 71.3%, demonstrating high classification accuracy and being able to provide algorithmic support for arrhythmia classification in wearable devices.
Fetal electrocardiogram (ECG) signals provide important clinical information for early diagnosis and intervention of fetal abnormalities. In this paper, we propose a new method for fetal ECG signal extraction and analysis. Firstly, an improved fast independent component analysis method and singular value decomposition algorithm are combined to extract high-quality fetal ECG signals and solve the waveform missing problem. Secondly, a novel convolutional neural network model is applied to identify the QRS complex waves of fetal ECG signals and effectively solve the waveform overlap problem. Finally, high quality extraction of fetal ECG signals and intelligent recognition of fetal QRS complex waves are achieved. The method proposed in this paper was validated with the data from the PhysioNet computing in cardiology challenge 2013 database of the Complex Physiological Signals Research Resource Network. The results show that the average sensitivity and positive prediction values of the extraction algorithm are 98.21% and 99.52%, respectively, and the average sensitivity and positive prediction values of the QRS complex waves recognition algorithm are 94.14% and 95.80%, respectively, which are better than those of other research results. In conclusion, the algorithm and model proposed in this paper have some practical significance and may provide a theoretical basis for clinical medical decision making in the future.