Prediction of significant congenital heart disease in infants and children using continuous wavelet transform and deep convolutional neural network with 12-lead electrocardiogram

Lee, Yu-Shin; Chung, Hung-Tao; Lin, Jainn-Jim; Hwang, Mao-Sheng; Liu, Hao-Chuan; Hsu, Hsin-Mao; Chang, Ya-Ting; Peng, Syu-Jyun

doi:10.1186/s12887-025-05628-2

Research
Open access
Published: 24 April 2025

Prediction of significant congenital heart disease in infants and children using continuous wavelet transform and deep convolutional neural network with 12-lead electrocardiogram

Yu-Shin Lee^1,2,
Hung-Tao Chung¹,
Jainn-Jim Lin³,
Mao-Sheng Hwang¹,
Hao-Chuan Liu¹,
Hsin-Mao Hsu¹,
Ya-Ting Chang¹ &
…
Syu-Jyun Peng^2,4

BMC Pediatrics volume 25, Article number: 324 (2025) Cite this article

525 Accesses
Metrics details

Abstract

Background

Congenital heart disease (CHD) affects approximately 1% of newborns and is a leading cause of mortality in early childhood. Despite the importance of early detection, current screening methods, such as pulse oximetry and auscultation, have notable limitations, particularly in identifying non-cyanotic CHD. (AI)-assisted electrocardiography (ECG) analysis offers a cost-effective alternative to conventional CHD detection. However, most existing models have been trained on older children, limiting their generalizability to infants and young children. This study developed an AI model trained on real-world ECG data for the detection of hemodynamically significant CHD in children under five years of age.

Methods

ECG data was retrospectively collected from 1,035 patients under five years old at Chang Gung Memorial Hospital, Taoyuan, Taiwan (2013–2020). Based on ECG findings, patients were categorized into the following groups: normal heart structure (NOR), non-significant right heart disease (RHA), significant right heart disease (RHB), non-significant left heart disease (LHA), and significant left heart disease (LHB). ECG signals underwent preprocessing using continuous wavelet transformation and segmentation into 2-s intervals for data augmentation. Transfer learning was applied using three pre-trained deep learning models: ResNet- 18, InceptionResNet-V2, and NasNetMobile. Model performance was evaluated in terms of accuracy, sensitivity, specificity, F1 score, and area under the receiver operating characteristic curve (AUC).

Results

Among the tested models, the model based on ResNet-18 demonstrated the best overall performance in predicting clinically significant CHD, achieving accuracy of 73.9%, an F1 score of 75.8%, and an AUC of 81.0% in differentiating significant from non-significant CHD. InceptionResNet-V2 performed well in detecting left heart disease but was computationally intensive. The proposed AI model significantly outperformed conventional ECG interpretation by pediatric cardiologists (accuracy 67.1%, sensitivity 71.6%).

Conclusions

This study highlights the potential of AI-assisted ECG analysis for CHD screening in young children. The ResNet-18-based model outperformed conventional ECG evaluation, suggesting its feasibility as a supplementary tool for early CHD detection. Future studies should focus on multi-center validation, inclusion of more CHD subtypes, and integration with other screening modalities to improve diagnostic accuracy and clinical applicability.

Peer Review reports

Background

Roughly 1% of all newborns present with congenital heart disease (CHD) [1, 2], and most CHD-related mortalities occur before the age of five, making early detection crucial. Echocardiography is the gold standard for diagnosing CHD; however, it is an expensive procedure requiring highly trained personnel. Several methods have been devised for CHD detection. Pulse oximetry performed at 24 h after birth is a cost-effective alternative for the screening of critical cyanotic congenital heart conditions that require intervention [3]. One meta-analysis reported that this method achieves sensitivity of 76.3% with specificity of 99.9% and a false positive rate of 0.14% [4]. However, this method fails to detect common noncyanotic CHDs, such as ventricular septal defect (VSD) and patent ductus arteriosus (PDA) [3]. Left heart disease can have a profound impact on hemodynamics, leading to early heart failure. Companion screening methods are required to enhance sensitivity to noncyanotic CHDs.

Auscultation for the detection of heart murmurs is the method most commonly used by pediatricians in screening for major CHDs. Detecting heart murmurs of > grade 2 has been shown to yield sensitivity of 89.6% with specificity of 97.3% and a false positive rate of 2.7% [5]. While heart murmur analysis is effective in dealing with most CHDs associated with left heart disease, it is far less effective in detecting CHDs associated with right heart disease, such as atrial septal defects, which often do not produce heart murmurs [6]. Moreover, access to trained pediatricians is often limited to major medical centers, and the associated costs are very high. There is a demand for screening tools applicable to a broader range of CHDs.

Researchers have made significant strides in applying artificial intelligence (AI) to the interpretation of phonocardiograms (PCG) for the detection of common CHDs. Some models have achieved sensitivity of 99.0% with specificity of 98.0% and a false positive rate of 2.0% [7]. However, these preliminary studies were based on well-prepared datasets, raising concerns about the practical applicability of this method under real-world conditions.

Electrocardiogram (ECG) analysis is another method commonly used for the screening of congenital heart disease. This affordable method provides objective measurements of electrical activity, avoiding the subjective interpretation of indistinct indicators by clinicians. ECG also enables the detection of conditions that do not produce an audible heart sound, such as atrial septal defect (ASD), where ECG indicators occur earlier than heart murmurs.

Table 1 lists representative studies on the screening of congenital heart disease [4, 7,8,9,10,11,12,13]. One AI model demonstrated good performance in detecting hemodynamic atrial septal defect (Qp/Qs > 1.5) with sensitivity of 76%, specificity of 96% and a false positive rate rate of 2.0% in school-aged children [8]. Another AI model trained using a large ECG database for the detection of CHD demonstrated sensitivity of 74.7% and specificity of 94.1% [14].

Table 1 Representative literature on the screening of congenital heart disease

Full size table

It is important to note that most of the training data assembled for these models was from school-aged children and adolescents [12]. Moreover, many of these studies did not include demographic data or address the issue of hemodynamic significance.

There is a pressing need for affordable and accessible screening methods for the detection of non-critical, but clinically significant CHDs in early childhood. This study trained an AI model using real-world ECG data to detect the presence of clinically significant CHDs in children under the age of five.

Participants and methods

Data sources

Patient data were collected retrospectively from Chang Gung Memorial Hospital, Taiwan. The study included patients under five years old who were diagnosed with specific CHDs, including atrial septal defect (ASD), ventricular septal defect (VSD), patent ductus arteriosus (PDA), pulmonary stenosis (PS), aortic stenosis (AS), coarctation of the aorta (CoA), and Tetralogy of Fallot (TOF) between January 2013 and December 2020. Data were also collected from patients under five years old who had normal heart structure (confirmed by ECG) and visited the outpatient department between December 2020 and March 2021.

Patient grouping process

Initial patient enrollment (Fig. 1) was followed by classification based on the presence of congenital heart disease (CHD). Patients without CHD were categorized into the NOR (normal) group. Those diagnosed with CHD underwent further stratification based on electrocardiographic (ECG) signals. Patients with CHDs causing left ventricular hypertrophy (LVH) were classified under Left Heart Disease (LHD), while those with patients with CHDs that causing right ventricular hypertrophy (RVH) were categorized as Right Heart Disease (RHD). Within these groups, disease severity was further assessed according to clinical significance. The criteria for determining clinical significance are outlined in Table 2.

Table 2 Significance criteria for common congenital heart diseases

Full size table

Below, we present the criteria used for classifying congenital heart diseases according to severity and assessing clinical significance:

Atrial Septal Defect (ASD) [15]: Defects measuring < 5 mm were classified as small lesions with an 87% likelihood of spontaneous closure, typically managed by observation and regular follow-up.
Pulmonary Valve Stenosis (PS) [16]: Echocardiographic findings indicating a pulmonary artery flow velocity of < 3 m/s were categorized as mild pulmonary stenosis, typically managed by observation and follow-up.
Tetralogy of Fallot (TOF): A common cyanotic congenital heart disease with a significant impact on cardiac function and infant growth, necessitating corrective surgery.
Ventricular Septal Defect (VSD) [17]: Small defects (< 4 mm) are more likely to undergo spontaneous closure than are larger defects.
Patent Ductus Arteriosus (PDA) [18]: Small lesions (< 2 mm) can be treated via transcatheter coil occlusion. Larger lesions (> 2 mm) can have a significant impact on hemodynamics, requiring closure via transcatheter device placement.
Aortic Valve Stenosis (AS) [19]: An aortic flow velocity of > 3 m/s indicates moderate aortic stenosis, which can impair ventricular diastolic function.
Coarctation of the Aorta (CoA) [20]: A flow velocity of roughly 3 m/s corresponds to a pressure gradient of ~ 40 mmHg, which is associated with significant coarctation, as verified by cardiac catheterization and angiography.

Patients in the LHD group were divided into two subgroups: those with non-significant left heart disease (LHA) and those with significant left heart disease (LHB). Similarly, patients in the RHD group were classified into RHA (non-significant right heart disease) and RHB (significant right heart disease). The grouping methodology used in this study is illustrated in Fig. 1.

Model construction

To develop a machine learning model capable of detecting CHDs of clinical significance, we established the following classification frameworks based on ECG data:

1.
Right heart disease:
- ◦ Model 1: NOR vs. (RHA + RHB) – To predict the presence of total right heart disease (RHD).
- ◦ Model 2: (NOR + RHA) vs. RHB – To predict the presence of significant right heart disease (RHB).
2.
Left heart disease:
- ◦ Model 3: NOR vs. (LHA + LHB) – To predict the presence of total left heart disease (LHD).
- ◦ Model 4: (NOR + LHA) vs. LHB – To predict the presence of significant left heart disease (LHB).
3.
Significant CHD:
- ◦ Model 5: (NOR + RHA + LHA) vs. (RHB + LHB) To predict congenital heart disease of clinical significance.

Data acquisition

For each patient, 12-lead resting ECG signals were recorded using a GE MAC 5500 HD device (GE Healthcare, Chicago, Illinois, USA). ECGs were collected in a calm and resting state to minimize motion artifacts. Asmall group of senior technicians (> 20 years of experience) performed all ECG acquisitions to ensure consistency. ECG signals were recorded at a sampling rate of 500 Hz for 10 s and stored in XML format within the MUSE Cardiology Information System.

To facilitate analysis, the XML files were converted to CSV format using Python within the Anaconda Prompt environment (Austin, Texas, USA). The CSV files were subsequently imported into MATLAB 2022b (Natick, Massachusetts, USA), where they underwent continuous wavelet transformation (CWT) based on the Morlet wavelet to generate time–frequency spectrograms. The transformed spectral data were then saved in MAT format for further processing.

To maximize data utilization, each 10-s ECG recording was segmented into five 2-s overlapping segments, thereby increasing the number of samples. Each 2-s segment was represented as a 12 × 1000 matrix, with each row corresponding to one of the 12 ECG leads (I, II, III, aVR, aVL, aVF, V1, V2, V3, V4, V5, and V6) and each column representing a time point.

Each ECG segment was labeled according to patient classification (Normal, Non-significant Right Heart Disease, Significant Right Heart Disease, Non-significant Left Heart Disease, Significant Left Heart Disease). All data were cross-checked by pediatric cardiologists to ensure data quality and correct labeling. Preprocessed ECG data and spectrograms were stored in MAT format for model training and analysis.

This preprocessing workflow was meant to ensure high-quality spectral representations of ECG signals and optimize the data for machine learning analyses. Representative examples of ECG waveforms and their corresponding time–frequency spectrograms are illustrated in Fig. 2.

Signal preprocessing [21, 22]

To ensure high-quality ECG signal processing, the following pre-processing techniques were applied:

Signal pre-processing:

1.
Baseline wander removal:
- Baseline drift, typically caused by patient movement or respiration, was removed using a high-pass filter with a cutoff frequency of 0.5 Hz.
- This eliminated slow-varying components, while preserving diagnostically relevant QRS and ST segments.
2.
Powerline interference reduction:
- AC noise contamination due to powerline interference (60 Hz in Taiwan) was suppressed by applying a notch filter centered at 60 Hz.
3.
Low-pass filtering:
- High-frequency noise was attenuated using a low-pass filter with a cutoff frequency of 100 Hz to eliminate high-frequency artifacts while preserving ECG components of clinical relevance.
4.
Segmentation:
- Each 10-second ECG recording was divided into five non-overlapping 2-second segments to enhance data diversity and improve model training.
- Each segment retained the original 500 Hz sampling rate, forming a 12 × 1000 matrix (12 leads × 1000 time points).
5.
Continuous Wavelet Transform (CWT):

The CWT of a signal $x\;(t)$ is defined as follows:
$$W\left(a,b\right)={\int }_{-\infty }^{\infty }x\left(t\right){\psi }^{*}\left(\frac{t-b}{a}\right)dt$$

where $W\;(a,\;b)$ represents the wavelet coefficient at scale a and position b; $a\;>\;0$ is the scale parameter controlling the dilation or compression of the wavelet; $b\;\in\;\mathbb{R}$ is the translation parameter determining the shift along the time axis; and $\psi\;(t)$ is the mother wavelet function, and $\psi\ast\;(t)$ denotes its complex conjugate.

Morlet Wavelet (Mother Wavelet):

This study selected the Morlet wavelet due to its ability to balance time and frequency localization in the analysis of ECG signals. It is expressed as
$$\psi\;(t)\;=\;e^{j2\pi f_0t}e^\frac{-t^2}{{2\sigma}^2}$$

where $f_0$ is the central frequency of the wavelet; $\sigma$ determines the time-domain spread; and t represents time.
6.
Normalization:
- CWT spectrogram values were normalized to a range of [0, 1] to facilitate stable and efficient model training.
7.
Artifact removal:
- Segments with excessive noise or loss of lead contact (e.g., flatline or irregular spikes) were manually excluded by an experienced technician prior to feature extraction.

Model training

MATLAB 2022b (Natick, Massachusetts, USA) was utilized for model training. Transfer learning was implemented using three pre-trained convolutional neural networks (CNNs): ResNet- 18 [23], InceptionResNet-V2 [24], and NasNetMobile [25]. The third-to-last fully connected layer of each model underwent output size modification, and the final classification layer was replaced to accommodate the patient groups in this study.

To ensure a balanced dataset, 80% of the data were allocated to the training set, while the remaining 20% were designated as the test set. To maintain consistency across sets, we calculated the modulo of our data, ensuring that all ECG samples from the same patient were assigned to the same set.

Below, we outline the hyperparameter settings employed in the training of ResNet- 18, InceptionResNet-V2, and ASNetMobile as well as the rationale behind their selection.

Optimizer: We employed the Stochastic Gradient Descent with Momentum (SGDM) as an optimizer, as it provides a good balance between stability and convergence speed. It is widely used in medical image and signal processing tasks involving deep learning.
Initial learning rate = 0.001: This is a common setting for transfer learning, allowing fine-tuning of pre-trained models without drastic parameter updates.
Momentum = 0.9: This value was selected to accelerate convergence by dampening oscillations during gradient updates.
L2 regularization = 0.1: This value was selected to prevent overfitting, considering the relatively small size of our image dataset.
Minibatch size = 8: This value is meant to balance computational efficiency and convergence stability under GPU memory constraints.
Training = 5 epochs: We limited the number of training epochs because the models had been pre-trained on large datasets, and experiments revealed a plateau in performance after a few epochs, suggesting that prolonged training could lead to overfitting.
Validation: five-fold cross-validation was employed to ensure robust model performance without excessive dependence on any particular data partition.

Figure 3 Outlines the preprocessing and training workflow. All computation was performed using a custom-assembled workstation, equipped with an Intel Core i9 - 13900 K CPU, 128GB of RAM, and an Nvidia GeForce RTX 4090 GPU to ensure high-performance.

Model evaluation

Model performance was assessed by comparing the predicted results with clinical reports generated by a pediatric cardiologist. According to rule based criteria proposed by Society Guideline [26, 27], an ECG was classified as abnormal if the clinical report indicated the presence of atrial or ventricular hypertrophy.

Statistical analysis

After model training, confusion matrices were generated to assess the performance of our model using the following metrics: accuracy, sensitivity, specificity, F1 score, and area under the receiver operating characteristic curve (AUC). The performance metrics are based on four components from the confusion matrix used in binary classification: true positive (TP), true negative (TN), false positive (FP), and false negative (FN). The formulas used to derive the performance metrics are as follows:

$$\begin{array}{c}Accuracy=\frac{True\; positive\left(TP\right)+True\;negative(TN)}{TP+TN+false\;positive\left(FP\right)+false\;negative(FN)}\\Sensitivity=\frac{TP}{TP+FN}\\Specificity=\frac{TN}{TN+FP}\\F1score=\frac{2TP}{2TP+FP+FN}\end{array}$$

The results are reported as the mean, standard deviation, and 95% confidence interval derived from five-fold cross-validation. All statistical analysis was performed using MATLAB 2022b and Microsoft Excel.

Ethical approval

All methods in this study were performed in strict accordance with the Declaration of Helsinki. This study was approved by Institutional Review Board (IRB) of Chang Gung Medical Foundation (Reference number: 202102195B0). Due to retrospective design of this study, the IRB committee waived the need for participant consent.

Results

This study examined 1,035 patients aged 0 to 5, including 234 (22%) with normal heart structure (NOR), 100 patients (10%) with non-significant right heart disease (RHA), 291 patients (28%) with significant right heart disease (RHB), 141 patients (14%) with non-significant left heart disease (LHA), and 269 patients (26%) with significant left heart disease (LHB).

The age distribution was as follows: 0 years old (416 patients; 40.2%), 1 year (249 patients; 24.1%), 2 years (164 patients; 15.8%), 3 years (99 patients; 9.6%), and 4 years (107 patients; 10.3%). Table 3 lists the distribution of case numbers and the mean age in each group.

Table 3 Patient distribution and mean age by group

Full size table

The mean age varied across groups, with the highest mean age in the RHB group (24.4 ± 18.8 months, 95% CI: 22.2–26.6 months) and the lowest mean age in the LHB group (16.1 ± 15.2 months, 95% CI: 14.3–18.0 months). The NOR and LHA groups presented similar age distributions, while the mean age in the RHA group was slightly lower than in the NOR group (ANOVA P < 0.05). Figure 4 illustrates the distributions of heart disease types and ages, and Table 4 presents a detailed breakdown of heart defects in the dataset.

Table 4 Demographic data

Full size table

In detecting total right heart disease (NOR vs RHA + RHB), the model derived from InceptionResNet-V2 presented the best overall performance, with accuracy of 0.707 ± 0.027 (95% CI: 0.656—0.758), F1 score of 0.772 ± 0.035 (95% CI: 0.704—0.773), and AUC of 0.758 ± 0.027 (95% CI: 0.725—0.791). In detecting clinically significant right heart disease (NOR + RHA vs RHB), the model derived from ResNet- 18 presented the best performance, with accuracy of 0.789 ± 0.009 (95% CI: 0.778—0.799), F1 score of 0.770 ± 0.014 (95% CI: 0.704—0.773), and AUC of 0.852 ± 0.011 (95% CI: 0.838—0.866).

In detecting total left heart disease (NOR vs LHA + LHB), the model derived from InceptionResNet-V2 presented the best performance, with accuracy of 0.710 ± 0.011 (95% CI: 0.697—0.724), F1 score of 0.737 ± 0.008 (95% CI: 0.727—0.747), and AUC of 0.802 ± 0.019 (95% CI: 0.802 ± 0.019). In detecting significant left heart disease (NOR + LHA vs LHB), the model derived from InceptionResNet-V2 presented the best performance, with accuracy of 0.744 ± 0.033 (95% CI: 0.704—0.785), F1 score of 0.695 ± 0.035 (95% CI: 0.652—0.738), and AUC of 0.816 ± 0.035 (95% CI: 0.773—0.859). The results are detailed in Table 5.

Table 5 Impact of significance in prediction of CHD: results of transfer learning using various pre-trained models

Full size table

To simulate the conditions typically encountered in daily practice, we combined right and left heart diseases into one group. In this analysis, the model derived from ResNet- 18 achieved the best performance, with accuracy of 0.739 ± 0.012 (95% CI: 0.724—0.753), F1 score of 0.758 ± 0.015 (95% CI: 0.740—0.776), and AUC of 0.810 ± 0.013 (0.794—0.825). Figure 5 presents a boxplot for these results of five-fold cross-validation. Comprehensive loss curves are presented in Supplementary Fig. 1.

The average elapsed time for model training was as follows: InceptionResNet V2 (33 min 12 secs), ResNet- 18 (7 min 27 secs, and NasNetMobile (51 min 46 secs). These results are detailed in Table 6.

Table 6 Result of clinically-applicable model (NOR + RHA + LHA vs. RHB + LHB)

Full size table

The performance metrics in the ECG reports generated by a pediatric cardiologist for the current dataset were as follows: accuracy 0.671, sensitivity 0.716, specificity 0.648, F1 score 0.702. Overall, the proposed AI model proved superior to current best practices in screening for clinically significant congenital heart disease based on ECG vector changes.

Discussion

This study demonstrated the application of deep learning model derived from ResNet- 18 for the classification of congenital heart disease (CHD) based on ECG data, achieving a good balance between performance and computational efficiency. Notably, our ResNet- 18 model outperformed InceptionResNet-V2 in overall performance due to its efficient architecture and generalizability across CHD subtypes. Our findings demonstrate the potential utility of AI-enhanced ECG interpretation as a screening tool for hemodynamically significant CHD in infants and young children.

Scope of CHD inclusion and model generalizability

A universal screening tool should ideally detect all forms of CHD, including rare but critical conditions (e.g., single ventricle defects, hypoplastic left heart syndrome, Ebstein’s anomaly). However, our primary focus was on the detection of hemodynamically significant acyanotic CHD in infants and young children—a common but underdiagnosed subgroup—due to the following reasons:

Clinical prevalence: Left-to-right shunt lesions (e.g., ventricular septal defect, atrial septal defect, patent ductus arteriosus) and obstructive lesions (e.g., coarctation of the aorta, aortic stenosis) constitute the majority of CHD cases in this age group.
Screening utility: Acyanotic but hemodynamically significant defects often evade detection by pulse oximetry screening, as their initial presentation tends to subtle or asymptomatic. Delayed diagnosis increases the risk of heart failure.
Common ECG features: The ECG abnormalities observed in both common and rare congenital heart diseases stem primarily from vector changes. In right heart disease, these changes manifest as right axis deviation and right ventricular hypertrophy, whereas in left heart disease, they appear as left axis deviation and left ventricular hypertrophy.

The authors suspect that our AI model could detect even rare, life-threatening congenital heart diseases, provided they exhibit pronounced ECG changes. The generalizability of the model could likely be improved by increasing the number of normal ECGs beyond the 234 included in this study. Furthermore, a prospective study incorporating ECGs from healthy pediatric patients would better reflect real-world screening populations.

Performance of proposed model in differentiating left and right heart disease

The model derived from ResNet- 18 demonstrated the best overall performance in detecting right heart disease, which is consistent with previous studies that used ECG to detect right ventricular hypertrophy. However, its performance was significantly lower when applied to left heart disease. This suboptimal performance may be attributed to developmental factors, such as the rapid increase in left ventricular (LV) mass early in life, which can obscure ECG markers of left heart disease. Another possible explanation is the difficulty in feature extraction, as detection of left heart disease traditionally relies on echocardiographic parameters, and ECG findings—such as left axis deviation—are often subtle and lack reliability.

Detection of ECG abnormalities

One previous study on the detection of ventricular hypertrophy in right heart diseases reported accuracy of 0.78, which is comparable to the performance of our ResNet- 18 model (0.79). Another study using ECG rule-based criteria for detecting left ventricular hypertrophy achieved relatively low accuracy (0.65–0.75) [28]. The performance of our AI-based method was also comparable to a prior study that employed similar techniques for detecting atrial septal defect (ASD), achieving an AUC of 0.88 [8].

Rule-based ECG classifications frequently fall within the normal range, making them less reliable for ruling out congenital heart disease (CHD). As a result, it is unreasonable to expect clinicians to diagnose CHD solely based on ECG findings. In this study, left ventricular hypertrophy (LVH) and right ventricular hypertrophy (RVH) were used as benchmarks to ensure alignment with standard clinical practice.

Model design, hyperparameters, and oversampling strategy

In this study, ResNet- 18 outperformed InceptionResNet-V2 and NasNetMobile, due to its efficient residual connections and relatively low computational complexity. Many studies on the application of AI techniques to the interpretation of ECG data employ pretrained models utilizing residual networks [12, 29]. In the current study, ResNet- 18 achieved accuracy on par with deeper models, such as ResNet- 50 and ResNet- 101, while requiring significantly less training time, making it a better alternative for real-world deployment.

The pre-trained Inception model, developed by Google Research, has been previously applied to AI research on CHD [7, 30]. In this study, we selected its latest iteration, which emphasizes residual connections for enhanced performance. We also utilized the NasNetMobile pre-trained model, which differs from manually engineered architectures by employing neural architecture search (NAS) to optimize network design. NasNetMobile is specifically tailored for mobile devices, ensuring efficient performance in low-power environments, such as medical offices where computational resources tend to be limited.

CHD datasets often suffer from class imbalances due to the low prevalence of certain conditions. To address this issue, we implemented an oversampling strategy, in which each 10-s ECG recording was segmented into five 2-s segments. This segmentation approach allowed us to generate a more balanced dataset, reducing the risk of the model being biased toward the majority class.

To mitigate concerns regarding temporal dependencies, we applied Continuous Wavelet Transform (CWT), which captures information in both the time and frequency domains, minimizing potential distortion.

Future research will explore additional data augmentation techniques and alternative approaches to segmentation.

Clinical significance and future implications

It is important to consider that the expertise in pediatric cardiology tends to be concentrated in large medical centers. Thus, the proposed AI model could potentially expand CHD screening into resource-limited areas. AI-augmented ECG analysis could serve as a supplementary tool for early CHD detection, similar to the way pulse oximetry testing is used to screen newborn. Possible implementations include the following:

Telemedicine integration: AI models could facilitate remote ECG analysis, reducing the need for in-person evaluations by specialists.
Primary care utility: Clinicians without specialized training in cardiology could use AI-assisted ECG screening to identify at-risk infants who require echocardiography.
Integration with other modalities: Previous studies have demonstrated that a combination of AI-based ECG analysis with human intervention can enhance detection performance [31]. Moreover, chest X-ray (CXR) imaging has been used to evaluate the hemodynamic significance of CHD [32]. Integrating multiple modalities could lead to the development of a versatile screening models.

This study was subject to several limitations, which should be considered in the interpretation of our findings. First, the retrospective nature of this study and its reliance on single-center data highlight the need for future multi-center studies to verify the generalizability of the model. Moreover, the lack of an external test set means that further external validation will be required to assess its real-world applicability.

Another limitation is the inclusion of only a few rare CHD cases, which restricted the ability of the model to generalize across a broad spectrum of congenital heart conditions. Expanding the dataset to include more diverse CHD subtypes would improve model robustness. It is also important to consider the potential ECG acquisition bias, as differences in operator techniques and device settings may impact model performance. Future studies should evaluate the model using ECG records acquired by multiple operators under various operating conditions to ensure its reliability across different clinical settings.

Lastly, the influence of age-related changes on ECG interpretation suggests a need for further age-stratified analysis. Developing age-specific models could enhance diagnostic accuracy, particularly for younger patients with ECG markers that vary significantly with cardiac maturation.

Conclusion

This study marks a significant advancement in AI-assisted CHD screening for young children. Our ResNet- 18-based model demonstrated stable performance in the detection of hemodynamically significant CHD, effectively balancing accuracy and computational efficiency. The proposed AI-driven model outperformed conventional ECG-based screening methods that rely on rule-based criteria. When used as a complement to pulse oximetry screening in newborns, this approach could facilitate early detection of conditions requiring intervention, thereby reducing the risk of complications in children aged 0 to 5, a period of rapid cardiac development.

Data availability

Data collected in the current study are available from the corresponding author upon reasonable request.

Abbreviations

AS:: Aortic valve stenosis
ASD:: Atrial septal defect
CHD:: Congenital heart disease
CoA:: Coarctation of aorta.
ECG:: Electrocardiogram
LHA:: Non-significant left heart disease
LHB:: Significant left heart disease
NOR:: Normal heart structure
PDA:: Patent ductus arteriosus
PS:: Pulmonary valve stenosis
RHA:: Non-significant right heart disease
RHB:: Significant right heart disease
SGDM:: Stochastic gradient descent with momentum
TOF:: Tetralogy of Fallot
VSD:: Ventricular septal defect

References

Wu MH, Chen HC, Lu CW, Wang JK, Huang SC, Huang SK. Prevalence of congenital heart disease at live birth in Taiwan. J Pediatr. 2010;156(5):782–5.
Article PubMed Google Scholar
Yeh SJ, Chen HC, Lu CW, Wang JK, Huang LM, Huang SC, Huang SK, Wu MH. Prevalence, mortality, and the disease burden of pediatric congenital heart disease in Taiwan. Pediatr Neonatol. 2013;54(2):113–8.
Article PubMed Google Scholar
Engel MS, Kochilas LK. Pulse oximetry screening: a review of diagnosing critical congenital heart disease in newborns. Med Devices (Auckl). 2016;9:199–203.
PubMed Google Scholar
Plana MN, Zamora J, Suresh G, Fernandez-Pineda L, Thangaratinam S, Ewer AK. Pulse oximetry screening for critical congenital heart defects. Cochrane Database Syst Rev. 2018;3(3):Cd011912.
PubMed Google Scholar
Zhao QM, Niu C, Liu F, Wu L, Ma XJ, Huang GY. Accuracy of cardiac auscultation in detection of neonatal congenital heart disease by general paediatricians. Cardiol Young. 2019;29(5):679–83.
Article PubMed Google Scholar
Tanghöj G, Liuba P, Sjöberg G, Naumburg E. Predictors of the need for an atrial septal defect closure at very young age. Frontiers in Cardiovascular Medicine. 2020;6:185.
Article PubMed PubMed Central Google Scholar
Alkahtani HK, Haq IU, Ghadi YY, Innab N, Alajmi M, Nurbapa M. Precision diagnosis: an automated method for detecting congenital heart diseases in children from phonocardiogram signals employing deep neural network. IEEE Access. 2024;12:76053–64.
Article Google Scholar
Mori H, Inai K, Sugiyama H, Muragaki Y. Diagnosing atrial septal defect from electrocardiogram with deep learning. Pediatr Cardiol. 2021;42(6):1379–87.
Article PubMed Google Scholar
Lv J, Dong B, Lei H, Shi G, Wang H, Zhu F, Wen C, Zhang Q, Fu L, Gu X, et al. Artificial intelligence-assisted auscultation in detecting congenital heart disease. European Heart Journal - Digital Health. 2021;2(1):119–24.
Article PubMed PubMed Central Google Scholar
Xu W, Yu K, Ye J, Li H, Chen J, Yin F, Xu J, Zhu J, Li D, Shu Q. Automatic pediatric congenital heart disease classification based on heart sound signal. Artif Intell Med. 2022;126: 102257.
Article PubMed Google Scholar
Liu J, Wang H, Yang Z, Quan J, Liu L, Tian J. Deep learning-based computer-aided heart sound analysis in children with left-to-right shunt congenital heart disease. Int J Cardiol. 2022;348:58–64.
Article PubMed Google Scholar
Du Y, Huang S, Huang C, Maalla A, Liang H. Recognition of child congenital heart disease using electrocardiogram based on residual of residual network. In: 2020 IEEE International Conference on Progress in Informatics and Computing (PIC). 2020. p. 145–148.
Liu K, Bhalla JS, Anderson J, Niaz T, Anjewierden S, Attia ZI, Friedman PA, Madhavan M. Artificial intelligence algorithm for the detection of atrial septal defect using electrocardiogram. Journal of the American College of Cardiology. 2023;81(8_Supplement):2354–2354.
Article Google Scholar
Xu W, Yu K, Xu J, Ye J, Li H, Shu Q. Artificial intelligence technology in cardiac auscultation screening for congenital heart disease: present and future. Zhejiang Da Xue Xue Bao Yi Xue Ban. 2020;49(5):548–55.
PubMed PubMed Central Google Scholar
Radzik D, Davignon A, van Doesburg N, Fournier A, Marchand T, Ducharme G. Predictive factors for spontaneous closure of atrial septal defects diagnosed in the first 3 months of life. J Am Coll Cardiol. 1993;22(3):851–3.
Article CAS PubMed Google Scholar
Cuypers JAAE, Witsenburg M, Linde Dvd, Roos-Hesselink JW. Pulmonary stenosis: update on diagnosis and therapeutic options. Heart. 2013;99(5):339–47.
Zhao QM, Niu C, Liu F, Wu L, Ma XJ, Huang GY. Spontaneous closure rates of ventricular septal defects (6,750 consecutive neonates). Am J Cardiol. 2019;124(4):613–7.
Article PubMed Google Scholar
Fernández Ruiz A, del Cerro Marín MJ, Rubio Vidal D, Castro Gussoni MC, Moreno Granados F. Transcatheter closure of patent ductus arteriosus using the Amplatzer duct occluder: initial results and mid-term follow-up. Revista Española de Cardiología (English Edition). 2002;55(10):1057–62.
Article Google Scholar
Otto CM, Nishimura RA, Bonow RO, Carabello BA, Erwin JP, Gentile F, Jneid H, Krieger EV, Mack M, McLeod C, et al. 2020 ACC/AHA guideline for the management of patients with valvular heart disease: a report of the American College of Cardiology/American Heart Association Joint Committee on clinical practice guidelines. Circulation. 2021;143(5):e72–227.
Article PubMed Google Scholar
Carvalho JS, Redington AN, Shinebourne EA, Rigby ML, Gibson D. Continuous wave Doppler echocardiography and coarctation of the aorta: gradients and flow patterns in the assessment of severity. Br Heart J. 1990;64(2):133–7.
Article CAS PubMed PubMed Central Google Scholar
Desai U, Martis RJ, Nayak CG, Sarika K, Seshikala G. Machine intelligent diagnosis of ECG for arrhythmia classification using DWT, ICA and SVM techniques. In: 2015 Annual IEEE India Conference (INDICON). 2015. p. 1–4.
Desai U, Martis RJ, Gurudas Nayak C, Seshikala G, Sarika K, Shetty KR. Decision support system for arrhythmia beats using ECG signals with DCT, DWT AND EMD methods: a comparative study. Journal of Mechanics in Medicine and Biology. 2016;16(01):1640012.
Article Google Scholar
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR). 2016. p. 770–778.
Szegedy C, Ioffe S, Vanhoucke V, Alemi AA. Inception-ResNet and the impact of residual connections on learning. arXiv. 1602.07261.
Zoph B, Vasudevan V, Shlens J, Le QV. Learning transferable architectures for scalable image recognition. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Computer Society; 2018. p. 8697–8710.
Davignon A, Rautaharju P, Boisselle E, Soumis F, Mégélas M, Choquette A. Normal ECG standards for infants and children. Pediatr Cardiol. 1980;1(2):123–31.
Article Google Scholar
Hancock EW, Deal BJ, Mirvis DM, Okin P, Kligfield P, Gettes LS. AHA/ACCF/HRS recommendations for the standardization and interpretation of the electrocardiogram. JACC. 2009;53(11):992–1002.
Article PubMed Google Scholar
Bratincsák A, Kimata C, Limm-Chan BN, Vincent KP, Williams MR, Perry JC. Electrocardiogram standards for children and young adults using z-scores. Circulation: Arrhythmia and Electrophysiology. 2020;13(8): e008253.
PubMed Google Scholar
Pachiyannan P, Alsulami M, Alsadie D, Saudagar AKJ, AlKhathami M, Poonia RC. A novel machine learning-based prediction method for early detection and diagnosis of congenital heart disease using ECG signal processing. Technologies. 2024;12(1): 4.
Article Google Scholar
Jia H, Tang S, Guo W, Pan P, Qian Y, Hu D, Dai Y, Yang Y, Geng C, Lv H. Differential diagnosis of congenital ventricular septal defect and atrial septal defect in children using deep learning–based analysis of chest radiographs. BMC Pediatr. 2024;24(1):661.
Article PubMed PubMed Central Google Scholar
Huang Y, Zhong S, Zhang X, Kong L, Wu W, Yue S, Tian N, Zhu G, Hu A, Xu J, et al. Large scale application of pulse oximeter and auscultation in screening of neonatal congenital heart disease. BMC Pediatr. 2022;22(1):483.
Article CAS PubMed PubMed Central Google Scholar
Toba S, Mitani Y, Yodoya N, Ohashi H, Sawada H, Hayakawa H, Hirayama M, Futsuki A, Yamamoto N, Ito H, et al. Prediction of pulmonary to systemic flow ratio in patients with congenital heart disease using deep learning-based analysis of chest radiographs. JAMA Cardiol. 2020;5(4):449–57.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This work was partially supported by the National Science and Technology Council, Taiwan, under project number NSTC 112-2628-E-038-001-MY3, and partially supported by the Joint Research and Development Center of National Taipei University of Technology and Taipei Medical University, under project number 113TN10.

Author information

Authors and Affiliations

Division of Cardiology, Department of Pediatrics, Chang Gung Memoral Hospital Linkou Branch, Taoyuan, Taiwan
Yu-Shin Lee, Hung-Tao Chung, Mao-Sheng Hwang, Hao-Chuan Liu, Hsin-Mao Hsu & Ya-Ting Chang
In-Service Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, No.250, Wuxing St., Xinyi Dist., Taipei City, 110, Taiwan
Yu-Shin Lee & Syu-Jyun Peng
Division of Pediatric Intensive Care, Department of Pediatrics, Chang Gung Memorial Hospital, Linkou Branch, Taoyuan, Taiwan
Jainn-Jim Lin
Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei Medical University, Taipei, Taiwan
Syu-Jyun Peng

Authors

Yu-Shin Lee
View author publications
You can also search for this author inPubMed Google Scholar
Hung-Tao Chung
View author publications
You can also search for this author inPubMed Google Scholar
Jainn-Jim Lin
View author publications
You can also search for this author inPubMed Google Scholar
Mao-Sheng Hwang
View author publications
You can also search for this author inPubMed Google Scholar
Hao-Chuan Liu
View author publications
You can also search for this author inPubMed Google Scholar
Hsin-Mao Hsu
View author publications
You can also search for this author inPubMed Google Scholar
Ya-Ting Chang
View author publications
You can also search for this author inPubMed Google Scholar
Syu-Jyun Peng
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Acquisition of data: YSL, HTC, JJL, MSH, HCL, HMH, and YTC; Analysis and interpretation of data: YSL and SJP; Drafting the article: YSL and SJP; Critical revision of the manuscript for important intellectual content: YSL and SJP. All authors approved the final version of the manuscript submitted.

Corresponding author

Correspondence to Syu-Jyun Peng.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Ethics Committee of Chang Gung Memorial Hospital (Ethics Approval No. 202102195B0). The requirement for informed consent was waived owing to the retrospective observational nature of the study. The decision not to require informed consent was upheld by the Ethics Committee of the Chang Gung Memorial Hospital. All methods were conducted in strict accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

12887_2025_5628_MOESM1_ESM.png

Supplementary Material 1: Supplemental Figure 1. Representative loss curve for the prediction model utilizing the ResNet- 18 pre-trained model

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, YS., Chung, HT., Lin, JJ. et al. Prediction of significant congenital heart disease in infants and children using continuous wavelet transform and deep convolutional neural network with 12-lead electrocardiogram. BMC Pediatr 25, 324 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12887-025-05628-2

Download citation

Received: 21 December 2024
Accepted: 24 March 2025
Published: 24 April 2025
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12887-025-05628-2

Prediction of significant congenital heart disease in infants and children using continuous wavelet transform and deep convolutional neural network with 12-lead electrocardiogram

Abstract

Background

Methods

Results

Conclusions

Background

Participants and methods

Data sources

Patient grouping process

Model construction

Data acquisition

Signal preprocessing [21, 22]

Signal pre-processing:

Model training

Model evaluation

Statistical analysis

Ethical approval

Results

Discussion

Scope of CHD inclusion and model generalizability

Performance of proposed model in differentiating left and right heart disease

Detection of ECG abnormalities

Model design, hyperparameters, and oversampling strategy

Clinical significance and future implications

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

12887_2025_5628_MOESM1_ESM.png

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Pediatrics

Contact us