[HTML][HTML] Subgroups of children with Kawasaki disease: a data-driven cluster analysis

H Wang, C Shimizu, E Bainto, S Hamilton… - The Lancet Child & …, 2023 - thelancet.com
H Wang, C Shimizu, E Bainto, S Hamilton, HR Jackson, D Estrada-Rivadeneyra, M Kaforou
The Lancet Child & Adolescent Health, 2023thelancet.com
Summary Background Although Kawasaki disease is commonly regarded as a single
disease entity, variability in clinical manifestations and disease outcome has been
recognised. We aimed to use a data-driven approach to identify clinical subgroups. Methods
We analysed clinical data from patients with Kawasaki disease diagnosed at Rady
Children's Hospital (San Diego, CA, USA) between Jan 1, 2002, and June 30, 2022.
Patients were grouped by hierarchical clustering on principal components with k-means …
Background
Although Kawasaki disease is commonly regarded as a single disease entity, variability in clinical manifestations and disease outcome has been recognised. We aimed to use a data-driven approach to identify clinical subgroups.
Methods
We analysed clinical data from patients with Kawasaki disease diagnosed at Rady Children's Hospital (San Diego, CA, USA) between Jan 1, 2002, and June 30, 2022. Patients were grouped by hierarchical clustering on principal components with k-means parcellation based on 14 variables, including age at onset, ten laboratory test results, day of illness at the first intravenous immunoglobulin infusion, and normalised echocardiographic measures of coronary artery diameters at diagnosis. We also analysed the seasonality and Kawasaki disease incidence from 2002 to 2019 by subgroup. To explore the biological underpinnings of identified subgroups, we did differential abundance analysis on proteomic data of 6481 proteins from 32 patients with Kawasaki disease and 24 healthy children, using linear regression models that controlled for age and sex.
Findings
Among 1016 patients with complete data in the final analysis, four subgroups were identified with distinct clinical features: (1) hepatobiliary involvement with elevated alanine transaminase, gamma-glutamyl transferase, and total bilirubin levels, lowest coronary artery aneurysm but highest intravenous immunoglobulin resistance rates (n=157); (2) highest band neutrophil count and Kawasaki disease shock rate (n=231); (3) cervical lymphadenopathy with high markers of inflammation (erythrocyte sedimentation rate, C-reactive protein, white blood cell, and platelet counts) and lowest age-adjusted haemoglobin Z scores (n=315); and (4) young age at onset with highest coronary artery aneurysm but lowest intravenous immunoglobulin resistance rates (n=313). The subgroups had distinct seasonal and incidence trajectories. In addition, the subgroups shared 211 differential abundance proteins while many proteins were unique to a subgroup.
Interpretation
Our data-driven analysis provides insight into the heterogeneity of Kawasaki disease, and supports the existence of distinct subgroups with important implications for clinical management and research design and interpretation.
Funding
US National Institutes of Health and the Irving and Francine Suknow Foundation.
thelancet.com