J Stroke Search

CLOSE


J Stroke > Volume 28(1); 2026 > Article
Gao, Nguyen, Zhang, Li, Yu, Xu, Wang, Chen, Hu, and on Behalf of the ATTENTION Investigators: Early Predictors of Long-Term Outcome in Basilar Artery Occlusion: A Post Hoc Analysis of the ATTENTION Trial

Abstract

Background and Purpose

Accurately predicting long-term functional outcomes of basilar artery occlusion (BAO) remains challenging. We compared the predictive performance of the baseline, 24-hour, and 72-hour National Institutes of Health Stroke Scale (NIHSS) scores for 90-day BAO functional outcomes using the Acute Basilar Artery Occlusion: Endovascular Thrombectomy versus Standard Medical Treatment (ATTENTION) trial data. We identified the optimal assessment time point, determined treatment-specific NIHSS cutoff values, and explored the role of early neurological function in treatment effects.

Methods

This retrospective post hoc analysis included 324 patients with acute BAO with baseline NIHSS scores ≥10 and complete NIHSS assessments at each time point. The primary outcome was a favorable 90-day functional outcome (modified Rankin Scale score, 0-3). Receiver operating characteristic curve analysis was used to assess the predictive ability of NIHSS scores. The optimal 72-hour NIHSS predictive cutoff values were determined for the endovascular treatment (EVT) and best medical management (BMM) subgroups.

Results

The 72-hour NIHSS score showed the highest predictive accuracy for the primary outcome (area under the receiver operating characteristic curve [AUC]: 0.954), outperforming the 24-hour (AUC: 0.903) and baseline (AUC: 0.688) scores; its optimal predictive cut-off value was ≤11 in the EVT group (sensitivity: 85.6%, specificity: 92.9%, positive predictive value [PPV]: 91.8%, negative predictive value [NPV]: 87.4%) and ≤9 in the BMM group (sensitivity: 84.6%, specificity: 95.1%, PPV: 84.6%, NPV: 95.1%).

Conclusions

The 72-hour NIHSS score outperformed the baseline and 24-hour scores in predicting 90-day functional outcomes and mediating the effects of EVT. Treatment-specific 72-hour NIHSS cut-off values may guide early risk stratification and prognostic assessments.

Introduction

Basilar artery occlusion (BAO), a severe form of ischemic stroke, significantly contributes to the overall disease burden and mortality [1]. The 90-day modified Rankin Scale (mRS) score is the gold standard tool for assessing functional outcomes in BAO; however, its prolonged assessment period limits its clinical utility [2]. Accurately predicting a patient’s recovery trajectory is important. The concept of a “golden plastic period” following a stroke—during which time judicious and targeted interventions, including rehabilitation, can substantially improve disability status [3]— further underscores the critical need for precise early assessment tools.
The National Institutes of Health Stroke Scale (NIHSS)—an accessible, standardized tool for evaluating neurological deficits— is widely used in early stroke assessment [4,5]. However, research on the prognostic value of NIHSS scores in patients with BAO remains limited, impeding optimized prognostic evaluation. Furthermore, different treatment strategies—endovascular treatment (EVT) versus best medical management (BMM)—may impact the predictive performance and optimal cut-off values of NIHSS scores. Identifying the optimal timing for NIHSS assessment, determining treatment-specific cut-off values, and elucidating the role of early neurological function in mediating treatment effects are critical for guiding rehabilitation strategies, enhancing patient stratification, and improving prognostic assessment.
Herein, data from the Acute Basilar Artery Occlusion: Endovascular Thrombectomy versus Standard Medical Treatment (ATTENTION) trial [6] were used to systematically evaluate and compare the predictive performance of baseline, 24-hour, and 72-hour NIHSS scores for 90-day functional outcomes in patients with BAO. The objectives were (1) to determine the optimal NIHSS assessment time point for predicting long-term prognosis in BAO and (2) to explore and validate treatment-specific (EVT vs. BMM) optimized NIHSS cut-off values. We aimed to provide evidence to support early risk stratification and guide clinical post-treatment communication in patients with BAO.

Methods

Study design and patient sample

This retrospective post hoc analysis utilized data from the ATTENTION trial, a multicenter, prospective, randomized, open-label, blinded-endpoint, and clinical trial conducted from February 2021 to January 2022 across 36 stroke centers in China (ClinicalTrials.gov: NCT04751708). Participating centers were required to have performed over 100 EVT procedures for acute ischemic stroke in 2020. The ATTENTION trial’s protocol has been previously published [6]. Key eligibility criteria for the trial were as follows: (1) adult patients (≥18 years old) with a moderate-tosevere acute ischemic stroke due to BAO confirmed using computed tomography (CT) or magnetic resonance angiography; (2) randomization within 12 hours of estimated stroke onset; and (3) a baseline NIHSS score of ≥10. Key exclusion criteria were significant pre-stroke disability (a mRS score of ≥3 for patients <80 years or ≥1 for patients ≥80 years) and a posterior circulation Alberta Stroke Program Early CT Score (pc-ASPECTS) of <6 for patients <80 years and <8 for patients ≥80 years.
For this post hoc analysis, patients were included if they (1) were enrolled in the ATTENTION trial and (2) had complete NIHSS score data at baseline, 24-hour post-randomization, and 72-hour post-randomization. Sixteen patients with incomplete detailed records were excluded.

Data collection

All baseline, treatment, and follow-up data were directly extracted from the central database of the ATTENTION trial, which is managed by an independent clinical research organization (JetMed). The data were electronically exported to minimize transcription errors and ensure high fidelity.

Standard protocol approvals, registrations, and patient consent

The study protocol was approved by the Medical Ethics Committee of the First Affiliated Hospital of the University of Science and Technology of China and relevant local ethics boards (approval number: 2021-ky010) and was conducted in accordance with the Declaration of Helsinki. Written informed consent was obtained from all participants or their legal representatives before enrollment.

Outcome definition

The primary efficacy outcome was a favorable functional outcome at 90 days, defined as an mRS score of 0-3, which is a standard endpoint indicating functional independence in BAO trials. Secondary efficacy outcomes included mRS scores of 0-2 and 0-1.

Statistical analysis

All statistical analyses were conducted using R software (version 4.3.0; R Foundation for Statistical Computing, Vienna, Austria). Unless otherwise specified, all tests were two-sided, with a significance level of P<0.05.

Descriptive analysis

The baseline characteristics, treatment-related variables, and clinical outcomes were summarized using descriptive statistics. Continuous variables with a normal distribution, assessed via the Shapiro-Wilk test, are reported as mean±standard deviation, whereas variables with an abnormal distribution are presented as median (interquartile range, IQR). Categorical variables are expressed as counts and percentages. Continuous variables were compared using independent t-tests (parametric data) or the Mann-Whitney U test (non-parametric data). Categorical variables were analyzed using the chi-square test or Fisher’s exact test for expected cell counts <5.

Missing data handling

We conducted a sensitivity analysis using multiple imputations by chained equations to account for data from 16 excluded patients.

Predictive ability analysis

Receiver operating characteristic (ROC) curve analysis was used to evaluate the predictive performance of the NIHSS-related parameters for 90-day functional outcomes. Parameters included baseline, 24-hour, and 72-hour NIHSS scores; change in the NIHSS (ΔNIHSS) score from baseline to 24 hours and from baseline to 72 hours. The NIHSS score was calculated as the baseline NIHSS score minus the subsequent score. The predicted outcomes comprised primary (mRS score, 0-3) and secondary (mRS score, 0-2 or 0-1) endpoints. The area under the ROC curve (AUC) and 95% confidence interval (CI) were calculated for each parameter and outcome. In addition, DeLong’s test was used to compare the AUC of each predictor with that of the 72-hour NIHSS score.

Determination of optimal prognostic thresholds

For the EVT subgroup, the optimal NIHSS threshold was identified by maximizing the Youden index (sensitivity+specificity-1). The robustness of this threshold was confirmed through sensitivity analyses using metrics such as the Youden index and accuracy. For the BMM subgroup, a clinically informed strategy was employed because a lower a priori probability of favorable outcomes could lead to a low positive predictive value (PPV) if relying solely on the Youden index. Robust thresholds were explored via three sensitivity analysis strategies: (1) maximizing the Youden index while maintaining a PPV ≥80%, (2) maximizing sensitivity while maintaining a PPV ≥80%, and (3) maximizing the PPV while maintaining a sensitivity ≥80%. Sensitivity was defined as TP/(FN+TP), PPV=(sensitivity×prevalence)/(sensitivity ×prevalence)+[(1-specificity)×(1-prevalence)] [7].

Mediation analysis

Mediation analysis has been widely applied to assess early outcome prediction in stroke trials [8,9]. Adopting the framework of Dregan et al. [10], we present our results as total, direct, and indirect effects along with the proportion of mediation, which differs from the classic four-step method [11]. Specifically, we report the average causal mediation effect (ACME), which is an indirect effect, the average direct effect (ADE), and the total effect (TE). TE=ACME (indirect effect)+ADE; proportion mediated=indirect effect/TE. Linear regression was used to assess the effect of treatment assignment on the mediator variables. The link function is Gaussian. Probit regression was used to assess the effect of treatment assignment and 24/72-hour NIHSS score on 90-day mRS scores of 0-3. The link function is probit. Either the 72-hour or 24-hour NIHSS score was treated as a continuous variable. Transformations were not performed based on the NIHSS score, and the principle of temporality was adhered to. The mediator and outcome models were adjusted for age, sex, baseline NIHSS score, pre-stroke mRS score, and baseline pc-ASPECTS score. All metrics were measured before treatment. A quasi-Bayesian approach was used to estimate robust 95% CIs. Statistical significance was defined as a two-sided P-value <0.05.
Furthermore, a sensitivity analysis for sequential ignorability was conducted using the function in the mediation package to assess the robustness. This analysis was used to quantify the strength of an unmeasured confounder that would change the conclusions regarding ACME. The strength of this confounding is parameterized as the correlation between the error terms of the mediator and outcome models (ρ) and the proportion of the total variance in the mediator and the outcome that needs to be explained by the confounder (R2).

Delayed neurological improvement analysis

A Sankey diagram was generated to elucidate patient recovery trajectories. In this diagram, the baseline NIHSS scores are categorized into quartiles (Q1-Q4). The 72-hour NIHSS status was dichotomized based on the optimal prognostic cut-off values identified here for the EVT (≤11) and BMM (≤9) groups. Outcomes were categorized as mRS 0-3 or 4-6. Additionally, a flow chart was constructed to quantify the number of patients for whom the 72-hour NIHSS prognostic assessment was inconsistent with the final 90-day mRS outcome (defined as “crossed”). Exploratory delayed neurological improvement (DNI) analysis was used to identify the characteristics of patients with severe 72-hour NIHSS status achieving an mRS score of 0-3 at 90 days. A multivariable logistic regression model using Firth’s bias reduction method was used to characterize the subgroups with recovery potential despite the early onset of severe disease. This provides a parameter estimation for small sample sizes.

Results

Patient characteristics

This study included 324 patients with complete NIHSS scores at baseline, 24-hour, and 72-hour post-stroke onset. The baseline characteristics of the study cohort are presented in Table 1. The median age was 68 years (IQR, 58 to 74 years), and 103 patients (31.8%) were female. The median NIHSS scores were 24 (IQR, 15 to 35) at baseline, 23 (IQR, 9 to 36) at 24-hour, and 21 (IQR, 6 to 39) at 72-hour post-onset. The median ΔNIHSS score was 0 (IQR -5 to 10) at 24 hours and 2 (IQR, -7 to 13) at 72 hours. No significant baseline differences were observed between the EVT and BMM subgroups (Supplementary Table 1).

ROC curve analysis

Concerning the primary outcome, the 72-hour NIHSS score demonstrated the highest predictive ability across the entire cohort (n=324), with an AUC of 0.954 (95% CI: 0.933-0.975), followed by the ΔNIHSS score from baseline to 72 hours (AUC: 0.876, 95% CI: 0.838-0.914), the 24-hour NIHSS score (AUC: 0.903, 95% CI: 0.870-0.937), and the ΔNIHSS score from baseline to 24 hours (AUC: 0.820, 95% CI: 0.772-0.869). The baseline NIHSS score had the lowest AUC (0.688, 95% CI: 0.629-0.747) (Table 2 and Figure 1). DeLong’s test confirmed that the 72-hour NIHSS score performed significantly better than the 24-hour NIHSS score (P< 0.001) (Figure 2) and the other scores in terms of predictive ability (all P<0.001) (Table 2).
For the secondary outcomes, the 72-hour NIHSS score consistently yielded the highest AUCs: 0.925 (95% CI, 0.898-0.953) for an mRS score of 0-2 and 0.904 (95% CI: 0.869-0.939) for an mRS score of 0-1 (Table 2 and Figure 1). The 24-hour NIHSS score showed a strong predictive ability (AUC: 0.892, 95% CI: 0.855-0.929 for an mRS score of 0-2; 0.872, 95% CI: 0.823- 0.920 for an mRS score of 0-1). However, DeLong’s test indicated a significant superiority of the 72-hour NIHSS score over this score in predicting an mRS score of 0-2 (P<0.05) (Figure 2). The baseline NIHSS score had the lowest AUCs (0.678 for an mRS score of 0-2 and 0.718 for an mRS score of 0-1) (Table 2).

Optimal prognostic cut-off values

In the EVT subgroup, the optimal 72-hour NIHSS cut-off score for predicting a favorable functional outcome was ≤11. This threshold yielded a sensitivity, specificity, PPV, and negative predictive value (NPV) of 85.6%, 92.9%, 91.8%, and a NPV of 87.4%, respectively. Sensitivity analysis confirmed that this threshold exhibited a consistent optimal performance across multiple evaluation metrics. In the BMM subgroup, the conventional approach of maximizing Youden’s index resulted in a threshold of ≤22 with a low PPV (0.65). A clinically driven strategy was employed in this study to determine the optimal 72-hour NIHSS cut-off score, identifying it as ≤9. This threshold had sensitivity, specificity, PPV, and NPV of 84.6%, 95.1%, 84.6%, and 95.1%, respectively (Tables 3 and 4). A sensitivity analysis involving three distinct strategies based on different clinical priorities confirmed that ≤9 was the most robust cut-off score for this subgroup (Supplementary Table 2). The specific results for the other time points are presented in Supplementary Table 3.

Mediation effect analysis

When the 72-hour NIHSS score was considered as the mediator, a significant portion of the treatment effect was found to be indirect. In the primary analysis (n=324), the TE of EVT was a 23.6 percentage point increase in the probability of a favorable outcome (mRS 0-3) at 90 days (coefficient: 0.24, 95% CI: 0.14- 0.32; P<0.001). The ACME, or indirect effect, was 0.15 (95% CI: 0.09-0.20; P<0.001), while the ADE was 0.09 (95% CI: 0.02-0.15; P<0.001). This indicates that 62.1% (95% CI: 44.9%-89.4%) of the total benefit of EVT was mediated through its impact on improving neurological status at 72 hours. When the 24-hour NIHSS score was used as the mediator, the indirect effect (ACME) was 0.11 (95% CI: 0.06-0.16; P<0.001), accounting for 44.3% (95% CI: 29.7%-65.0%) of the TE (Figure 3 and Supplementary Table 4).
Moreover, the results of sensitivity analysis for sequential ignorability indicate that, for NIHSS at 24 hours, when ρ reached 0.5, the ACME was no longer significant: R^2_M*R^2_Y* at which ACME=0:0.25. R^2_M~R^2_Y~ at which ACME=0:0.0576. For NIHSS at 72 hours, the ACME became non-significant as it approached zero atρ=0.6; R^2_M*R^2_Y* at which ACME=0:0.36; R^2_M~R^2_Y~ at which ACME=0:0.0491. These findings suggest that the estimated mediation effect was robust against potentially unmeasured confounding factors (Supplementary Figures 1-4).

Sensitivity analysis using imputation

The results after imputation were highly consistent with those of the prior analysis. The 72-hour NIHSS score remained the most accurate predictor of the 90-day functional outcome (BMM group AUC: 0.963 [0.933-0.993]; EVT group AUC: 0.947 [0.917- 0.976]) (Supplementary Table 5), and all DeLong tests for the 72-hour NIHSS score compared with the others were P<0.05. For the primary outcome, the cutoff was 11 for the EVT group and 9 for the BMM group, remaining stable (Supplementary Tables 2 and 6). The results of the mediation analysis remained stable. With the 72-hour NIHSS as the mediator, the proportion mediated was 63.7% (ACME: 0.144, 95% CI: 0.09-0.21) (Supplementary Table 4).

DNI analysis

The Sankey diagram (Supplementary Figure 5) visually demonstrated that a small proportion of patients with severe baseline deficits (Q3 and Q4) transitioned to an unfavorable 72-hour NIHSS status (≤11) but achieved a good 90-day outcome (mRS 0-3). The flowchart (Supplementary Figure 6) provides a quantitative breakdown of the concordance between the 72-hour prognostic assessment and the final outcome. Among the 119 patients in the EVT group who did not reach the prognostic threshold (72-hour NIHSS >11), 15 ultimately achieved a favorable outcome (mRS 0-3). A similar pattern was observed in the BMM group, in which 4 of 82 patients with poor prognosis recovered favorably. In the EVT subgroup (n=119), age was a robust significant independent predictor of DNI (adjusted odds ratio: 0.94, 95% CI: 0.88-0.99, P=0.03), indicating that younger age was associated with an increased likelihood of recovery. For middle basilar artery occlusion and intravenous thrombolysis, the P-value indicates statistical significance. However, these results should be interpreted with caution because of the wide 95% CI. In the BMM subgroup (n=82), no baseline variables were significantly associated with the DNI (all P>0.05) (Supplementary Table 7).

Discussion

In this study, based on ATTENTION trial data, we identified the 72-hour NIHSS score as a reliable and superior early predictor of long-term outcomes in patients with BAO. Our findings show that neurological status at 72 hours more accurately predicts 90-day functional outcomes compared with that at earlier time points and better accounts for the clinical benefits of EVT. Furthermore, we established for the first time treatment-specific 72-hour NIHSS score thresholds with high predictive value for EVT and BMM. These findings provide novel evidence-based guidance for early prognostic communication and risk stratification of patients with BAO.
Compared with the baseline score, the 24-hour NIHSS score is a stronger predictor of long-term functional prognosis [4,12-14]. High-quality evidence from the Basilar Artery Occlusion Endovascular Intervention versus Standard Medical Treatment trial revealed direct comparisons of the predictive performance of percent change in NIHSS, 24-hour NIHSS, and ΔNIHSS. The absolute 24-hour NIHSS score was the strongest predictor [13]. In this study, we confirmed that the 24- and 72-hour NIHSS scores effectively predicted long-term functional outcomes (mRS score of 0-3), with the 72-hour assessment exhibiting superior predictive accuracy. This is consistent with later assessments that reached a higher agreement [15].
The advantage of the 72-hour NIHSS score was not coincidental. The evolution of cerebral edema is a time-dependent process, with its peak typically occurring 3-5 days post-onset [16]; therefore, an assessment performed at 24 hours may underestimate its impact, as the peak has not yet been reached. The full clinical impact of periprocedural complications (such as symptomatic intracranial hemorrhage) has a delayed manifestation. Perihematomal edema secondary to hemorrhage continues to develop after the initial 24 hours, with the average thickness of the edema reaching only 60% of its peak at 24 hours [17,18]. Neu-rological deficits can also be more accurately assessed after the effects of anesthesia (particularly general anesthesia) have stabilized. Reperfusion dynamics may also play crucial roles in this process. The “no-reflow” phenomenon, which is the failure of microcirculatory-level perfusion despite successful largevessel recanalization, can persist within the first 24 hours after vessel recanalization [17] and results in progressive, delayed tissue death [19,20].
The magnitude of the mediation effect observed at 72 hours is consistent with findings from existing literature. Prior studies reported that the 24-hour NIHSS score mediated between 54% and 68% of the treatment effect of EVT [8,9]. Kniep et al. [9] found that the mediation effect increased to 75% when the NIHSS score at hospital discharge was used as the mediator. Our finding that the 72-hour NIHSS score mediates 62.1% of the treatment effect aligns with this temporal trend. This finding suggests that the 72-hour assessment captures a larger proportion of the ultimate treatment benefit than the 24-hour assessment, while also being a more clinically timely and practical endpoint than discharge assessments.
For patients receiving EVT, a 72-hour NIHSS score of ≤11 was identified as a robust prognostic cut-off, based on its consistent optimality across multiple metrics. Notably, this threshold is higher than the 24-hour NIHSS cut-off score of ≤9 proposed in a previous key study [21]. This difference is likely attributable to the more severe patient population in our study. Crucially, this conclusion is supported by the subgroup analysis from Kniep et al. [21], which showed that the sensitivity of their ≤9 threshold dropped significantly in patients with a baseline NIHSS score ≥20. In contrast, our proposed threshold of ≤11 represents a prognostic marker with high clinical applicability for this more severe patient population. It is the most reliable prognostic marker for predicting favorable long-term functional outcomes in patients with moderate-to-severe BAO.
The prognostic cutoff for the 72-hour NIHSS score in the BMM subgroup was determined. The conventional method of maximizing Youden’s index produces a high risk of “false optimism,” reaching clinically intolerable levels. Such false positive predictions pose significant risks. This could hinder families from preparing for long-term disability and may even lead clinicians to incorrectly withhold effective treatment [22]. Furthermore, as shown in Supplementary Figure 7A, when predicting a favorable outcome (mRS score of 0-3), multiple NIHSS score thresholds have comparable Youden’s indices, indicating that relying solely on Youden’s index lacks clinical utility. We tested three strategies for the sensitivity analysis. Notably, all three analytical paths, each satisfying different priorities, consistently indicated the same optimal threshold. This finding greatly enhances the robustness of this threshold and confirms a core principle: in BMM cohorts with a low prior probability of favorable outcomes, prioritizing predictive reliability over maximizing Youden’s index is paramount to ensuring the model’s clinical applicability and supporting responsible physician-patient communication. When evaluating the strictest outcome (mRS 0-1), determining an ideal threshold that achieves a high PPV while balancing sensitivity and specificity is extremely challenging. As a rarer outcome, mRS 0-1 inherently has a lower PPV than mRS 0-3, significantly increasing the risk of the model making “false optimistic” predictions.
Delayed recovery from “sleeping neurons” may occur beyond 72 hours [23], potentially causing early NIHSS scores to overestimate post-EVT injury. Our exploratory analysis of DNI indicated that age was a key independent predictor in patients receiving EVT. Therefore, for younger patients, the prognosis should not be determined prematurely based solely on severe early neurological deficits, as they still have considerable potential for achieving favorable functional recovery at a later stage. These findings are consistent with those of previous research [16], possibly attributable to the role of the cerebral reserve [24], with younger patients’ brains exhibiting greater postoperative plasticity and tolerance to injury [25].
This study had certain limitations. First, the 72-hour NIHSS score is only a snapshot of the dynamic disease course, and its predictive value depends on subsequent conditions. Second, our clinically driven strategy for the BMM subgroup, in which a high PPV was prioritized, resulted in a trade-off with lower sensitivity. Third, our data showed the acceptability of this trade-off for the primary outcome (the sensitivity of a 72-hour NIHSS cut-off score of ≤9 remains above 80%). Fourth, our analysis did not extend beyond 72 hours. The NIHSS score after 72 hours may hold a stronger value, and subsequent observation times should be explored in future studies. Fifth, the NIHSS may not fully capture symptoms such as ataxia or dysphagia [26]. Nonetheless, its use remains appropriate, universal, and comparable in patients with BAO within the context of current practices [21]. Sixth, we were unable to adjust for post-treatment confounders between 24 and 72 hours, such as sedation weaning, early complications (e.g., pneumonia), or decompressive surgery. These events may represent significant confounders that could influence the mediating effect of NIHSS. Seventh, although the sensitivity analysis of the imputed data showed robust results, excluding patients with missing data was a potential source of bias that may limit the generalizability of our findings. Finally, our study population was restricted to patients with moderate-to-severe BAO (baseline NIHSS score ≥10). Therefore, our proposed thresholds cannot be generalized to patients with mild stroke presentations.

Conclusions

The 72-hour NIHSS score outperformed the ΔNIHSS score from baseline to 72 hours, the 24-hour NIHSS score, and the ΔNIHSS score from baseline to 24 hours in predicting 90-day functional outcomes. The 72-hour NIHSS score-based approach proposed in this study provides a valuable reference for clinical prognostic stratification.

Supplementary materials

Supplementary materials related to this article can be found online at https://doi.org/10.5853/jos.2025.04014.
Supplementary Table 1.
Patient demographics and baseline characteristics
jos-2025-04014-Supplementary-Table-1.pdf
Supplementary Table 2.
Strategy (assume threshold)
jos-2025-04014-Supplementary-Table-2,3.pdf
Supplementary Table 3.
Association of various thresholds of NIHSS (admission/24 hours/ΔNIHSS baseline to 24 hours and 72 hours) scores with 90-day outcomes (unimputed dataset, n=324)
jos-2025-04014-Supplementary-Table-2,3.pdf
Supplementary Table 4.
Mediation analysis for the associations between EVT and mRS scores 0-3
jos-2025-04014-Supplementary-Table-4-6.pdf
Supplementary Table 5.
ROC analysis of NIHSS scores for 90-day outcomes at various time points (sensitivity analysis for primary outcome on the complete dataset)
jos-2025-04014-Supplementary-Table-4-6.pdf
Supplementary Table 6.
Association of various thresholds of 72-hour NIHSS scores with 90-day outcomes (sensitivity analysis on the complete dataset)
jos-2025-04014-Supplementary-Table-4-6.pdf
Supplementary Table 7.
Results of the multivariable logistic regression model using Firth’s bias reduction in the delayed neurological improvement analysis
jos-2025-04014-Supplementary-Table-7.pdf
Supplementary Figure 1.
Sensitivity analysis based on correlation parameter (ρ) between error terms (for the 24-hour National Institutes of Health Stroke Scale score). ACME, average causal mediation effect.
jos-2025-04014-Supplementary-Fig-1,2.pdf
Supplementary Figure 2.
Sensitivity analysis based on R2 parameters for unobserved confounders (for the 24-hour National Institutes of Health Stroke Scale score). ACME, average causal mediation effect.
jos-2025-04014-Supplementary-Fig-1,2.pdf
Supplementary Figure 3.
Sensitivity analysis based on correlation parameter (ρ) between error terms (for the 72-hour National Institutes of Health Stroke Scale score). ACME, average causal mediation effect.
jos-2025-04014-Supplementary-Fig-3,4.pdf
Supplementary Figure 4.
Sensitivity analysis based on R2 parameters for unobserved confounders (for the 72-hour National Institutes of Health Stroke Scale score). ACME, average causal mediation effect.
jos-2025-04014-Supplementary-Fig-3,4.pdf
Supplementary Figure 5.
Trajectories from baseline to 90-day outcome, stratified by treatment group. The diagram shows the distribution of the 324 patients included in the study. The number of patients achieving a favorable (mRS 0-3) or unfavorable (mRS 4-6) outcome at 90 days is shown for each subgroup. “Crossed” indicates patients whose 90-day outcome was inconsistent with the prediction based on their 72-hour NIHSS status. EVT, endovascular treatment; NIHSS, National Institutes of Health Stroke Scale; mRS, modified Rankin Scale; BMM, best medical management.
jos-2025-04014-Supplementary-Fig-5.pdf
Supplementary Figure 6.
Patient recovery trajectories from baseline to 90-day outcome. The diagrams illustrate the flow of patients in the best medical management (BMM) group (below) and the endovascular treatment (EVT) group (above). The first axis represents the baseline National Institutes of Health Stroke Scale (NIHSS) score, categorized by quartiles (Q1-Q4). The second axis shows the 72-hour NIHSS status, dichotomized by the study’s optimal cut-off values (≤9 for BMM and ≤11 for EVT). The final axis represents the 90-day modified Rankin Scale (mRS) outcome, categorized as favorable (mRS 0-3) or unfavorable (mRS 4-6). The width of the colored flows is proportional to the number of patients following each specific trajectory.
jos-2025-04014-Supplementary-Fig-6.pdf
Supplementary Figure 7.
Cut point value analysis. (A) Maximize Youden index. (B) Minimum positive predictive value (PPV) ≥0.80, maximize Youden. (C) Minimum PPV ≥0.80, maximize Sen. (D) Minimum Sen ≥0.80, maximize PPV.
jos-2025-04014-Supplementary-Fig-7.pdf

Notes

Funding statement
This study was funded by the Fundamental Research Fund for Central Universities (grant number YD9110002014).
Conflicts of interest
The authors have no financial conflicts of interest.
Author contribution
Conceptualization: Feiyang Gao, Wei Hu. Study design: Feiyang Gao, Thanh N. Nguyen. Methodology: Feiyang Gao, Chao Zhang. Data collection: Rui Li, Dafan Yu, Pengfei Xu, Anmo Wang, Min Chen. Investigation: Feiyang Gao, Rui Li. Statistical analysis: Feiyang Gao, Chao Zhang. Writing—original draft: Feiyang Gao, Thanh N. Nguyen. Writing—review & editing: Feiyang Gao, Thanh N. Nguyen, Wei Hu. Funding acquisition: Wei Hu. Approval of final manuscript: all authors.
Acknowledgments
We extend our appreciation to all collaborators in the ATTENTION trial.

Figure 1.
Receiver operating characteristic curves for the total population. (A) mRS score 0-3. (B) mRS 0-2. (C) mRS 0-1. AUC, area under the receiver operating characteristic curve; NIHSS, National Institutes of Health Stroke Scale; mRS, modified Rankin Scale.
jos-2025-04014f1.jpg
Figure 2.
Receiver operating characteristic curves between 72 and 24 hours (DeLong’s test). (A) mRS score 0-3. (B) mRS 0-2. (C) mRS 0-1. AUC, area under the receiver operating characteristic curve; NIHSS, National Institutes of Health Stroke Scale; mRS, modified Rankin Scale.
jos-2025-04014f2.jpg
Figure 3.
Model of the hypothetical causal pathway in patients with acute ischemic stroke. Total effect=average causal mediation effect (ACME) (indirect effect)+average direct effect (ADE) (direct effect); proportion mediated=ACME (indirect effect)/total effect. The analysis was adjusted for age, sex, pre-stroke modified Rankin Scale (mRS) score, posterior circulation Alberta Stroke Program Early CT score, and baseline National Institutes of Health Stroke Scale (NIHSS) score.
jos-2025-04014f3.jpg
Table 1.
Patient demographics and baseline characteristics
Characteristic Value (n=324)
Age (yr) 68 (58, 74)
Sex
 Female 103 (31.8)
 Male 221 (68.2)
Pre-stroke mRS
 0 287 (88.6)
 1 30 (9.3)
 2 7 (2.2)
Baseline pc-ASPECTS 9 (8, 10)
Cause of stroke
 Large-artery atherosclerosis 145 (44.8)
 Cardioembolism 70 (21.6)
 Undetermined cause 106 (32.7)
 Other determined cause 3 (0.9)
Occlusion site
 Proximal basilar artery 102 (31.6)
 Middle basilar artery 86 (26.6)
 Distal basilar artery 109 (33.7)
 Vertebral artery V4 26 (8.0)
Intravenous thrombolysis
 No 224 (69.1)
 Yes 100 (30.9)
Treatment
 Best medical care 108 (33.3)
 Endovascular thrombectomy 216 (66.7)
NIHSS at admission 24 (15, 35)
NIHSS at 24 hours 23 (9, 36)
NIHSS at 72 hours 21 (6, 39)
ΔNIHSS baseline to 24 hours 0 (-5, 10)
ΔNIHSS baseline to 72 hours 2 (-7, 13)
Values are presented as median (interquartile range) or number (%).
mRS, modified Rankin Scale; pc-ASPECTS, posterior circulation Alberta Stroke Program Early CT Score; NIHSS, National Institutes of Health Stroke Scale.
Table 2.
ROC analysis of NIHSS scores for 90-day outcomes at various time points
Characteristic Total (n=324)
BMM (n=108)
EVT (n=216)
AUC (95% CI) P AUC (95% CI) P AUC (95% CI) P
mRS 0-3 at 90 days
 NIHSS at admission 0.688 (0.629-0.747) <0.001 0.697 (0.575-0.818) <0.001 0.708 (0.640-0.777) <0.001
 NIHSS at 24 hours 0.903 (0.870-0.937) <0.001 0.890 (0.821-0.958) 0.026 0.907 (0.866-0.947) 0.008
 NIHSS at 72 hours 0.954 (0.933-0.975) - 0.969 (0.942-0.996) - 0.943 (0.912-0.975) -
 ΔNIHSS baseline to 24 hours 0.820 (0.772-0.869) <0.001 0.832 (0.735-0.929) 0.002 0.797 (0.735-0.859) <0.001
 ΔNIHSS baseline to 72 hours 0.876 (0.838-0.914) <0.001 0.916 (0.859-0.973) 0.029 0.844 (0.790-0.898) <0.001
mRS 0-2 at 90 days
 NIHSS at admission 0.678 (0.612-0.744) <0.001 0.672 (0.506-0.838) <0.001 0.704 (0.631-0.778) <0.001
 NIHSS at 24 hours 0.892 (0.855-0.929) 0.0241 0.908 (0.832-0.984) 0.199 0.885 (0.840-0.931) 0.098
 NIHSS at 72 hours 0.925 (0.898-0.953) - 0.944 (0.892-0.996) - 0.914 (0.877-0.951) -
 ΔNIHSS baseline to 24 hours 0.824 (0.774-0.873) <0.001 0.896 (0.808-0.984) 0.289 0.779 (0.716-0.843) <0.001
 ΔNIHSS baseline to 72 hours 0.844 (0.802-0.886) <0.001 0.905 (0.846-0.963) 0.241 0.801 (0.743-0.859) <0.001
mRS 0-1 at 90 days
 NIHSS at admission 0.718 (0.641-0.795) <0.001 0.700 (0.512-0.888) 0.012 0.737 (0.652-0.823) <0.001
 NIHSS at 24 hours 0.872 (0.823-0.920) 0.076 0.912 (0.827-0.997) 0.810 0.852 (0.792-0.912) 0.078
 NIHSS at 72 hours 0.904 (0.869-0.939) - 0.916 (0.846-0.987) - 0.892 (0.847-0.936) -
 ΔNIHSS baseline to 24 hours 0.771 (0.709-0.834) <0.001 0.863 (0.748-0.978) 0.359 0.716 (0.640-0.792) <0.001
 ΔNIHSS baseline to 72 hours 0.782 (0.731-0.834) <0.001 0.877 (0.806-0.947) 0.337 0.729 (0.662-0.797) <0.001
Death within 90 days
 NIHSS at admission 0.646 (0.585-0.707) <0.001 0.640 (0.534-0.746) <0.001 0.667 (0.593-0.742) <0.001
 NIHSS at 24 hours 0.858 (0.817-0.898) <0.001 0.835 (0.759-0.912) 0.101 0.861 (0.810-0.912) <0.001
 NIHSS at 72 hours 0.932 (0.905-0.958) - 0.923 (0.874-0.971) - 0.924 (0.885-0.963) -
 ΔNIHSS baseline to 24 hours 0.786 (0.735-0.836) <0.001 0.796 (0.710-0.882) 0.003 0.763 (0.695-0.831) <0.001
 ΔNIHSS baseline to 72 hours 0.863 (0.820-0.906) <0.001 0.883 (0.814-0.952) 0.094 0.837 (0.777-0.898) <0.001
P-values were calculated using DeLong’s test to compare the AUC of each predictor with the AUC of the 72-hour NIHSS score.
ROC, receiver operating characteristic; NIHSS, National Institutes of Health Stroke Scale; AUC, area under the ROC curve; CI, confidence interval; BMM, best medical management; EVT, endovascular treatment; mRS, modified Rankin Scale; -, not applicable.
Table 3.
Association of various thresholds of the 72-hour NIHSS score with 90-day outcomes
Group AUC Direction Cutpoint Youden Sensitivity (%) Specificity (%) PPV (%) NPV (%)
EVT mRS 0-3 0.943 <= 11 0.7843407 85.6 92.9 91.8 87.4
EVT mRS 0-2 0.914 <= 10 0.7702128 93.3 83.7 75.3 95.9
EVT mRS 0-1 0.892 <= 10 0.6631579 95.6 70.8 46.2 98.4
BMM mRS 0-3 0.969 <= 22 0.8292683 100.0 82.9 65.0 100.0
BMM mRS 0-2 0.944 <= 8 0.8020833 91.7 88.5 50.0 98.8
BMM mRS 0-1 0.916 <= 8 0.7474747 88.9 85.9 36.4 98.8
NIHSS, National Institutes of Health Stroke Scale; AUC, area under the receiver operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; EVT, endovascular treatment; mRS, modified Rankin Scale; BMM, best medical management.
Table 4.
Association of various thresholds of NIHSS scores with 90-day outcomes for BMM mRS scores 0-3 (assumed thresholds)
Predictors AUC Direction Cutpoint Youden Sensitivity (%) Specificity (%) PPV (%) NPV (%)
NIHSS at admission 0.697 <= NA NA NA NA NA NA
NIHSS at 24 hours 0.890 <= 4 0.2954972 30.8 98.8 88.9 81.8
NIHSS at 72 hours 0.969 <= 9 0.7973734 84.6 95.1 84.6 95.1
ΔNIHSS baseline to 24 hours 0.832 >= 10 0.3217636 34.6 97.6 81.8 82.5
ΔNIHSS baseline to 72 hours 0.916 >= 12 0.4249531 46.2 96.3 80.0 84.9
NIHSS, National Institutes of Health Stroke Scale; BMM, best medical management; mRS, modified Rankin Scale; AUC, area under the receiver operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; NA, not applicable.

References

1. Mattle HP, Arnold M, Lindsberg PJ, Schonewille WJ, Schroth G. Basilar artery occlusion. Lancet Neurol 2011;10:1002-1014.
crossref pmid
2. Kerr DM, Fulton RL, Lees KR; VISTA Collaborators. Seven-day NIHSS is a sensitive outcome measure for exploratory clinical trials in acute stroke: evidence from the Virtual International Stroke Trials Archive. Stroke 2012;43:1401-1403.
crossref pmid
3. Wei X, Sun S, Zhang M, Zhao Z. A systematic review and meta-analysis of clinical efficacy of early and late rehabilitation interventions for ischemic stroke. BMC Neurol 2024;24:91.
crossref pmid pmc pdf
4. Meyer L, Broocks G, Bechstein M, Flottmann F, Leischner H, Brekenfeld C, et al. Early clinical surrogates for outcome prediction after stroke thrombectomy in daily clinical practice. J Neurol Neurosurg Psychiatry 2020;91:1055-1059.
crossref pmid
5. Rangaraju S, Frankel M, Jovin TG. Prognostic value of the 24-hour neurological examination in anterior circulation ischemic stroke: a post hoc analysis of two randomized controlled stroke trials. Interv Neurol 2016;4:120-129.
crossref pmid pmc pdf
6. Tao C, Nogueira RG, Zhu Y, Sun J, Han H, Yuan G, et al. Trial of endovascular treatment of acute basilar-artery occlusion. N Engl J Med 2022;387:1361-1372.
pmid
7. Bours MJ. Bayes’ rule in diagnosis. J Clin Epidemiol 2021;131:158-160.
crossref pmid
8. Ospel JM, Brown S, Bosshart S, Stebner A, Uchida K, Demchuk A, et al. Modified Rankin Scale at 90 days versus National Institutes of Health Stroke Scale at 24 hours as primary outcome in acute stroke trials. J Am Heart Assoc 2025;14:e037752.
crossref pmid pmc
9. Kniep H, Meyer L, Bechstein M, Broocks G, Guerreiro H, van Horn N, et al. How much of the thrombectomy related improvement in functional outcome is already apparent at 24 hours and at hospital discharge? Stroke 2022;53:2828-2837.
crossref pmid
10. Dregan A, Rayner L, Davis KAS, Bakolis I, Arias de la Torre J, Das-Munshi J, et al. Associations between depression, arterial stiffness, and metabolic syndrome among adults in the UK Biobank population study: a mediation analysis. JAMA Psychiatry 2020;77:598-606.
crossref pmid pmc
11. Boers AMM, Jansen IGH, Brown S, Lingsma HF, Beenen LFM, Devlin TG, et al. Mediation of the relationship between endovascular therapy and functional outcome by follow-up infarct volume in patients with acute ischemic stroke. JAMA Neurol 2019;76:194-202.
pmid pmc
12. You S, Wang Y, Wang X, Maeda T, Ouyang M, Han Q, et al. Twenty-four-hour post-thrombolysis NIHSS score as the strongest prognostic predictor after acute ischemic stroke: ENCHANTED study. J Am Heart Assoc 2024;13:e036109.
crossref pmid pmc
13. Mistry EA, Yeatts S, de Havenon A, Mehta T, Arora N, De Los Rios La Rosa F, et al. Predicting 90-day outcome after thrombectomy: baseline-adjusted 24-hour NIHSS is more powerful than NIHSS score change. Stroke 2021;52:2547-2553.
crossref pmid pmc
14. Rangaraju S, Jovin TG, Frankel M, Schonewille WJ, Algra A, Kappelle LJ, et al. Neurologic examination at 24 to 48 hours predicts functional outcomes in basilar artery occlusion stroke. Stroke 2016;47:2534-2540.
crossref pmid pmc
15. Rinkel LA, Ospel JM, Kappelhof M, Sehgal A, McDonough RV, Tymianski M, et al. Comparing early National Institutes of Health Stroke Scale versus 90-day modified Rankin Scale outcomes in acute ischemic stroke trials: a systematic review and analysis. J Am Heart Assoc 2025;14:e040304.
crossref pmid pmc
16. Battey TW, Karki M, Singhal AB, Wu O, Sadaghiani S, Campbell BC, et al. Brain edema predicts outcome after nonlacunar ischemic stroke. Stroke 2014;45:3643-3648.
crossref pmid pmc
17. Nie X, Leng X, Miao Z, Fisher M, Liu L. Clinically ineffective reperfusion after endovascular therapy in acute ischemic stroke. Stroke 2023;54:873-881.
crossref pmid
18. Lim-Hing K, Rincon F. Secondary hematoma expansion and perihemorrhagic edema after intracerebral hemorrhage: from bench work to practical aspects. Front Neurol 2017;8:74.
crossref pmid pmc
19. Jia M, Jin F, Li S, Ren C, Ruchi M, Ding Y, et al. No-reflow after stroke reperfusion therapy: an emerging phenomenon to be explored. CNS Neurosci Ther 2024;30:e14631.
crossref pmid pmc
20. Cheng Z, Wang H, Geng X, Rajah GB, Elmadhoun O, Peng G, et al. Time and tissue windows in futile reperfusion after ischemic stroke. Aging Dis 2024;16:2544-2552.
pmid pmc
21. Kniep H, Bechstein M, Broocks G, Brekenfeld C, Flottmann F, van Horn N, et al. Early surrogates of outcome after thrombectomy in posterior circulation stroke. Eur J Neurol 2022;29:3296-3306.
crossref pmid pdf
22. Bang OY, Park HY, Yoon JH, Yeo SH, Kim JW, Lee MA, et al. Predicting the long-term outcome after subacute stroke within the middle cerebral artery territory. J Clin Neurol 2005;1:148-158.
crossref pmid pmc
23. Meurer WJ. Delayed neurological improvement: dead or sleeping neurons? Stroke 2021;52:2218-2219.
crossref pmid
24. Rabinstein AA, Albers GW, Brinjikji W, Koch S. Factors that may contribute to poor outcome despite good reperfusion after acute endovascular stroke therapy. Int J Stroke 2019;14:23-31.
crossref pmid pdf
25. Galluzzi S, Lanni C, Pantoni L, Filippi M, Frisoni GB. White matter lesions in the elderly: pathophysiological hypothesis on the effect on brain plasticity and reserve. J Neurol Sci 2008;273:3-9.
crossref pmid
26. Sato S, Toyoda K, Uehara T, Toratani N, Yokota C, Moriwaki H, et al. Baseline NIH Stroke Scale score predicting outcome in anterior and posterior circulation strokes. Neurology 2008;70(24 Pt 2):2371-2377.
crossref pmid


ABOUT JoS
AUTHOR INFORMATION
ARTICLE CATEGORY

Browse all articles >

BROWSE ARTICLES
Editorial Office
Department of Neurology, Asan Medical Center,Ulsan University College of Medicine
88, Olympic-ro 43-gil, Songpa-gu, Seoul 05505, Korea
Submission, status and progress, etc ⟫ E-mail: editor@j-stroke.org
Website and system ⟫ E-mail: journal@m2community.co.kr
Publishing company ⟫ E-mail: ka72sus@smileml.com
Developed in M2PI
Copyright © 2026 by Korean Stroke Society.
Close layer
prev next