Comparing manual and automatic scoring of sleep monitoring data from portable polygraphy

J Sleep Res. 2021 Apr;30(2):e13036. doi: 10.1111/jsr.13036. Epub 2020 May 20.

Abstract

We used sleep monitoring data from a study that investigated the prevalence, characteristics, risk factors and type of sleep apnea (SA) in 579 patients with paroxysmal atrial fibrillation. Most patients were screened for two nights, resulting in 1,043 sleep recordings that each contained data from one night. SA was diagnosed using the Nox T3 portable sleep monitor. An experienced sleep specialist scored the recordings manually using Noxturnal software. A total of 157 women (27%) and 422 men (73%) were examined; 477 (82.7%) had an apnea-hypopnea index (AHI) ≥ 5/hr, whereas moderate to severe SA (AHI ≥ 15/hr) was diagnosed in 243 patients (42.1%). The AHI derived from automatic and manual scoring showed a good agreement (Pearson's r coefficient of 0.96). The median difference in AHI was very small (i.e., 0.72 [mean difference, 1.06]), but was statistically significant (p < .0001). Automatic scoring classified sleep recordings with more than 90% accuracy into SA categories of mild (AHI ≥ 5/hr), moderate (AHI ≥ 15/hr) and severe (AHI ≥ 30/hr). We found a minor (11%-21%) mis-estimation of the number of recordings right above and below the boundary separating mild and moderate SA. The accuracy of automatic scoring differed from recording to recording, especially regarding the sensitivity of detecting disrupted breathing events. We found low to moderate agreement for the duration of disrupted breathing events (r = .53), for which the automatic scoring led to a statistically significant overestimation by 5.22 s (p < .0001).

Keywords: Nox T3 (Noxturnal); apnea-hypopnea index; portable polygraphy; sleep apnea.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Female
  • Humans
  • Male
  • Polysomnography / methods*
  • Sleep Wake Disorders / diagnosis*