We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Functional Post-Clustering Selective Inference with Applications to EHR Data Analysis

Abstract: In electronic health records (EHR) analysis, clustering patients according to patterns in their data is crucial for uncovering new subtypes of diseases. Existing medical literature often relies on classical hypothesis testing methods to test for differences in means between these clusters. Due to selection bias induced by clustering algorithms, the implementation of these classical methods on post-clustering data often leads to an inflated type-I error. In this paper, we introduce a new statistical approach that adjusts for this bias when analyzing data collected over time. Our method extends classical selective inference methods for cross-sectional data to longitudinal data. We provide theoretical guarantees for our approach with upper bounds on the selective type-I and type-II errors. We apply the method to simulated data and real-world Acute Kidney Injury (AKI) EHR datasets, thereby illustrating the advantages of our approach.
Subjects: Methodology (stat.ME); Applications (stat.AP); Computation (stat.CO)
Cite as: arXiv:2405.03042 [stat.ME]
  (or arXiv:2405.03042v1 [stat.ME] for this version)

Submission history

From: Anru R. Zhang [view email]
[v1] Sun, 5 May 2024 20:07:47 GMT (503kb,D)

Link back to: arXiv, form interface, contact.