Cai TT, Ma R. Matrix Reordering for Noisy Disordered Matrices: Optimality and Computationally Efficient Algorithms.
IEEE TRANSACTIONS ON INFORMATION THEORY 2024;
70:509-531. [PMID:
39036782 PMCID:
PMC11257605 DOI:
10.1109/tit.2023.3305538]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/23/2024]
Abstract
Motivated by applications in single-cell biology and metagenomics, we investigate the problem of matrix reordering based on a noisy disordered monotone Toeplitz matrix model. We establish the fundamental statistical limit for this problem in a decision-theoretic framework and demonstrate that a constrained least squares estimator achieves the optimal rate. However, due to its computational complexity, we analyze a popular polynomial-time algorithm, spectral seriation, and show that it is suboptimal. To address this, we propose a novel polynomial-time adaptive sorting algorithm with guaranteed performance improvement. Simulations and analyses of two real single-cell RNA sequencing datasets demonstrate the superiority of our algorithm over existing methods.
Collapse