Noisy Sorting Capacity

Sorting is the task of ordering n elements using pairwise comparisons. It is well known that <inline-formula> <tex-math notation="LaTeX">m=\Theta (n\log n) </tex-math></inline-formula> comparisons are both necessary and sufficient when the outcomes of the comparison...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on information theory Vol. 70; no. 9; pp. 6121 - 6138
Main Authors Wang, Ziao, Ghaddar, Nadim, Zhu, Banghua, Wang, Lele
Format Journal Article
LanguageEnglish
Published IEEE 01.09.2024
Subjects
Online AccessGet full text
ISSN0018-9448
1557-9654
DOI10.1109/TIT.2024.3425281

Cover

More Information
Summary:Sorting is the task of ordering n elements using pairwise comparisons. It is well known that <inline-formula> <tex-math notation="LaTeX">m=\Theta (n\log n) </tex-math></inline-formula> comparisons are both necessary and sufficient when the outcomes of the comparisons are observed with no noise. In this paper, we study the sorting problem when each comparison is incorrect with some fixed yet unknown probability p. Unlike the common approach in the literature which aims to minimize the number of pairwise comparisons m to achieve a given desired error probability, we consider randomized algorithms with expected number of queries <inline-formula> <tex-math notation="LaTeX">\textsf {E}[M] </tex-math></inline-formula> and aim at characterizing the maximal sorting rate <inline-formula> <tex-math notation="LaTeX">\frac {n\log n}{\mathop {\mathrm {\textsf {E}}}\nolimits [M]} </tex-math></inline-formula> such that the ordering of the elements can be estimated with a vanishing error probability asymptotically. The maximal rate is referred to as the noisy sorting capacity. In this work, we derive upper and lower bounds on the noisy sorting capacity. The two lower bounds - one for fixed-length algorithms and one for variable-length algorithms - are established by combining the insertion sort algorithm with the well-known Burnashev-Zigangirov algorithm for channel coding with feedback. Compared with existing methods, the proposed algorithms are universal in the sense that they do not require the knowledge of p, while maintaining a strictly positive sorting rate. Moreover, we derive a general upper bound on the noisy sorting capacity, along with an upper bound on the maximal rate that can be achieved by sorting algorithms that are based on insertion sort.
ISSN:0018-9448
1557-9654
DOI:10.1109/TIT.2024.3425281