Measuring Lineup Difficulty By Matching Distance Metrics With Subject Choices in Crowd-Sourced Data

Graphics play a crucial role in statistical analysis and data mining. Being able to quantify structure in data that is visible in plots, and how people read the structure from plots is an ongoing challenge. The lineup protocol provides a formal framework for data plots, making inference possible. Th...

Full description

Saved in:
Bibliographic Details
Published inJournal of computational and graphical statistics Vol. 27; no. 1; pp. 132 - 145
Main Authors Chowdhury, Niladri Roy, Cook, Dianne, Hofmann, Heike, Majumder, Mahbubul
Format Journal Article
LanguageEnglish
Published American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America 02.01.2018
Subjects
Online AccessGet full text
ISSN1061-8600
1537-2715
DOI10.1080/10618600.2017.1356323

Cover

More Information
Summary:Graphics play a crucial role in statistical analysis and data mining. Being able to quantify structure in data that is visible in plots, and how people read the structure from plots is an ongoing challenge. The lineup protocol provides a formal framework for data plots, making inference possible. The data plot is treated like a test statistic, and lineup protocol acts like a comparison with the sampling distribution of the nulls. This article describes metrics for describing structure in data plots and evaluates them in relation to the choices that human readers made during several large Amazon Turk studies using lineups. The metrics that were more specific to the plot types tended to better match subject choices, than generic metrics. The process that we followed to evaluate metrics will be useful for general development of numerically measuring structure in plots, and also in future experiments on lineups for choosing blocks of pictures.
ISSN:1061-8600
1537-2715
DOI:10.1080/10618600.2017.1356323