Statistical Analysis and Design of Crowdsourcing Applications

Degree type
Doctor of Philosophy (PhD)
Graduate group
Statistics
Subject
crowdsourcing
experimentation
machine learning
missing data
natural language processing
statistical methodology
Computer Sciences
Economics
Statistics and Probability
Copyright date
2015-11-16T00:00:00-08:00
Abstract

This thesis develops methods for the analysis and design of crowdsourced experiments and crowdsourced labeling tasks. Much of it focuses on applications, including running natural field experiments, estimating the number of objects in images, and collecting labels for word sense disambiguation. Shortcomings observed in the crowdsourced experiments inspired the development of methodology for running more powerful experiments via on-the-fly matching. Using the label data to estimate response functions inspired work on nonparametric function estimation using Bayesian Additive Regression Trees (BART). This work in turn inspired extensions to BART, such as the incorporation of missing data, as well as a user-friendly R package.

Advisor
Abba Krieger
Ed George
Date of degree
2014-01-01