TY - JOUR
T1 - Teaching Citizen Scientists to Categorize Glitches using Machine-Learning-Guided Training
JF - Computers in Human Behavior
Y1 - 2020
A1 - Corey Jackson
A1 - Carsten Østerlund
A1 - Kevin Crowston
A1 - Mahboobeh Harandi
A1 - Sarah Allen
A1 - Sara Bahaadini
A1 - Scott Coughlin
A1 - Vicky Kalogera
A1 - Aggelos Katsaggelos
A1 - Shane Larson
A1 - Neda Rohani
A1 - Joshua Smith
A1 - Laura Trouille
A1 - Michael Zevin
AB - Training users in online communities is important for developing high-performing contributors. However, several conundrums exist in choosing the most effective approach to training users. For example, if it takes time to learn to do the task correctly, then users' initial contributions may not be of high enough quality to be useful. We conducted an online field experiment in which we recruited users (N = 386) in a web-based citizen-science project to evaluate two training approaches. In one training regime, users received one-time training and were asked to learn and apply twenty classes to the data. In the other, users were gradually exposed to classes of data that trained machine-learning algorithms had identified as members of particular classes. The results of our analysis revealed that the gradual training produced "high-performing contributors". In our comparison of the treatment and control groups, we found that users who experienced gradual training performed significantly better on the task (an average accuracy of 90% vs. 54%), contributed more work (an average of 228 vs. 121 classifications), and were retained in the project longer (an average of 2.5 vs. 2 sessions). The results suggest that online production communities seeking to train newcomers would benefit from training regimes that gradually introduce them to the work of the project using real tasks.
VL - 105
ER -