Unrestricted data and statistical code accompanying Hwang, J. and N. Naik. 2023. "Systematic Social Observation at Scale: Using Crowdsourcing and Computer Vision to Measure Visible Neighborhood Conditions"

Placeholder Show Content

Abstract/Contents

Abstract

This repository contains replication material for Jackelyn Hwang & Nikhil Naik's (2023) "Systematic Social Observation at Scale: Using Crowdsourcing and Computer Vision to Measure Visible Neighborhood Conditions" in Sociological Methodology. The code and data to replicate tables and figures in the paper are stored here. Code and data to replicate the SSO@S pipeline is available on Github (https://github.com/Changing-Cities-Research-Lab/rep-gsv-trash).

Article Abstract: Analysis of neighborhood environments is important for understanding inequality. Few studies, however, utilize direct measures of the visible characteristics of neighborhood conditions, despite their theorized importance in shaping individual and community well-being, because collecting data on the physical conditions of places across neighborhoods and cities and over time has required extensive time and labor. We introduce Systematic Social Observation at Scale (SSO@S)—a pipeline for using visual data, crowdsourcing, and computer vision to identify visible characteristics of neighborhoods at a large scale. We implement SSO@S on millions of street-level images across three physically distinct cities—Boston, Detroit, and Los Angeles—from 2007 to 2020 to identify trash across space and over time. We evaluate the extent to which this approach can be used to assist with systematic coding of street-level imagery through cross-validation and out-of-sample validation, class-activation mapping, and comparisons with other sources of observed neighborhood characteristics. The SSO@S approach produces estimates with high reliability that correlate with some expected demographic characteristics but not others, depending on the city. We conclude with an assessment of this approach for measuring visible characteristics of neighborhoods and the implications for methods and research.

Description

Type of resource Dataset, text
Date modified January 27, 2024
Publication date April 8, 2023

Creators/Contributors

Author Hwang, Jackelyn
Author Naik, Nikhil

Subjects

Subject Computational Social Science
Subject Neighborhoods
Subject Sociology, Urban
Subject Disorder
Subject Computer vision
Subject Crowdsourcing
Genre Data
Genre Documentation
Genre Tabular data
Genre Code
Genre Data sets
Genre Dataset
Genre Tables (data)
Genre Computer program

Bibliographic information

Related item
DOI https://doi.org/10.25740/xy095yh6422
Location https://purl.stanford.edu/xy095yh6422

Access conditions

Use and reproduction
User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License
This work is licensed under a Creative Commons Attribution 4.0 International license (CC BY).

Preferred citation

Preferred citation
Hwang, J. and Naik, N. (2023). Unrestricted data and statistical code accompanying Hwang, J. and N. Naik. 2023. "Systematic Social Observation at Scale: Using Crowdsourcing and Computer Vision to Measure Visible Neighborhood Conditions". Stanford Digital Repository. Available at https://purl.stanford.edu/xy095yh6422. https://doi.org/10.25740/xy095yh6422.

Collection

Contact information

Also listed in

Loading usage metrics...