Images in children’s books convey messages about society and the roles that people play in it. Understanding these messages requires systematic measurement of who is represented. Computer vision face detection tools can provide such measurements; however, state-of-the-art face detection models were trained on photographs, whereas 80% of images in children’s books are illustrated, so existing methods both misclassify faces and miss many of them entirely. In this paper, we introduce a new approach to analyzing images with AI tools, producing data that can be used to assess the representation of race, gender, and age in both illustrations and photographs in children’s books. We make four primary contributions to the fields of deep learning and the social sciences: (1) We curate an original face detection data set (IllusFace 1.0) by manually labeling 5,403 illustrated faces with bounding boxes. (2) We train two AutoML-based face detection models for illustrations: (i) one using IllusFace 1.0 (FDAI) and (ii) one using iCartoon, a publicly available data set (FDAI iC). Each is optimized for illustrated images and detects 2.5 times more faces in our testing data than the established face detector using Google Vision (FDGV). (3) We curate a data set of the race, gender, and age of 980 faces manually labeled by three different raters (CBFeatures 1.0). (4) We train an AutoML feature classification model (FCA) using CBFeatures 1.0. We compare the performance of FCA with that of another AutoML model that we trained on UTKFace, a public data set (FCA UTK), and with that of an established model using FairFace (FCF). Finally, we examine distributions of character identities over the last century across the models. We find that FCA is 34% more accurate than FCF in its race predictions. These contributions provide tools that educators, caregivers, and curriculum developers can use to assess the representation contained in children’s content.
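
To make the evaluation idea concrete, here is a minimal sketch of how labels from three raters might be combined and how two classifiers could then be scored against them. The abstract does not say how the raters' labels are aggregated or how accuracy is computed, so the majority-vote rule, the handling of ties, the function names, and the toy data below are all illustrative assumptions rather than the authors' method.

```python
from collections import Counter

def majority_label(labels):
    """Return the label chosen by most raters, or None when the top count is tied.

    Assumption: majority vote with ties dropped; the paper may aggregate differently.
    """
    counts = Counter(labels).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None
    return counts[0][0]

def accuracy(predictions, truth):
    """Share of predictions matching the consensus label, skipping faces with no consensus."""
    pairs = [(p, t) for p, t in zip(predictions, truth) if t is not None]
    if not pairs:
        raise ValueError("no faces with a consensus label")
    return sum(p == t for p, t in pairs) / len(pairs)

# Made-up toy data: each tuple holds three raters' labels for one face.
ratings = [("A", "A", "B"), ("B", "B", "B"), ("A", "B", "C"), ("C", "C", "A")]
consensus = [majority_label(r) for r in ratings]

# Hypothetical predictions from two classifiers on the same four faces.
model_x = ["A", "B", "A", "A"]
model_y = ["B", "B", "C", "A"]

acc_x = accuracy(model_x, consensus)
acc_y = accuracy(model_y, consensus)
print(f"model_x: {acc_x:.2f}, model_y: {acc_y:.2f}")
```

In this toy example the third face has no majority label and is excluded from scoring; whether a gap between two models is then reported in percentage points or as a relative improvement is a separate choice that the abstract leaves unspecified.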
