Digital Humanities

Decision-Making Under Pressure during My PhD: Lessons from whale songs and ocean noise

May 6, 2025
by Jaewon Saw. This blog post shares a story from a field experiment using Distributed Acoustic Sensing (DAS) to detect whale vocalizations in Monterey Bay. Most of the data got overwhelmed by noise from boat engines, wave motion, and cable instability. On the final day, a spur-of-the-moment decision to add loops to the fiber optic cable dramatically improved signal quality.

Predicting the Future: Harnessing the Power of Probabilistic Judgements Through Forecasting Tournaments

April 29, 2025
by Christian Caballero. From the threat of nuclear war to rogue superintelligent AI to future pandemics and climate catastrophes, the world faces risks that are both urgent and deeply uncertain. These risks are where traditional data-driven models fall short—there’s often no historical precedent, no baseline data, and no clear way to simulate a future world. In cases like this, how can we anticipate the future? Forecasting tournaments offer one answer, harnessing the wisdom of crowds to generate probabilistic estimates of uncertain future events. By incentivizing accuracy through structured competition and deliberation, these tournaments have produced aggregate predictions of future events that outperform well-calibrated statistical models and teams of experts. As they continue to develop and expand into more domains, they also raise urgent questions about bias, access, and whose knowledge gets to shape our collective sensemaking of the future.

Elijah Mercer

Data Science Fellow 2024-2025
School of Information

Elijah, originally from Newark, New Jersey, now resides in San Francisco, California, dedicated to social and juvenile justice. With a Criminology degree from American University, he began as a research intern at the Investigative Reporting Workshop, focusing on the Digital Divide.

Teaching in Baltimore with Teach for America reinforced his belief in research and data for marginalized communities. In roles at the Coalition Against Insurance Fraud, New York Police Department, and San Francisco District Attorney’s Office, Elijah used data to combat crime. Now...

Navigating AI Tools in Open Source Contributions: A Guide to Authentic Development

December 17, 2024
by Sahiba Chopra. The rise of ChatGPT has transformed how developers approach their work - but it might be hurting your reputation in the open-source community. While AI can supercharge your productivity, knowing when not to use it is just as crucial as knowing how to use it effectively. This guide reveals the unspoken rules of AI usage in open source, helping you navigate the fine line between leveraging AI and maintaining authenticity. Learn when to embrace AI tools and when to rely on your own expertise, plus get practical tips for building trust in the open-source community.

Teaching Data Science as a Tool for Empowerment

February 18, 2025
by Elijah Mercer. Data literacy is a powerful tool for empowerment, especially for historically marginalized communities. Through Data Cafecito at Roadmap to Peace and helping teach Data 4AC at UC Berkeley, Elijah Mercer helps bridge the gap between data, advocacy, and justice. Data Cafecito fosters culturally responsive data practices for Latinx-serving organizations, while Data 4AC challenges students to critically analyze data’s role in systemic inequities. Drawing from his experience in education, Mercer uses interactive teaching methods to make data accessible and meaningful. By centering storytelling and community-driven insights, he aims to equip individuals with the skills to use data for social change.

Field Experiments in Corporations

January 28, 2025
by Yue Lin. How do social science researchers conduct field experiments with private actors? Yue Lin provides a brief overview of the recent developments in political economy and management strategy, with a focus on filing field experiments within private corporations. Unlike conventional targets like individuals and government agencies, private companies are an emergent sweet spot for scholars to test for important theories, such as sustainability, censorship, and market behavior. After comparing the strengths and weaknesses of this powerful yet nascent method, Lin brainstorms some practical solutions to improve the success rate of field experimental studies. She aims to introduce a new methodological tool in a nascent research field and shed some light on improving experimental quality while adhering to ethical standards.

Why Data Disaggregation Matters: Exploring the Diversity of Asian American Economic Outcomes Using Public Use Microdata Sample (PUMS) Data

February 11, 2025
by Taesoo Song. Asian Americans are often overlooked in discussions of racial inequality due to their high average socioeconomic attainment. Many academic and policy researchers treat Asians as a single racial category in their analysis. However, this broad categorization can mask significant within-group disparities, leaving many disadvantaged individuals without access to vital resources and policy support. Song emphasizes the importance of data disaggregation in revealing Asian American inequalities, particularly in areas like income and homeownership, and demonstrates how breaking down these categories can lead to more targeted and effective policy solutions.

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025
by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.

Digitizing Inclusion: FinTech’s Promise and Pitfalls in the Global South

April 22, 2025
by Victoria Hollingshead. FinTech promises to revolutionize financial inclusion by harnessing data science to reach populations historically excluded from formal financial systems. By analyzing digital footprints, mobile payments, and behavioral data, startups and financial institutions have the potential to improve customer-lender interactions and revolutionize screening and monitoring techniques. While FinTech shows promise of enabling financial access, it also raises critical questions: how do we implement financial inclusion without reproducing the structures of the past? And more rhetorically, can financiers be the arbiters of financial inclusion, if their intrinsic role is to stratify and exclude?

Sharing Just Enough: The Magic Behind Gaining Privacy while Preserving Utility

April 15, 2025
by Sohail Khan. Netflix knows what you like, but does it need to know your politics too? We often face a frustrating choice: share our data and be tracked, or protect our privacy and lose personalization. But what if there was a third option? This article begins by introducing the concept of the privacy-utility trade-off, then explores the methods behind strategic data distortion, a technique that lets you subtly tweak your data to block sensitive inferences (like political views) while still maintaining useful recommendations. Finally, it looks ahead and advocates for a future where users, not platforms, shape the rules, reclaiming control of their own privacy.