Research Priorities

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Generative AI, particularly Large Language Models (LLMs), can expand access to expert guidance in domains like education, where such support is often limited. We introduce Tutor CoPilot, a Human-AI system that models expert thinking to assist tutors in real time. In a randomized controlled trial involving more than 700 tutors and 1,000 students from underserved communities, students with tutors using Tutor CoPilot were 4 percentage points more likely to master math topics (p0.01). Gains were highest for students of lower-rated tutors (+9 p.p.), and the tool is low-cost (about $20/tutor/year). Analysis of over 350,000 messages shows Tutor CoPilot promotes effective pedagogy, increasing the use of probing questions and reducing generic praise. In this work we show the potential for human-AI systems to scale expertise in a real-world domain, bridge gaps in skills, and create a future where high-quality education is accessible to all students.

The Post-Pandemic Promise of High-Impact Tutoring

As U.S. public schools emerged from the COVID-19 pandemic, longtime education policy wonk Liz Cohen saw that in many places, educators were finally taking tutoring seriously. 

For a year and a half in 2023 and 2024, Cohen traversed the country, interviewing educators, researchers and policymakers and observing tutoring sessions in seven states and the District of Columbia

Now the vice president of policy for the education group 50CAN, Cohen shares her findings in a new book, out today from Harvard Education Press: The Future of Tutoring: Lessons from 10,000 School District Tutoring Initiatives.

You've Paid for Tutoring. Here's How to Make Sure It Works.

Upon deeper review, however, these findings leave room for optimism. First, researchers found that lower-cost virtual tutoring models — approximately $1,200/student — were just as impactful as in-person models at $2,000/student, suggesting that tutoring can be less expensive without sacrificing impact.

Second, these findings highlight what's possible when students receivetutoring that comes closer to the definition of "high-impact." For example, the effect of tutoring was largest — about 3.5 months of learning — in a New Mexico district where students received more than 2,000 minutes of tutoring per year. Across all districts in the study, this amount most closely aligned with the recommendations for implementing a high-impact model.

How is ChatGPT impacting schools, really? Stanford researchers aim to find out

A new collaboration between Stanford’s SCALE and OpenAI, the creator of ChatGPT, strives to better understand how students and teachers use the popular AI platform and how it impacts learning

Education is one of the fastest-growing use cases of AI products. Students log on for writing assistance, brainstorming, image creation, and more. Teachers tap into tools like attendance trackers, get curriculum support to design learning materials, and much more.

Yet despite the rapid growth – and potential – a substantial gap remains in knowledge about the efficacy of these tools to support learning. 

A new research project from the Generative AI for Education Hub at SCALE, an initiative of the Stanford Accelerator for Learning, aims to help fill that gap by studying how ChatGPT is used in K-12 education. In particular, the research will examine how secondary level teachers and students use ChatGPT. 

The Impact of High-Impact Tutoring on Student Attendance: Evidence from a State Initiative

This study provides compelling evidence that tutoring can do more than boost test scores; it can actually get students back in the classroom. On average, students were 1.2 percentage points less likely to be absent on days when they were scheduled to receive tutoring, suggesting that they are motivated to participate in tutoring. This impact was even greater for middle schoolers and students who’d missed more than 30% of school days the prior year. The study also found that the design matters: tutoring only improved attendance when it combined at least two evidence-based features like small groups, frequent sessions, and in-school delivery.

2024-25 Snapshot of State Tutoring Policies

Over the past three school years, districts and states have worked to recover the academic ground they lost over the course of the COVID-19 pandemic and chip away at stubborn gaps in academic performance. Many turned to high-impact tutoring, a research-based approach to providing individualized instruction to students. In the School Pulse Panel Survey in October, 2024, 78% of responding schools reported having some type of tutoring for students and 37% reported offering high-dosage tutoring or what we refer to as high-impact tutoring.

Stanford initiative helps scale what works in education

Over the past couple of years, scaling well-researched solutions has been shown to also counter the negative effects of the pandemic, Loeb said, from widening achievement gaps and missed school time, to poorer social and emotional development. Her team recently launched the National Student Support Accelerator (NSSA) to address educational inequities resulting from the pandemic. NSSA conducts research on the most promising tutoring practices and works with district leaders and others to provide research-backed guidance on implementing high-impact tutoring. 

“Our students deserve this work,” Loeb said. “From our research, we learn so much about how to engage students and accelerate their learning. The practical, easy-to-use learnings from research need to reach decision makers so that our students can benefit.”

Tips by Text and NSSA are part of SCALE, Loeb’s new initiative at the Stanford Accelerator for Learning, a university-wide effort addressing some of the most challenging issues in education through research, partnerships, and technological innovation. 

Tags

2023-2024 Wittenberg University High-Impact Tutoring Program Implementation Report

In recent years, school districts across the U.S. have invested in high-impact tutoring as a promising approach to accelerate K12 student learning. Such efforts to scale tutoring have focused on design elements proven to be the most effective on student outcomes, namely consistent instruction from a trained tutor, integration with classroom instruction, tutoring informed by data, using quality curricula, and occurring at least three times per week (Nickow et al., 2024). Studies indicate that effective tutoring programs share these core characteristics, even while they vary in the types of tutors they employ, scheduling strategy, and in-person or virtual delivery model (Cortes et al., 2024; Robinson et al., 2024).

Stanford U’s Tutor CoPilot Transforms Real-Time Tutoring with AI-Driven Expert Guidance

Generative AI, including Language Models (LMs), holds the promise to reshape key sectors like education, healthcare, and law, which rely heavily on skilled professionals to navigate complex responsibilities. In education, for instance, effective teacher training with expert feedback is crucial yet costly, limiting opportunities to enhance educational quality on a larger scale.

Tags