Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Authors
Rose E.Wang,
Ana T. Ribeiro,
Carly D. Robinson,
Susanna Loeb,
Dora Demszky
Publication
National Student Support Accelerator
Year of Study
2025

 

Generative AI, particularly Large Language Models (LLMs), can expand access to expert guidance in domains like education, where such support is often limited. We introduce Tutor CoPilot, a Human-AI system that models expert thinking to assist tutors in real time. In a randomized controlled trial involving more than 700 tutors and 1,000 students from underserved communities, students with tutors using Tutor CoPilot were 4 percentage points more likely to master math topics (p<0.01). Gains were highest for students of lower-rated tutors (+9 p.p.), and the tool is low-cost (about $20/tutor/year). Analysis of over 350,000 messages shows Tutor CoPilot promotes effective pedagogy, increasing the use of probing questions and reducing generic praise. In this work we show the potential for human-AI systems to scale expertise in a real-world domain, bridge gaps in skills, and create a future where high-quality education is accessible to all students.

  Media Mentions

| AI CERTs

In many classrooms, teachers juggle planning, grading, and live guidance. Meanwhile, new tutor co-pilot systems promise a digital assistant at every elbow. These teacher-facing AI tools surface prompts, hints, and data during live sessions. As a result, even novice tutors can respond with expert-like moves. Randomized trials from Stanford to Nigeria now show tangible learning gains. Consequently, districts and donors are racing to pilot the technology. However, concerns around pedagogy, privacy, and equity still temper enthusiasm. Understanding the evidence, costs, and risks helps leaders decide when to adopt. This article analyzes tutor co-pilot systems, current data, and practical deployment steps. Along the way, we examine classroom AI trends and links to certified upskilling. Moreover, we highlight how student engagement improves when teachers gain timely insight. Finally, we point toward further research needed for responsible scale.

Tutor Co-Pilot Systems Impact

Stanford’s National Student Support Accelerator ran the largest randomized trial to date. Researchers embedded tutor co-pilot systems within 900 tutors serving 1,800 students. Overall mastery rose four percentage points over control groups. Moreover, students paired with lower-rated tutors gained nine points. World Bank teams replicated positive effects in Nigerian secondary English classes. The AI assistant there delivered 0.31 standard deviation growth within six weeks. Consequently, analysts equated the short program to almost two years of schooling.

...

| The 74

As 2024 reaches its end, it’s a good time to ask what’s coming next for K–12 education.

Nearly five years after the emergence of COVID, the pandemic’s after-effects still ripple through schools and communities, with student learning persistently failing to reach levels seen in 2019. Just under $200 billion in federal assistance to states, which was used to keep districts afloat during the crisis, expired in September — with no further help visible on the horizon.

Increasingly, though, the kids filling American schools have only dim memories of quarantines or virtual instruction. Their experience is instead defined by a rash of trends and technologies that sprang up, or became much more common, during the period when schooling was scrambled: a massive build-out of tutoring programs; the rapid adoption of artificial intelligence as a tool of both academic achievement and academic dishonesty; a rise in student despair and anxiety, which some experts attribute to the spread of smartphones; and, for adolescents, soaring recreational marijuana use under newly permissive state laws.

...

AI Could Get the Most out of Tutors

Tutoring programs exploded in the last five years as states and school districts searched for ways to counter plummeting achievement during COVID. But the cost of providing supplemental instruction to tens of millions of students can be eye-watering, even as the results seem to taper off as programs serve more students.  

That’s where artificial intelligence could prove a decisive advantage. A report circulated in October by the National Student Support Accelerator found that an AI-powered tutoring assistant significantly improved the performance of hundreds of tutors by prompting them with new ways to explain concepts to students. With the help of the tool, dubbed Tutor CoPilot, students assigned to the weakest tutors began posting academic results nearly equal to those assigned to the strongest. And the cost to run the program was just $20 per pupil. 

The paper suggests that tutoring initiatives may successfully adapt to the challenges of cost and scale. Another hopeful piece of evidence appeared this spring, when Stanford University researchers found that a “small burst” program in Florida produced meaningful literacy gains for young learners through micro-interactions lasting just 5–7 minutes at a time. If the success of such models can be replicated, there’s a chance that the benefits of tutoring could be enjoyed by millions more students.

...

| eSchool News

Key points:

...

Although the initial results of AI-powered tutoring are extremely promising, its widespread implementation faces technical, ethical, and practical obstacles. For example, the Tutor CoPilot project has identified several technical challenges, including the need for robust content knowledge, the ability to engage in multi-turn conversations, and the capacity to provide explanations tailored to individual students’ needs. These needs must be addressed before these tools are widely adopted, and they are likely best addressed by a collaborative effort between researchers, educators, and developers to refine and improve these systems to maximize impact.

Embedded evaluation and continuous improvement in all development and deployment phases will help ensure these tools truly deliver on their promise of improving educational outcomes for all students.

AI-enhanced tutoring represents an imminent and transformative opportunity to create a more accessible, equitable, and effective education system. By combining the proven benefits of high-dose tutoring with the scalability and adaptability of AI, educators have the potential to bridge long-standing achievement gaps and provide all students with the support they need to succeed. As AI in tutoring and education progresses, it’s essential to continue to invest in research, development, and implementation of these innovative tools to help bring about educational equity for all students.

...

| Synced

Generative AI, including Language Models (LMs), holds the promise to reshape key sectors like education, healthcare, and law, which rely heavily on skilled professionals to navigate complex responsibilities. In education, for instance, effective teacher training with expert feedback is crucial yet costly, limiting opportunities to enhance educational quality on a larger scale.

In a new paper Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise, a Stanford University research team presents Tutor CoPilot, a new model that offers expert-level guidance to tutors in real time. This study is the first of its kind—a randomized controlled trial testing a Human-AI system in live tutoring scenarios.

Tutor CoPilot aims to enhance K-12 education by providing immediate, actionable guidance to tutors, ultimately improving the live learning experience for students.

...

| Education Week

An AI-powered tutoring assistant increased human tutors’ capacity to help students through math problems and improved students’ performance in math, according to a Stanford University study.

The digital tool, Tutor CoPilot, was created by Stanford researchers to guide tutors, especially novices, in their interactions with students.

The study is the first randomized controlled trial to examine a human-AI partnership in live tutoring, according to the researchers. The study examines whether the tool is effective for improving tutors’ skills and students’ math learning.

It comes as tutoring has become a key learning-recovery tool. Schools, however, have run into challenges in scaling and sustaining tutoring programs because they require a lot of human tutors, time, and money.

In an interview with Education Week, Susanna Loeb, an education professor at Stanford and one of the study’s authors, discussed the creation of the tool, the trial findings, and its implications for schools.

This interview has been edited for brevity and clarity.

...