Curtis G. Northcutt

cgn |AT| mit |DOT| edu

Resume | Google Scholar | GitHub

Photo of Curtis G. Northcutt, Ph.D. Candidate at MIT.

I am a sixth-year Ph.D. Candidate in Computer Science at MIT specializing in algorithms for robust learning with noisy labels and broad applications, especially in online education.


Theme adapted from orderedlist

News Highlights -- updated 01-20-19

01-20-19: Announcing the L7 blog! A place for machine learning and human learning:
11-30-17: For a tutorial-style framing of the field of Artificial Intelligence in Online Education, including state-of-the-art solutions to important online education problems, as well as bits of my unpublished research, see these slides.
08-15-17: Rank Pruning is a state-of-the-art, robust, time-efficient, general algorithm for classification with noisy labels published at UAI ‘17.
04-20-17: Forum Ranking Diversification published at L@S ‘17.
09-20-16: CAMEO Cheating Detection in MOOCs and online courses published in Computers & Education ‘16.

See news for more.


Curtis G. Northcutt is a grad student at MIT working in machine learning, learning with noisy labels, and human learning with Isaac Chuang. He is supported by an NSF Fellowship and a MITx Digital Learning Research Fellowship. His work focuses on two goals: (1) uncertainty estimation for labels in machine learning datasets, (2) using artificial intelligence to enable human intelligence. To this end, Curtis invented confident learning, a family of theory and algorithms for handling label errors in datasets, and cleanlab, an open-source Python package for characterizing, learning with, and finding label errors in massive datasets.

Curtis has been fortunate to receive the MIT Morris Joseph Levin Masters Thesis Award, the NSF GRFP Fellowship, the Barry M. Goldwater National Scholarship, and the Vanderbilt Founder’s Medal (Valedictorian). Curtis created and manages the cheating detection system used by MITx and HarvardX online course teams, particularly in MicroMasters courses. While at MIT, he TA’d 6.867, a large graduate machine learning course.

Research Manifesto

Industry and Institutional Research

I am fortunate to have had the opportunity to work or intern at:

as well as academic collaborations and visiting research with MIT, Harvard, Vanderbilt, Notre Dame, and the University of Kentucky. Details here.

The Gift of Education

When you educate a person, you empower them within their community, and when you empower people socially, you give them hope, purpose, opportunity, and most importantly, you give them freedom.

Growing up below the poverty line in rural Kentucky, I experienced a glass ceiling of limited human and monetary resources. The ladder of opportunity often rises from prosperity rather than ability. My ladder was my education. Education led to exposure, then summer programs, then small scholarships, then bigger scholarships, and eventually opportunity. Everyone deserves access to quality educational resources – this underlies my motivation to pursue research that democratizes education.

To this end, I develop robust machine learning algorithms to enable open learning, i.e. to make advanced education more accessible. I work with edX student data to (1) infer user-intent across terabytes of noisy, massive interaction datasets and (2) implement prediction, inference, and detection algorithms distributed across 400+ MITx and HarvardX open online courses. For example, I ensure the legitimacy of online course certificates via cheating detection algorithms and with the help of exceptional colleagues, have demonstrated how machine learning can transform human learning with accurate proficiency estimation and diversification of comment rankings in discussion forums.

Life Mantras

When asked if I like rap, I always recommend PomDP the PhD rapper.