AI determines CEFR level at 90% accuracy

In collaboration with experienced English Language Teaching (ELT) authors, EDIA created a machine learning algorithm that classifies any text on the standard CEFR (Common European Framework of Reference for Languages) difficulty scale. The algorithm assigns the exact CEFR level with 90% accuracy, which is higher than human experts currently achieve.

Training data & ELT Experts

The training data consists of a variety of English texts ranging from academic papers and news articles to children’s books. Each text was rated by three of our eight ELT experts, all of whom have significant experience writing textbooks, graded readers, assessments and similar content for renowned ELT publishers.

Figure: a less reliable expert vs. the median of the two other experts, per text.

Algorithm

The algorithm is trained to minimise the distance between its prediction and the median rating of the domain experts for each text. It takes a variety of measures into account, ranging from the role and frequency of individual words to the grammatical structure of sentences.
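
The training target described above can be illustrated with a minimal sketch. It is not EDIA's implementation: the ordinal encoding of the six CEFR levels, the choice of absolute difference as the distance, the function names and the example ratings are all assumptions made for illustration.

```python
from statistics import median

# Assumption: CEFR levels are encoded as ordinal indices 0 (A1) to 5 (C2).
CEFR_LEVELS = ["A1", "A2", "B1", "B2", "C1", "C2"]
LEVEL_TO_INDEX = {level: i for i, level in enumerate(CEFR_LEVELS)}


def median_target(expert_ratings):
    """Return the median of the expert ratings for one text, as an ordinal index."""
    return median(LEVEL_TO_INDEX[r] for r in expert_ratings)


def mean_distance(predicted, targets):
    """Mean absolute distance between predicted and target indices (assumed metric)."""
    return sum(abs(p - t) for p, t in zip(predicted, targets)) / len(targets)


# Hypothetical data: three expert ratings per text.
ratings_per_text = [
    ["A2", "B1", "B1"],
    ["B2", "B2", "C1"],
    ["A1", "A2", "B1"],
]
targets = [median_target(r) for r in ratings_per_text]

# Hypothetical model output for the same three texts.
predicted = [LEVEL_TO_INDEX[p] for p in ["B1", "C1", "A2"]]
loss = mean_distance(predicted, targets)
```

Using the median rather than the mean keeps a single outlying rating from shifting the target, which fits a setup where individual experts may be less reliable than the group.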
