Why do Yoti facial age estimation results published by NIST differ from those reported by Yoti in its white papers?

Rachael Trotman, 4 min read
Yoti's Facial Age Estimation results versus the NIST Age Estimation evaluation report

In September 2023, we submitted our facial age estimation model to the US National Institute of Standards and Technology (NIST), as part of a public testing process. This is the first time since 2014 that NIST has evaluated facial age estimation algorithms. NIST age estimation reports are likely to become a globally trusted performance guide for vendor models.

NIST assessed vendor facial age estimation models using four test data sets at certain image sizes:

NIST image sizes used in the evaluation of Yoti's Facial Age Estimation

NIST provides some example images:

Fig. 5. The figure gives simulated samples of application type image used in the evaluation. Image source: Authors.

Fig. 4. Examples of mugshot images used in the evaluation. Image source: NIST Special Database 32: Multiple Encounter Deceased Subjects (MEDS).

NIST notes in its report that age estimation accuracy “will depend on the quality of the images” and on the type of facial images captured.

For six years, we have trained our model primarily on selfies of people looking into a mobile phone camera (or a laptop camera), because this is the obvious way for customers to capture a live facial image for age estimation. We capture these facial images at 720 x 800 pixels, with the face closely cropped to maximise facial detail, because we have learned that this image size gives businesses higher age estimation accuracy.

We believe our training and testing on mobile phone images with closely cropped faces at 720 x 800 pixels are key reasons why the mean absolute errors (MAEs) and false positive rates (FPRs) Yoti publishes for its model are lower (more accurate) than the figures NIST published for its four test data sets.

Table displaying the differences in performance between the NIST evaluation results and Yoti's own testing results of Yoti's Facial Age Estimation.

NIST selected FPR objectives of 10%, 5% and 1% in its report as a way to benchmark its evaluation. As can be seen from the table above, NIST reports that Yoti’s age estimation model is more accurate on the larger ‘Mugshot’ images than on the smaller ‘Application’ images. Consequently, the age thresholds required to meet FPRs of 10%, 5% and 1% are lower for Mugshot images than for Application images. The age thresholds required to meet these FPRs are lower still when the Yoti model estimates age from larger facial images captured on a mobile phone.
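To make the link between accuracy, FPR and age threshold concrete, here is a minimal sketch in Python. It is not NIST's or Yoti's evaluation code. It assumes a false positive is a person under the legal age whose estimated age is at or above the chosen threshold, and it uses a handful of made-up ages purely for illustration. All function names are hypothetical.

```python
def mean_absolute_error(true_ages, estimated_ages):
    """Average absolute gap between true and estimated age, in years."""
    gaps = [abs(t - e) for t, e in zip(true_ages, estimated_ages)]
    return sum(gaps) / len(gaps)


def false_positive_rate(true_ages, estimated_ages, legal_age, threshold):
    """Share of under-age people estimated at or above the threshold.

    Assumption: this is the FPR definition used for illustration here.
    """
    minors = [e for t, e in zip(true_ages, estimated_ages) if t < legal_age]
    if not minors:
        raise ValueError("no under-age subjects in the data")
    return sum(e >= threshold for e in minors) / len(minors)


def lowest_threshold_for_fpr(true_ages, estimated_ages, legal_age,
                             target_fpr, max_threshold=40):
    """Smallest whole-year threshold whose FPR does not exceed the target."""
    for threshold in range(legal_age, max_threshold + 1):
        fpr = false_positive_rate(true_ages, estimated_ages,
                                  legal_age, threshold)
        if fpr <= target_fpr:
            return threshold
    return None  # target not reachable within the search range


# Made-up data for illustration only (not real evaluation results).
true_ages = [14, 15, 16, 17, 17, 16, 15, 17, 19, 22]
estimated = [15.0, 17.5, 18.2, 19.1, 20.4, 16.0, 14.2, 21.3, 20.0, 23.5]

print(mean_absolute_error(true_ages, estimated))
for target in (0.25, 0.125, 0.0):
    print(target, lowest_threshold_for_fpr(true_ages, estimated, 18, target))
```

The sketch shows why a more accurate model needs a lower threshold: when estimates for under-age people sit closer to their true ages, fewer of them land at or above any given threshold, so the target FPR is met sooner.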

NIST used over 11 million facial images (with verified age) to test vendors. Some readers may wonder why NIST did not also test vendors with a set of mobile phone camera facial images, given that this is how most images will be captured for online age estimation.

The reality is that it is very challenging to capture, with consent, a database of millions of mobile phone facial images with ground truth date of birth evidence from individuals representative of many countries across the world.

Yoti is fortunate to have a very large set of consented and anonymised facial images from Yoti app users, verified against government-issued age data. By separating out ~120,000 of these images as diverse test data across each year of age, from the many millions of images used to train our algorithm, we have confidence in the accuracy figures we publish in our white paper (based primarily on mobile phone facial images at 720 x 800 pixels).
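One common way to hold out test data across each year of age is a stratified split, sketched below in Python. This is an illustrative assumption about how such a split could be done, not a description of Yoti's actual pipeline. The record layout, the function name `holdout_by_age` and the toy data are all hypothetical.

```python
import random
from collections import defaultdict


def holdout_by_age(records, per_age, seed=0):
    """Split (image_id, age_in_years) records into train and test sets.

    Up to `per_age` records are held out for every year of age, so the
    test set covers each age rather than mirroring the training mix.
    """
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    by_age = defaultdict(list)
    for record in records:
        by_age[record[1]].append(record)

    train, test = [], []
    for age in sorted(by_age):
        group = by_age[age][:]
        rng.shuffle(group)
        test.extend(group[:per_age])
        train.extend(group[per_age:])
    return train, test


# Toy records for illustration only: 50 image ids spread over ages 13 to 17.
records = [(f"img{i}", 13 + i % 5) for i in range(50)]
train, test = holdout_by_age(records, per_age=2)
print(len(train), len(test))
```

Keeping the held-out images separate from the training images matters because accuracy measured on images the model has already seen would overstate real-world performance.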

As part of the document authenticity checks in our identity verification service, we compare the age estimated from a user's selfie with the real age from their document, which also helps us test the accuracy of the model.

Finally, Yoti’s facial age estimation model was first tested for accuracy, and positively certified, in November 2020 by ACCS, a UK-accredited testing agency. Our age estimation model is used by some of the largest online brands, including Meta and OnlyFans, both of which have publicly stated that it works very well.
