Buckinghamshire CC ANPR cameras

Ben Liddicott ben at liddicott.com
Tue Feb 7 18:44:35 GMT 2012


Perhaps worth reiterating that for anonymisation, the object is to have a 
relatively high, fairly uniform, level of collisions.

There are 18million-ish cars on the road. That means that if we are using a 
reasonable hash and masking to 24 bits, we have not performed any 
anonymisation whatsoever. Masking to 18 bits will give us approximately 
10-20 cars in each bucket, which is a bare minimum level of anonymisation at 
a national level.

However most driving doesn't occur at a national level. In Bucks there will 
(based on population) be around 250,000 to 300,000 cars registered.

To get basic anonymisation you really want to restrict the hash to 15,000 to 
30,000 buckets, = 14 to 15 bits at the absolute outside.

(Also recall that the new numbering scheme is regional. So if you want to 
play "where in the world is YY 01 ABC" (made up number) a good guess will 
start with Yorkshire, for example. Anonymisation needs to take this into 
account).

Bucket size may need to be higher/hash length lower if journey information 
can be combined with other information (such as a known start point, 
probable "first sight" entrypoints into the ANPR system etc). If such 
information is available to a malefactor you may struggle to justify more 
than four or five bits as anonymous.

Cheers, Ben

-----Original Message----- 
From: John Wilson
Sent: Sunday, February 05, 2012 2:00 PM
To: UK Cryptography Policy Discussion Group
Subject: Re: Buckinghamshire CC ANPR cameras

Rather to my surprise Bucks CC have given me the details of the
hashing scheme used by ANPR cameras which implement the UTMC protocol
(which is, I think, all of the civil and police ANPR cameras). This
was the result of an FoI request.

D 0 Q are replaced with O (Q isn't used in the current numbering scheme)
1 is replaced with I            (I isn't used in the current numbering 
scheme)
5 is replaced with S
Y is replaced with V
8 and B are replaced with 3  (this may cause problems after 2030)
Z is replaced with 2
F is replaced with E
C is replaced with G
M N W are replaced with H

In the scheme used since 2002 replacing a number by a letter or a
letter by a number will not cause extra collisions.

The transformed plate number is then hashed with the one-at-a-time
hash function described here
http://www.burtleburtle.net/bob/hash/doobs.html

The 32 bit result is reduced to 24 or 18 bits simply by masking.

This is described in the UTMC Technical Guide TR007.001b which, as far
as I can tell is not published on the UTMC site.

If anybody would like a copy of the document please contact me off list.

It would appear that the Highways Agency's statement that a large
prime number is used is untrue.

I'm going to be doing some experiments to see how well the function
does with some generated numberplate data.

John Wilson




More information about the ukcrypto mailing list