chiark / gitweb /
split out dcgi_get_cookie
[disorder] / server / README.dbversions
CommitLineData
8818b7fc
RK
1DisOrder Database Versions
2==========================
3
4If no _dbversion global preference is found then database version 1 is
5assumed. Database versions 2 and above always have a _dbversion
6global preference.
7
8Old database versions can be PARTIALLY emulated for testing purposes
9by setting the undocument dbversion configuration item. Setting it on
10a production system would be a terrible idea.
11
12Database Version 1
13------------------
14
15Path names are in UTF-8, but with no normalization applied: you get
16whatever the filesystem gives you.
17
18Search terms are split according to the old words() function.
19 - "/", ".", "+", "&", ":", "_" and "-" are considered to be separators
20 - anything in General_Category Cc, Cf, Co, Cs, Zl, Cp, Sz, Pe or Ps
21 is considered to be a separator
22 - anything else in General_Category Ll, Lm, Lo, Lt, Lu, Nd, Nl, No,
23 Sc, Sk, Sm or So is considered to be part of a word
24 - everything else is ignored
25
26Search terms are case-folded by applying the CaseFolding.txt mapping,
27without any attempt at normalization.
28
29Database Version 2
30------------------
31
32Path names are in UTF-8, normalized to NFC.
33
34Search terms are split according to the default Unicode word boundary
35detection algorithm.
36
37Search terms are case-folded using the Unicode case-folding algorithm,
38normalizing to NFKD.
39
40Things that haven't been done yet:
41 - undump support for new dbversion
42 - automatic upgrade from dbversion 1