Data Codebook - BRCA1/2 Mutation Carrier Studies - Revised Data Dictionary
Rules for Generating UIDs
General Rule
*UID - Unique identifier (formed by some combination of a center + family + individual code)
Format - Two letter center code + 5 digits for fam + 5 digits for ego; fill in gaps with zeros.
Example: CID: UP, FAM: 22; EGO: 451; UID= UP0002200451
Site Specific Information
AU=Austria:
1. Two letter center code (AU);
2. Austria has different subsites previously identified by a third/fourth letter in the UID. That letter will now be replaced with a corresponding number (A = 01,
B = 02,
F = 03,
G = 04,
KN = 05,
M = 06,
N = 07,
OÖ = 08,
ST = 09,
T = 10,
V = 11);
3. Three digits for the family number;
4. Five digits for the individual number
Example: Family A2 Individual A2.1: AU0100200001
BA=Baylor College of Medicine (PI: Plon): Follows general rule.
BI=Beth Israel use BIDMC code preceded by necessary zeros, the BIDMC #= EGO; there is no family number.
BU=Baylor University Medical Center-Dallas (PI: Blum): Will be assigned a UID by CREP; be sure to put BU informant as CID.
CR=Creighton: Follows general rule.
CH=City of Hope: From uniquefamilynum (UFN#) provided (which is a 9-digit numeric code), use the first five digits which are for the FAM. The rest of the UFN is followed by a dash and four digits for the date of birth (which can be ignored). Then use the UPN#, which is the ego, to generate the next five digits.
DF=Dana Farber: Follows general rule.
DU=Duke: Follows general rule.
EH=Evanston Northwestern Healthcare: EH’s study ID: EH05340 followed by three digits for the family number and two digits for the EGO.
FC=Fox Chase: UID Format: Two letter center code, four digits for FAM and six digits for EGO; fill in gaps with zeros. Example CID: FC, FAM: 22; EGO: 451; UID=FC0022000451.
GU=Georgetown: Follows general rule.
KF=Australia (KF for KCon Fab): UID is generated by placing KF0 preceding the UPN, and removing all decimal points.
Example: UPN= 04.009.0682 (generated using: year recruited. clinic. incrementing number), UID= KF0040090682. Family ID will be retained in separate column and not as part of UID.
Clinic codes:
001,
002,
003,
004,
005,
006,
007,
008,
009,
010,
888.
LA=University of California, Los Angeles: Follows general rule.
MA=Mayo: Follows general rule.
NK=Netherlands Cancer Institute: Has no FAM, just EGO; just put in all zeros for the FAM ID.
RM=Royal Marsden: FAM and EGO are both six digits and often the same number. Use the EGO as the UID preceded by four zeros. When the FAM doesn’t match the EGO, put in "01" in the beginning to indicate the discrepancy.
Example:
UID |
Center Code |
FAM |
EGO |
|
| RM0000368074 | RM | 368074 | 368074 | |
| RM0100374715 | RM | 367873 | 374715 |
SM=St. Mary’s: Strip letters from ego (GE, AF, UF etc…), then follow general rule.
SW=UT Southwestern: Did their own, stick with what they send.
UC=Univ. of Chicago: First two digits for year of ascertainment (part of FAM), next three digits used to complete the FAM ID and then last five digits for the EGO. Example #1: UC 94-19-1: UC9401900001; Example #2: UC 2000-134-1: UC0013400001.
UP=Univ. of Pennsylvania: Follows general rule.
UT=Univ. of Utah: In some cases have to strip FAM numbers from EGO (if those numbers have been duplicated in the EGO), but generally follows general rule.
WC=Women’s College Hospital: In some cases have to strip FAM numbers from EGO, then follow general rule.
YA= Yale University: Have year of ascertainment, then follow general rule.
