Background
Methods
Study population and DNA extraction
Primer design
Name | Sequence (5′-3′) | Product size |
---|---|---|
A3D (12.1 kb) | ||
A3D Forward primer | AGGAAGCCTCGCTCTCTCA | 12,069 bp |
A3D Forward primer | CAGGCAGGGTCTTGATCTGT | |
A3D Amplicon 1F | AAAAAGAGGGAGACTGGGACAAGCGTATCTAAGA | 4,300 bp |
A3D Amplicon 1R | GAGTGTGGGTGAGGGGGTGTAACCATGAG | |
A3D Amplicon 2F | AGCTAGGAGAGGTCACCCTG | 3,188 bp |
A3D Amplicon 2F | CAGGAGGCTAGAAGAGACAGACCATGAGGC | |
A3F (13.31 kb) | ||
A3F 1st round f | ACCAGAAAGAGGGTGAGAGACTGAGGAAGATAAAG | 13,142 bp |
A3F 1st round rv | AGCCATTTATTGCAGAAGCTATGGATAAAGCTGGT | |
A3F Amplicon 1 f | ACCAGAAAGAGGGTGAGAGACTGAGGAAGATAAAG | 4,918 bp |
A3F Amplicon 1 rv | GGGTGAGGGGTGTAACCATG | |
A3F Amplicon 2 f | TTCAGAAACCCGATGGAGGC | 4,478 bp |
A3F Amplicon 2 rv | AGCCATTTATTGCAGAAGCTATGGATAAAGCTGGT | |
A3G (10.74 kb) | ||
A3G 1st round f | TGTTAACCAGAGGCTGCTCTTCCCAGG | 11,852 bp |
A3G 1st round rv | TCCCTGGGACTCAGCTCC | |
A3G Amplicon 1 f | ATTTGTCCCCAGCTCTGTGG | 3,231 bp |
A3G Amplicon 1 rv | AGAGGACCTGGTCTGGAACA | |
A3G Amplicon 2 f | CAAGGGAGGAAGCGTGGAG | 3,908 bp |
A3G Amplicon 2 rv | TGCATTGCTTTGCTGGTGTC | |
A3H (6.8 kb) | ||
APOBEC3 H forward primer full length | TCTGTTGCACAGAAACACGATGG | 3522bp |
APOBEC3 H reverse primer full length | CAACTGACATGCCCCAGGG | |
APOBEC3 H forward primer Exon2 (A3HfE2) | TCTGTTGCACAGAAACACGATGG | 452bp |
APOBEC3 H Reverse primer Exon 2(A3HrE2) | TTCCCGAAGTAGTGACTGAGC | |
APOBEC3 H forward primer Exon 3 &4(A3HfE3/4 | GCCACGCACTAGAAAGTTCAC | 934bp |
APOBEC3 H Reverse primer Exon 3&4(A3HrE3/4) | ACAGTGCCTCACCTTTATCC |
Polymerase chain reaction (PCR) to amplify A3D, A3F, A3G and A3H genes
Fragmentation, tagmentation and addition of Illumina indices
Library normalization, pooling and sequencing
Demultiplexing and sequence quality control evaluation
Sequence filtering, trimming, mapping and variant calling
Statistical analysis
Hardy-Weinberg equilibrium (HWE) and allele frequency comparisons
Pairwise linkage disequilibrium (LD) and haplotype assignment
Results
Single nucleotide polymorphisms (SNPs), detection of indels and verification
APOBEC3D
Amino acid change and variant ID | Type of change | Genotypes nt # in CDS | Exon | Frequencies (%) | Hardy Weinberg equilibrium |
---|---|---|---|---|---|
APOBEC 3D nonsynonymous changes (n = 168) | |||||
R6K (NI) | AGA➔AAA Transition | 17G/G 17G/A | 1 | 151 (89.9) 17 (10.1) | P-value = 0.49 X2 = 0.48 |
R97C (rs75858538) | CGC➔TGC Transition | 289 C/C 289 C/T 289 T/T | 1 | 148 (88.1) 19 (11.3) 1 (0.6) | P-value = 0.65 X2 = 0.20 |
L221R (NI) | CTG➔CGG Transition | 662 T/T 662 T/G | 5 | 165 (98.2) 3 (1.8) | P-value = 0.91 X2 = 0.01 |
C224Y (rs772893975) | TGT➔TAT Transition | 671G/G 671G/A | 5 | 161 (95.8) 7 (4.2) | P-value = 0.78 X2 = 0.076 |
T238A (rs201709403) | ACA➔GCA Transition | 712A/A 712A/G | 5 | 154 (91.7) 14 (8.3) | P-value = 0.57 X2 = 0.32 |
T238I (NI) | ACA➔ATA Transition | 713C/C 713C/T | 5 | 160(95.2) 8(4.8) | P-value = 0.75 X2 = 0.10 |
R248K (rs61748819) | AGG➔AAG Transition | 743 G/G 743 G/A 743 A/A | 5 | 133 (79.2) 33 (19.6) 2 (1.2) | P-value = 0.98 X2 = 0.001 |
C320Y (rs61999342) | TGC➔TAC Transition | 959 G/G 959 G/A | 6 | 167 (99.4) 1 (0.6) | P-value = 0.97 X2 = 0.001 |
APOBEC3D synonymous changes | |||||
L221 L (rs769426665) | CTG ➔CTC Transversion | 663G/G 663G/C | 5 | 165 (98.2) 3 (1.8) | P-value = 0.91 X2 = 0.01 |
T316 T (rs184448269) | ACC➔ACT Transition | 948 C/C 948 C/T | 6 | 161 (95.8) 7 (4.2) | P-value =0.78 X2 = 0.08 |
APOBEC 3F nonsynonymous changes (n = 154) | |||||
R48P (rs35053197) (in isoform 201 and 202) | CGT➔CCC Transversion | 143 G/G 143 G/C | 2 | 142 (92.2) 12 (7.8) | P-value = 0.61 X2 = 0.25 |
A78V (rs5750728) (only in isoform 201) | GCC➔GTC Transition | 233 C/C 233 C/T 233 T/T | 4 | 95 (61.7) 56 (36.4) 3 (1.9) | P-value = 0.10 X2 = 2.64 |
I87L (rs146543452) (only in isoform 201) | ATC➔CTC Transversion | 259 A/A 259 A/C | 4 | 153 (99.4) 1(0.6) | P-value = 0.97 X2 = 0.002 |
Q87L (rs114704208) (only in isoform 202) | CAG➔CTG Transversion | 260 A/A 260 A/T | 3 | 145 (94.2) 9 (6.8) | P-value = 0.71 X2 = 0.14 |
A108S (rs2020390) (only in isoform 201) | GCT➔TCT Transversion | 322 G/G 322 G/T 322 T/T | 4 | 54 (35.1) 84 (54.5) 16 (10.4) | P-value = 0.04 X2 = 4.02 |
V231I (rs2076101) (only in isoform 201) | GTC➔ATC Transition | 691 G/G 691 G/A 691 A/A | 5 | 122 (79.2) 30 (19.5) 2 (1.3) | P-value = 0.92 X2 = 0.01 |
Y307C (rs12157816) (only in isoform 201) | TAC➔TGC Transition | 920 A/A 920 A/G | 6 | 139 (90.3) 15 (9.7) | P-value = 0.52 X2 = 0.40 |
APOBEC3F synonymous changes | |||||
I117I (NI) (only in isoform 201) | ATC➔ATT Transition | 351 C/C 351 C/T | 4 | 152 (98.7) 2 (1.3) | P-value =0.94 X2 = 0.007 |
S118S (rs35928287) (only in isoform 201) | TCC➔TCT Transition | 354 C/C 354 C/T | 4 | 113 (73.4) 41 (26.6) | P-value =0.06 X2 = 3.63 |
R143R (rs4821862) (only in isoform 201) | CGC➔CGT Transition | 429 C/C 429 C/T 429 T/T | 4 | 19 (12.3) 91 (59.1) 44 (28.6) | P-value = 0.01 X2 = 7.04 |
Y196Y (rs765418322) (only in isoform 201) | TAT➔TAC Transition | 588 T/T 588 T/C 588 C/C | 4 | 126 (81.8) 24 (15.6) 4 (2.6) | P-value = 0.04 X2 = 4.09 |
S229S (rs549550231) (only in isoform 201) | TCA➔TCG Transition | 687 A/A 687 A/G | 5 | 152 (98.7) 2 (1.3) | P-value = 0.94 X2 = 0.007 |
E245E (rs113109079) (only in isoform 201) | GAG➔GAA Transition | 735 G/G 735 G/A 735 A/A | 5 | 146 (94.8) 7 (4.5) 1 (0.7) | P-value = 0.01 X2 = 6.09 |
S327S (rs35895636) (only in isoform 201) | TCC➔TCT Transition | 981 C/C 981 C/T 981 T/T | 5 | 128 (83.2) 23 (14.9) 3 (1.9) | P-value = 0.12 X2 = 2.39 |
APOBEC 3G nonsynonymous changes (n = 165) | |||||
H186R (rs8177832) | CAC➔CGC Transition | 557 A/A5 557 A/G 557 G/G | 4 | 63 (38.2) 82 (49.7) 20 (12.1) | P-value = 0.39 X2 = 0.73 |
R256H (rs17000736) | CGC➔CAC Transition | 767 G/G 767 G/A | 6 | 161(97.6) 4 (2.4) | P-value = 0.87 X2 = 0.02 |
Q275E (rs17496046) | CAG➔GAG Transversion | 823 C/C 823 C/G 823 G/G | 6 | 111 (67.3) 49 (29.7) 5 (3.0) | P-value = 0.88 X2 = 0.02 |
G363R (rs148267053) | GGA➔AGA Transition | 1087 G/G 1087 G/A | 7 | 148 (89.7) 17 (10.3) | P-value = 0.49 X2 = 0.49 |
APOBEC3G synonymous changes | |||||
S60S (rs112603901) | TCC➔TCT Transition | 180 C/C 180 C/T | 3 | 147 (89.1) 18 (10.9) | P-value =0.46 X2 = 0.55 |
A109A (rs375760983) | GCC➔GCT Transition | 327 C/C 327 C/T | 3 | 164 (99.4) 1 (0.6) | P-value =0.97 X2 = 0.002 |
F119F (rs5757465) | TTT➔TTC Transition | 357 T/T 357 T/C | 3 | 164 (99.4) 1 (0.6) | P-value =0.97 X2 = 0.002 |
L371 L (rs11545130) | CTG➔TTG Transition | 1111 C/C 1111 C/T | 7 | 158 (95.8) 7 (4.2) | P-value =0.78 X2 = 0.08 |
APOBEC 3H nonsynonymous changes (n = 133) | |||||
N15Δ (rs140936762) | -CAA Deletion | 45 CAA/CAA 45 CAA/ Δ 45 Δ / Δ | 1 | 29 (21.8) 49 (36.8) 55 (41.4) | P-value = 0.001 X2 = 10.25 |
R18L (rs139293) | CGC➔CTC Transversion | 53 G/G 53 G/T 53 T/T | 1 | 112 (84.2) 15 (11.3) 6 (4.5) | P-value = 0.00 X2 = 15.9 |
G105R (rs139297) | GGC➔CGC Transversion | 313 G/G 313 G/C 313 C/C | 2 | 1 (0.8) 4 (3.0) 128 (96.2) | P-value = 0.00 X2 = 13.4 |
K121E (rs139298) | AAG➔GAG Transition | 361 A/A 361 A/G 361 G/G | 2 | 1 (0.8) 6 (4.5) 126 (94.7) | P-value = 0.01 X2 = 6.9 |
K140E (rs139300) | AAG➔GAG Transition | 418 A/A 418 G/G | 2 | 0133 (100) | P-value = N/A X2 = N/A |
E178D (rs139302) | GAG➔GAC Transversion | 534 G/G 534 G/C 534 C/C | 3 | 3 (2.2) 11 (8.3) 119 (89.5) | P-value = 0.00 X2 = 12.7 |
APOBEC3H synonymous changes | |||||
T43 T (rs139294) | ACG➔ACC Transversion | 129 G/G 129 G/C 129 C/C | 1 | 6 (4.5) 7 (5.3) 120 (90.2) | P-value =0.00 X2 = 48.4 |
APOBEC3F
APOBEC3G
APOBEC3H
Determination of APOBEC 3 haplotypes
Variation (amino acid and its position) | Frequency (%) | Haplotypes within individuals |
---|---|---|
Confirmed APOBEC3D Haplotypes (n = 168) | ||
97R, 238 T, 248R | 88 (52.3) | i/i |
97C, 238 T, 248R | 1 (0.6) | ii/ii |
97C(het), 238 T, 248R | 18 (10.7) | i/ii |
97R, 238A(het), 248R | 10 (6) | i/iii |
97R, 238 T, 248 K | 1 (0.6) | iv/iv |
97R, 238 T, 248 K(het) | 27 (16) | i/iv |
Minor variant frequency < 5% | 7 (4.1) | Not assigned |
Othersa | 16 (9.7) | Not assigned |
Unconfirmed APOBEC3D Haplotypes | ||
None | ||
Confirmed APOBEC3F Haplotypes (n = 154) | ||
48R, 78A, 87I, 108A, 231 V, 307Y | 5 (3.2) | i/i |
48R, 78A, 87I, 108S, 231 V, 307Y | 2 (1.3) | iii/iii |
48R, 78A, 87I, 108S(het), 231 V, 307Y | 32 (20.8) | i/iii |
48R, 78A, 87I, 108A, 231 V, 307C(het) | 5 (3.2) | i/iv |
48R, 78 V, 87I, 108S, 231 V, 307Y | 1 (0.6) | vi/vi |
48R, 78 V (het), 87I, 108S, 231 V, 307Y | 2 (1.3) | vi/iii |
Minor variant frequency < 5% | 5 (3.2) | Not assigned |
Othersa | 51 (33.1) | Not assigned |
Unconfirmed APOBEC3F Haplotypes | ||
48R, 78 V (het), 87I, 108S (het), 231 V, 307Y | 21 (13.6) | Not assigned |
48R, 78 V (het), 87I, 108S (het), 231I (het), 307Y | 14 (9.1) | Not assigned |
48R, 78 V (het), 87I, 108S, 231I (het), 307Y | 8 (5.2) | Not assigned |
48R, 78 V (het), 87I, 108S (het), 231I, 307Y | 2 (1.3) | Not assigned |
48P (het), 78A, 87I, 108S (het), 231 V, 307Y | 6 (3.9) | Not assigned |
Confirmed APOBEC3G Haplotypes (n = 165) | ||
186H, 275Q, 363G | 5 (3.1) | i/i |
186R, 275Q, 363G | 20 (12.1) | ii/ii |
186R (het), 275Q, 363G | 43 (26.1) | i/ii |
186H, 275E, 363G | 3 (1.8) | iii/iii |
186H, 275E(het), 363G | 22 (13.3) | i/iii |
186H, 275Q, 363R(het) | 9 (5.5) | iv/iv |
Minor variant frequency < 5% | 11 (6.6) | Not assigned |
Othersa | 52 (31.5) | Not assigned |
Unconfirmed APOBEC3G Haplotypes | ||
None | ||
Confirmed APOBEC3H Haplotypesb (n = 133) | ||
15 N, 18R, 105R, 121E, 178D | 38 (28.6) | ii/ii |
15Δ,18R, 105R, 121E,178E | 25 (18.8) | x/x |
15Δ(het), 18R, 105R, 121E, 178D | 36 (27.1) | ii/iii |
15Δ, 18 L, 105R, 121E, 178D | 6 (4.5) | iv/iv |
15Δ, 18 L(het), 105R, 121E, 178D | 7 (5.3) | iv/iii |
Othera | 10 (7.4) | Not assigned |
Unconfirmed APOBEC3H Haplotypesb | ||
15Δ(het), 18R, 105R, 121E(het), 178D | 4 (3) | Not assigned |
15Δ(het), 18 L(het), 105R, 121E, 178D | 7 (5.3) | Not assigned |
Allele frequencies and their comparison with other populations
Amino acid change and variant ID | Allele (2n) | SA (336) | EAS (1008) | EUR (1006) | AFR (1322) | AMR (694) | SAS (978) | ExAC (121412) |
APOBEC 3D nonsynonymous allele frequencies (%) | ||||||||
R6K (NI) | G (R) A (K) | 94.9 5.1 | NI | NI | NI | NI | NI | NI |
R97C (rs75858538) | C (R) T (C) | 93.8 6.2 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 96.6 3.4 (P = 0.0395)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 99.7 0.3 (P < 0.0001)* |
L221R (NI) | T (L) G (R) | 99.1 0.9 | NI | NI | NI | NI | NI | NI |
C224Y (rs772893975) | G (C) A (Y) | 97.9 2.1 | ND | ND | ND | ND | ND | 100 0 (P < 0.0001)* |
T238A (rs201709403) | A (T) G (A) | 95.8 4.2 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* |
T238I (NI) | C (T) T (I) | 97.6 2.4 | NI | NI | NI | NI | NI | NI |
R248K (rs61748819) | G (R) A (K) | 89.0 11.0 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 89.0 11.0 (P = 1.0) | 99.0 1.0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 99.0 1.0 (P < 0.0001)* |
C320Y (rs61999342) | G (C) A (Y) | 99.7 0.3 | ND | ND | ND | ND | ND | 100 0 (P < 0.0001)* |
APOBEC3D synonymous allele frequencies (%) | ||||||||
L221 L (rs769426665) | G (L) C (L) | 99.10.9 | ND | ND | ND | ND | ND | 100 0 (P < 0.0001)* |
T316 T (rs184448269) | C (T) T (T) | 97.9 2.1 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 98.8 1.2 (P = 0.2917) | 99.6 0.4 (P = 0.0169)* | 100 0 (P < 0.0001)* | 99.9 0.1 (P < 0.0001)* |
APOBEC 3F nonsynonymous allele frequencies (%) | ||||||||
R48P (rs35053197) | G (R) C (P) | 96.1 3.9 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 96.9 3.1 (P = 0.4762) | 99.4 0.6 (P = 0.0003)* | 99.4 0.6 (P = 0.0001)* | 99.7 0.3 (P < 0.0001)* |
A78V (rs5750728) | C (A) T (V) | 79.9 20.1 | 29 71 (P < 0.0001)* | 51 49 (P < 0.0001)* | 80 20 (P = 0.9371) | 38 62 (P < 0.0001)* | 39 61 (P < 0.0001)* | 52.3 47.8 (P < 0.0001)* |
I87L (rs146543452) | A (I) C (L) | 99.7 0.3 | 100 0 (P = 0.2340) | 100 0 (P = 0.2344) | 99.0 1.0 (P = 0.4897) | 100 0 (P = 0.3074) | 100 0 (P = 0.2395) | 100 0 (P < 0.0001)* |
Q87L (rs114704208) | A (Q) T (L) | 97.1 2.9 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 94.5 5.5 (P = 0.0606) | 99.6 0.4 (P = 0.0019)* | 100 0 (P < 0.0001)* | 99.7 0.3 (P < 0.0001)* |
A108S (rs2020390) | G (A) T (S) | 62.3 37.7 | 29.0 71.0 (P < 0.0001)* | 51.0 49.0 (P = 0.0005)* | 68.0 32.0 (P = 0.0600) | 37.0 63.0 (P < 0.0001)* | 40.0 60.0 (P = 0.0001)* | 52.4 47.6 (P < 0.0001)* |
V231I (rs2076101) | G (V) A (I) | 89.0 11.0 | 29.0 71.0 (P < 0.0001)* | 51.0 49.0 (P < 0.0001)* | 81.0 19.0 (P = 0.0008)* | 38.0 62.0 (P < 0.0001)* | 39.0 61.0 (P < 0.0001)* | 48.4 51.6 (P < 0.0001)* |
Y307C (rs12157816) | A (Y) G (C) | 95.1 4.9 | 100 0 (P < 0.0001)* | 98.0 2.0 (P = 0.0133)* | 97.0 3.0 (P = 0.1148) | 98.0 2.0 (P = 0.0225)* | 100 0 (P < 0.0001)* | 98.7 1.3 (P < 0.0001)* |
APOBEC3F synonymous allele frequencies (%) | ||||||||
I117I (NI) | C (I) T (I) | 99.4 0.6 | NI | NI | NI | NI | NI | NI |
S118S (rs35928287) | C (S) T (S) | 86.7 13.3 | ND | ND | ND | ND | ND | 99.7 0.3 (P < 0.0001)* |
R143R (rs4821862) | C (R) T (R) | 41.958.1 | 29.0 71.0 (P < 0.0001)* | 51.0 49.0 (P = 0.0040)* | 45.0 55.0 (P = 0.2781) | 36.0 64.0 (P = 0.1038) | 39.0 61.0 (P = 0.4613) | 45.5 54.6 (P = 0.1787) |
Y196Y (rs765418322) | T (Y) C (Y) | 89.6 10.4 | ND | ND | ND | ND | ND | 100 0 (P < 0.0001)* |
S229S (rs549550231) | A (S) G (S) | 99.4 0.6 | ND | ND | ND | ND | ND | 100 0 (P < 0.0001)* |
E245E (rs113109079) | G (E) A (E) | 97.1 3.9 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 99.0 1.0 (P = 0.0231)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* |
S327S (rs35895636) | C (S) T (S) | 90.6 9.4 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 98.3 1.7 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 99.8 0.2 (P < 0.0001)* |
APOBEC 3G nonsynonymous allele frequencies (%) | ||||||||
H186R (rs8177832) | A (H) G (R) | 63.0 37.0 | 92.8 7.2 (P < 0.0001)* | 97.0 3.0 (P < 0.0001)* | 57.0 43.0 (P = 0.0530) | 92.5 7.5 (P < 0.0001)* | 99.2 0.8 (P < 0.0001) | 93.6 6.4 (P < 0.0001)* |
R256H (rs17000736) | G (R) A (H) | 98.8 1.2 | 100 0 (P = 0.0036)* | 100 0 (P = 0.0037)* | 98.6 1.4 (P = 1.0) | 100 0 (P = 0.0107)* | 100 0 (P = 0.0040)8 | 100 0 (P = 0.0002)* |
Q275E (rs17496046) | C (Q) G (E) | 82.1 17.9 | 97.3 2.7 (P < 0.0001)* | 94.6 5.4 (P < 0.0001)* | 87.5 12.5 (P = 0.0119)* | 96.0 4.0 (P < 0.0001)* | 98.7 1.3 (P < 0.0001)* | 94.3 5.7 (P < 0.0001)* |
G363R (rs148267053) | G (G) A (R) | 94.8 5.2 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 98.6 1.4 (P = 0.3220) | 99.9 0.1 (P = 0.0017)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* |
APOBEC3G synonymous allele frequencies (%) | ||||||||
S60S (rs112603901) | C (S) T (S) | 94.5 5.5 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 99.7 0.3 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* |
A109A (rs375760983) | C (A) T (A) | 99.7 0.3 | ND | ND | ND | ND | ND | 100 0 (P < 0.0001)* |
F119F (rs5757465) | T (F) C (F) | 99.7 0.3 | 77.6 22.4 (P < 0.0001)* | 55.3 44.7 (P < 0.0001)* | 97.1 2.9 (P = 0.0035)* | 60.2 39.8(P < 0.0001)* | 55.5 44.5 (P < 0.0001)* | 62.8 37.2 (P < 0.0001)* |
L371 L (rs11545130) | C (L) T (L) | 97.9 2.1 | 100 0 (P < 0.0001)* | 100 0 (P < 0.0001)* | 97.0 3.0 (P = 0.4614) | 99.6 0.4 (P = 0.0157)* | 100 0 (P < 0.0001)* | 99.7 0.3 (P < 0.0001)* |
APOBEC 3H nonsynonymous allele frequencies (%) | ||||||||
N15Δ (rs140936762) | CAA(N) Δ | 40.2 59.8 | 74 26 (P < 0.0001)* | 66 34 (P < 0.0001)* | 69 31 (P < 0.0001)* | 72 28 (P < 0.0001)* | 60 40 (P < 0.0001)* | 65.7 34.3 (P < 0.0001)* |
R18L (rs139293) | G (R) T (L) | 89.8 10.2 | 84.1 15.9 (P = 0.0192)* | 70.7 29.3 (P < 0.0001)* | 93.0 7.0 (P = 0.0972) | 75.8 24.2 (P < 0.0001)* | 69.4 30.6 (P < 0.0001)* | 73.2 26.8 (P < 0.0001)* |
G105R (rs139297) | G (G) C (R) | 2.2 97.8 | 68.5 31.5 (P < 0.0001)* | 53.6 46.4 (P < 0.0001)* | 12.5 87.5 (P < 0.0001)* | 61.7 38.3 (P < 0.0001)* | 57.1 42.9 (P < 0.0001)* | 51.9 48.1 (P < 0.0001)* |
K121E (rs139298) | A (K) G (E) | 3.0 97.0 | 68.5 31.5 (P < 0.0001)* | 52.3 47.7 (P < 0.0001)* | 12.5 87.5 (P < 0.0001)* | 61.2 38.8 (P < 0.0001)* | 56.1 43.9 (P < 0.0001)* | 51.5 48.5 (P < 0.0001)* |
K140E (rs139300) | A (K) G (E) | 0100 | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) |
E178D (rs139302) | G (E) C (D) | 6.4 93.6 | 67.0 33.0 (P < 0.0001)* | 54.6 45.4 (P < 0.0001)* | 15.3 84.7 (P < 0.0001)* | 65.4 34.6 (P < 0.0001)* | 56.1 43.9 (P < 0.0001)* | 52.3 47.7 (P < 0.0001)* |
APOBEC3H synonymous allele frequencies (%) | ||||||||
T43 T (rs139294) | G (T) C (T) | 7.1 92.9 | 66.7 33.3 (P < 0.0001)* | 54.6 45.4 (P < 0.0001)* | 17.2 82.8 (P < 0.0001)* | 62.0 38.0 (P < 0.0001)* | 56.0 44.0 (P < 0.0001)* | 52.4 47.6 (P < 0.0001)* |
Amino acid change and variant ID | Allele (2n) | SA (336) | ACB (192) | ASW (122) | ESN (198) | GWD (226) | LWK (198) | MSL (170) | YRI (216) |
APOBEC 3D nonsynonymous allele frequencies (%) | |||||||||
R6K (NI) | G (R) A (K) | 94.9 5.1 | NI | NI | NI | NI | NI | NI | NI |
R97C (rs75858538) | C (R) T (C) | 93.8 6.2 | 97 3 (P = 0.1506) | 94 6 (P = 1.0) | 97 3 (P = 0.0614) | 98 2 (P = 0.0253)* | 94 6 (P = 1.0) | 98 2 (P = 0.0262)* | 97 3 (P = 0.1631) |
L221R (NI) | T (L) G (R) | 99.1 0.9 | NI | NI | NI | NI | NI | NI | NI |
C224Y (rs772893975) | G (C) A (Y) | 97.9 2.1 | ND | ND | ND | ND | ND | ND | ND |
T238A (rs201709403) | A (T) G (A) | 95.8 4.2 | 100 0 (P = 0.0030)* | 99 1 (P = 0.1322) | 100 0 (P = 0.0016)* | 100 0 (P = 0.0012)* | 100 0 (P = 0.0016)* | 100 0 (P = 0.0036)* | 100 0 (P = 0.0013)* |
T238I (NI) | C (T) T (I) | 97.6 2.4 | NI | NI | NI | NI | NI | NI | NI |
R248K (rs61748819) | G (R) A (K) | 89.0 11.0 | 91 9 (P = 0.4596) | 96 4 (P = 0.0267)* | 86 14 (P = 0.3375) | 92 8 (P = 0.3890) | 87 13 (P = 0.5785) | 86 14 (P = 0.4668) | 87 13 (P = 0.5018) |
C320Y (rs61999542) | G (C) A (Y) | 99.7 0.3 | ND | ND | ND | ND | ND | ND | ND |
APOBEC3D synonymous allele frequencies (%) | |||||||||
L221 L (rs769426665) | G (L) C (L) | 99.1 0.9 | ND | ND | ND | ND | ND | ND | ND |
T316 T (rs184448269) | C (T) T (T) | 97.9 2.1 | 99 1 (P = 0.3657) | 99 1 (P = 0.3657) | 98 2 (P = 0.3657) | 99 1 (P = 0.3657) | 99 1 (P = 0.3657) | 98 2 (P = 0.3657) | 99 1 (P = 0.3657) |
APOBEC 3F nonsynonymous allele frequencies (%) | |||||||||
R48P (rs35053197) | G (R) C (P) | 96.1 3.9 | 98 2 (P = 0.3074) | 98 2 (P = 0.3670) | 95 5 (P = 0.8202) | 99 1 (P = 0.1100) | 95 5 (P = 0.6559) | 98 2 (P = 0.2763) | 95 5 (P = 0.6663) |
A78V (rs5750728) | C (A) T (V) | 79.9 20.1 | 78 22 (P = 0.6518) | 73 27 (P = 0.1236) | 82 18 (P = 0.6453) | 77 23 (P = 0.3943) | 76 24 (0.3756) | 85 15 (P = 0.2183) | 86 14 (P = 0.0798) |
I87L (rs146543452) | A (I) C (L) | 99.7 0.3 | 100 0 (P = 1) | 100 0 (P = 1) | 100 0 (P = 1) | 98 2 (P = 0.0878) | 99 1 (P = 1) | 99 1 (P = 1) | 100 0 (P = 1) |
Q87L (rs114704208) | A (Q) T (L) | 97.1 2.9 | 92.2 7.8 (P = 0.0171)* | 93.4 6.6 (P = 0.0994) | 96.5 3.5 (P = 0.7961) | 96.0 4.0 (P = 0.6287) | 93.0 7.0 (P = 0.0718) | 94.7 5.3 (P = 2141) | 94.4 5.6 (P = 0.1738) |
A108S (rs2020390) | G (A) T (S) | 62.3 37.7 | 67 33 (P = 0.2922) | 66 34 (P = 0.5796) | 69 31 (P = 0.1268) | 62 38 (P = 0.8572) | 65 35 (P = 0.6371) | 76 24 (P = 0.0031)* | 70 30 (P = 0.0619) |
V231I (rs2076101) | G (V) A (I) | 89.0 11.0 | 79 21 (P = 0.0041)* | 73 27 (P = 0.0001)* | 84 16 (P = 0.1053) | 78 22 (P = 0.0007)* | 80 20 (P = 0.0092)* | 85 15 (P = 0.2485) | 87 13 (P = 0.5829) |
Y307C (rs12157816) | A (Y) G (C) | 95.1 4.9 | 95 5 (P = 0.8368) | 98 2 (P = 0.1703) | 96 4 (P = 0.5129) | 98 2 (P = 0.1649) | 98 2 (P = 0.1489) | 96 4 (P = 0.8218) | 95 5(P = 1.0) |
APOBEC3F synonymous allele frequencies (%) | |||||||||
I117I (NI) | C (I) T (I) | 99.4 0.6 | NI | NI | NI | NI | NI | NI | NI |
S118S (rs35928287) | C (S) T (S) | 86.7 13.3 | ND | ND | ND | ND | ND | ND | ND |
R143R (rs4821862) | C (R) T (R) | 41.9 58.1 | 45 55 (P = 0.5775) | 45 55 (P = 5892) | 39 61 (P = 0.6433) | 50 50 (P = 0.0794) | 45 55 (P = 0.4624) | 46 54 (P = 0.3372) | 47 53 (P = 0.2840) |
Y196Y (rs765418322) | T (Y) C (Y) | 89.6 10.4 | ND | ND | ND | ND | ND | ND | ND |
S229S (rs549550231) | A (S) G (S) | 99.4 0.6 | ND | ND | ND | ND | ND | ND | ND |
E245E (rs113109079) | G (E) A (E) | 97.1 2.9 | 99 1 (P = 0.2175) | 98 2 (P = 0.7356) | 100 0 (P = 0.0139)* | 98 2 (P = 0.5717) | 99 1 (P = 0.0971) | 98 2 (P = 0.5517) | 98 2 (P = 0.1350) |
S327S (rs35895636) | C (S) T (S) | 90.6 9.4 | 98 2 (P = 0.0013)* | 99 1 (P = 0.0006)* | 96 4 (P = 0.0126)* | 100 0 (P = 0.0001)* | 97 3 (P = 0.0064)* | 100 0 (P = 0.0001)* | 98 2 (P = 0.0004)* |
APOBEC 3G nonsynonymous allele frequencies (%) | |||||||||
H186R (rs8177832) | A (H) G (R) | 63.037.0 | 56 44 (P = 0.1147) | 75 25 (P = 0.0249)* | 49 51 (P = 0.0026)* | 57 43 (P = 0.1344) | 68 32 (P = 0.3013) | 49 51 (P = 0.0030)* | 52 48 (P = 0.0101)* |
R256H (rs17000736) | G (R) A (H) | 98.8 1.2 | 98 2 (P = 0.7118) | 98 2 (P = 0.6631) | 99 1 (P = 0.6550) | 97 3 (P = 0.1316) | 98 2 (P = 1.0) | 99 1 (P = 0.6660) | 99 1 (P = 1.0) |
Q275E (rs17496046) | C (Q) G (E) | 82.117.9 | 90 10 (P = 0.0026)* | 91 9 (P = 0.0064)* | 86 14 (P = 0.0725) | 87 13 (P = 0.0482)* | 83 17 (P = 0.3526) | 91 9 (P = 0.0027)* | 87 13 (P = 0.0473)* |
G363R (rs148267053) | G (G) A (R) | 94.8 5.2 | 98 2 (P = 0.1066) | 99 1(P = 0.0532) | 100 0 (P = 0.0005)* | 98 2(P = 0.0429)* | 99 1(P = 0.0142)* | 98 2 (P = 0.0902) | 98 2 (P = 0.1209) |
APOBEC3G synonymous allele frequencies (%) | |||||||||
S60S (rs112603901) | C (S) T (S) | 94.5 5.5 | 99 1 (P = 0.0027)* | 98 2(P = 0.1191) | 100 0 (P = 0.0002)* | 100 0 (P = 0.0001)* | 99 1 (P = 0.0026)* | 100 0 (P = 0.0006)* | 100 0 (P = 0.0001)* |
A109A (rs375760983) | C (A) T (A) | 99.7 0.3 | ND | ND | ND | ND | ND | ND | ND |
F119F (rs5757465) | T (F) C (F) | 99.7 0.3 | 93 7 (P = 0.0001)* | 89 11 (P = 0.0001)* | 100 0(P = 1.0) | 98 2 (P = 0.1639) | 98 2 (P = 0.1507) | 99 1 (P = 1.0) | 99 1 (P = 0.5654) |
L371 L (rs11545130) | C (L) T (L) | 97.9 2.1 | 98 2 (P = 0.7523) | 98 2 (P = 1.0) | 97 3 (P = 0.7698) | 98 2 (P = 1.0) | 95 5 (P = 0.1239) | 98 2 (P = 1.0) | 95 5 (P = 0.1299) |
APOBEC 3H nonsynonymous allele frequencies (%) | |||||||||
N15Δ (rs140936762) | CAA(N) Δ | 40.2 59.8 | 71 29 (P < 0.0001)* | 70 30 (P < 0.0001)* | 61 39 (P = 0.0004)* | 78 22 (P < 0.0001)* | 62 38 (P = 0.0002)* | 75 25 (P < 0.0001)* | 68 32 (P < 0.0001)* |
R18L (rs139293) | G (R) T (L) | 89.8 10.2 | 93 7 (P = 0.3230) | 87 13 (P = 0.3885) | 93 7 (P = 0.3212) | 94 6 (P = 0.0972) | 94 6 (P = 0.0873) | 93 7 (P = 0.3053) | 96 4 (P = 0.0076)* |
G105R (rs139297) | G (G) C (R) | 2.2 97.8 | 15 85 (P < 0.0001)* | 25 75 (P < 0.0001)* | 10 90 (P = 0.0007)* | 13 87 (P < 0.0001)* | 9 91 (P = 0.0012)* | 11 89 (P < 0.0002)* | 9 91 (P < 0.0016)* |
K121E (rs139298) | A (K) G (E) | 3.0 97.0 | 15 85 (P < 0.0001)* | 25 75 (P < 0.0001)* | 10 90 (P = 0.0025)* | 13 87 (P < 0.0001)* | 9 91 (P = 0.0070)* | 11 89 (P = 0.0008)* | 9 91 (P = 0.0086)* |
K140E (rs139300) | A (K) G (E) | 0100 | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) | 0100 (P = 1.0) |
E178D (rs139302) | G (E) C (D) | 6.4 93.6 | 17 83 (P = 0.0004)* | 29 71 (P < 0.0001)* | 11 89 (P = 0.0900) | 16 84 (P = 0.0007)* | 16 84 (P = 0.0018)* | 13 87 (P = 0.0248)* | 11 89 (P = 0.0991) |
APOBEC3H synonymous allele frequencies (%) | |||||||||
T43 T (rs139294) | G (T) C (T) | 7.1 92.9 | 22 78 (P < 0.0001)* | 30 70 (P < 0.0001)* | 16 84 (P = 0.0026)* | 16 84 (P = 0.0024)* | 13 87 (P = 0.0545) | 17 83 (P = 0.0016)* | 13 87 (P = 0.0439)* |