Sequence | Protein | Name | Start coordinate^{a} | Stop coordinate^{a} | Effect size^{b} | Bootstrap frequency^{c} |
---|---|---|---|---|---|---|
KAFSPEVIPMF | Gag (p24) | KF11 | 30 | 40 | 2.49 | 0.946 |
RLRDLLLIVTR | Env (gp41) | RR11 | 259 | 269 | 4.06 | 0.754 |
GIPHPAGLK | Pol (RT) | GK9 | 93 | 101 | 5.26 | 0.746 |
HTQGYFPDW | Nef | HW9 | 116 | 124 | 4.86 | 0.743 |
AEAMSQVTNS | Gag (p2) | AS10 | 1 | 10 | 4.30 | 0.638 |
SAEPVPLQL | Rev | SL9 | 67 | 75 | 2.57 | 0.626 |
QAISPRTLNAW | Gag (p24) | QW11 | 13 | 23 | 1.21 | 0.599 |
RIKQIINMW | Env (gp120) | RW9 | 419 | 427 | 2.90 | 0.513 |

^{a} HXB2 coordinates with respect to protein domains (p24, RT, gp41, and gp120). See Table S1 in the supplemental material for coordinates with respect to polyproteins (Gag, Pol, Env).

^{b} The natural logarithm of the odds ratio that an individual will be a progressor or not given that he or she targets this epitope. Parameters are estimated using a nonregularized logistic regression model including all HLAs and epitopes.

^{c} The fraction of bootstrap samples in which the sequence is selected for inclusion in the predictive model.
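
As a reading aid for the last two columns, the following minimal sketch shows how the quantities defined in the footnotes can be interpreted. It assumes only the definitions given above: an effect size is a natural-log odds ratio, so exponentiating it recovers the odds ratio, and a bootstrap frequency is the share of bootstrap samples in which a sequence was selected. The function names and the `toy_runs` data are hypothetical and are not taken from the study; the selection procedure itself is not reproduced here.

```python
import math


def odds_ratio(effect_size):
    """Convert a natural-log odds ratio (the effect size) to an odds ratio."""
    return math.exp(effect_size)


def selection_frequency(name, bootstrap_selections):
    """Fraction of bootstrap samples whose selected set contains `name`."""
    hits = sum(1 for selected in bootstrap_selections if name in selected)
    return hits / len(bootstrap_selections)


# Effect sizes copied from three rows of the table above.
effect_sizes = {"KF11": 2.49, "RR11": 4.06, "GK9": 5.26}
for name, beta in effect_sizes.items():
    print(f"{name}: log odds ratio {beta:.2f}, odds ratio {odds_ratio(beta):.1f}")

# Hypothetical toy data: the set of epitopes selected in each of four
# bootstrap samples. These are NOT results from the study.
toy_runs = [{"KF11", "GK9"}, {"KF11"}, {"KF11", "RR11"}, {"GK9"}]
print(selection_frequency("KF11", toy_runs))
```

Because the effect sizes are on a logarithmic scale, differences between rows are larger than they look: a gap of 1.0 in effect size corresponds to a factor of about 2.7 in the odds ratio.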