基于高维特征非线性筛选的HLA-A★0201限制性CTL表位预测

高活性细胞毒T细胞(CTL)表位鉴定是设计肿瘤疫苗的关键内容.采用天然氨基酸的531个物理化学性质参数表征HLA-A*0201限制性表位9肽, 从531×9个初始描述子出发, 经二元矩阵重排过滤器粗筛和多轮末尾淘汰精细筛选, 获得18个物理化学意义明确的保留描述子. 18个保留描述子主要涉及除1位、5位外各位置残基的疏水性和空间结构特征, 3位残基疏水性对活性影响最大, 且2位、4位、9位残基共占10个保留描述子,支持2位和9位残基为锚点、3位为关键位点以及4位残基为标志链的现有认知. 对18个保留描述子以支持向量回归构建定量序效模型,其拟合、留一法交叉验证决定系数R2、Qcv2分别为0.95...

Full description

Saved in:
Bibliographic Details
Published in物理化学学报 Vol. 29; no. 9; pp. 1945 - 1953
Main Author 韩娜 袁哲明 陈渊 代志军 王志明
Format Journal Article
LanguageChinese
Published 2013
Subjects
Online AccessGet full text
ISSN1000-6818

Cover

More Information
Summary:高活性细胞毒T细胞(CTL)表位鉴定是设计肿瘤疫苗的关键内容.采用天然氨基酸的531个物理化学性质参数表征HLA-A*0201限制性表位9肽, 从531×9个初始描述子出发, 经二元矩阵重排过滤器粗筛和多轮末尾淘汰精细筛选, 获得18个物理化学意义明确的保留描述子. 18个保留描述子主要涉及除1位、5位外各位置残基的疏水性和空间结构特征, 3位残基疏水性对活性影响最大, 且2位、4位、9位残基共占10个保留描述子,支持2位和9位残基为锚点、3位为关键位点以及4位残基为标志链的现有认知. 对18个保留描述子以支持向量回归构建定量序效模型,其拟合、留一法交叉验证决定系数R2、Qcv2分别为0.957、0.708; 独立预测决定系数及均方根误差Qext2 、RMSEext分别为0.818、0.366, 明显优于文献报道. 通过对全组合虚拟9肽的预测, 得到了多条预测活性高于已知表位肽的9肽, 可供实验验证. 较全面阐明了特定位置残基对多肽亲和性的影响规律, 为高活性多肽疫苗分子设计提供了切实指导.
Bibliography:Determining highly active epitopes of the cytotoxic T lymphocyte (CTL) is essential for the computational design of peptide vaccines for tumors. In this study, we characterized each residue in the restricted CTL epitopes using 531 physicochemical properties. We selected 18 descriptors with clear meanings from 531 x9 descriptors for each peptide of length nine using the binary matrix shuffling filter and worst descriptor elimination multi-round methods. Most of the 18 selected descriptors were the hydrophobic and steric properties of the residues. Among the 18 descriptors, 10 descriptors were related to the second, fourth, and ninth residues, which is consistent with the known facts. We then constructed a support-vector- regression-based quantitative sequence activity model (QSAM) using 18 selected descriptors. The values of the accuracies of fitting (R^2), leave-one-out cross validation (Qcv^2), and extra-sample prediction (Qext^2, RMSEext) were 0.957, 0.708, 0.818, and 0.366, respectively. These results, whi
ISSN:1000-6818