Click-through rate (CTR) predictions are important for internet companies. The CTR is closely related to the context, user attributes and advertising attributes, with effective CTR predictions essential for improving company revenue. The traditional LR model was optimized to predict the relationship between the user and advertiser characteristics for the CTR which were added to the Sigmoid function to obtain a new features conjunction model. The online optimization algorithm follow-the-regularized-leader (FTRL) was used to improve the efficiency of the parameter, and the mixed regularization was used to prevent over fitting. Tests on a real-world advertising dataset show that this method has better accuracy, efficiency, parameter sensitivity and reliability compared with previous algorithms.
摘要 点击率 (click-through rate, CTR) 预测是互联网公司中重要的研究课题, 预测结果与上下文、用户属性和广告属性息息相关, CTR的有效预测对提高广告公司的收入至关重要。该文在对传统逻辑回归 (logistic regression, LR) 模型的相关原理和参数优化算法介绍的基础上, 抽离出用户特征和广告特征, 将用户与广告之间特征的关联信息添加到Sigmoid函数中得到一种特征关联模型。与以往求解方法不同, 该方法采用在线最优化算法FTRL (follow-the-regularized-leader) 提高参数计算效率, 采用混合正则化来防止训练过拟合。真实的广告数据集上的实验结果表明：该方法与传统的模型和方法相比具有更好的预测精度、效率、参数敏感性和可靠性。