Frequently, when dealing with many machine learning models, optimization problems appear to be challenging due to a limited understanding of the constructions and characterizations of the objective functions in these problems. Therefore, major complications arise when dealing with first-order algorithms, in which gradient computations are challenging or even impossible in various scenarios. For this reason, we resort to derivative-free methods (zeroth-order methods). This paper is devoted to an approach to minimizing quasi-convex functions using a recently proposed comparison oracle only. This oracle compares function values at two points and tells which is larger, thus by the proposed approach, the comparisons are all we need to solve the optimization problem under consideration. The proposed algorithm to solve the considered problem is based on the technique of comparison-based gradient direction estimation and the comparison-based approximation normalized gradient descent. The normalized gradient descent algorithm is an adaptation of gradient descent, which updates according to the direction of the gradients, rather than the gradients themselves. We proved the convergence rate of the proposed algorithm when the objective function is smooth and strictly quasi-convex in , this algorithm needs comparison queries to find an -approximate of the optimal solution, where is an upper bound of the distance between all generated iteration points and an optimal solution.
Journal Russian Journal of Nonlinear Dynamics Optimization
On quasi-convex smooth optimization problems by a comparison oracle
arXiv:2502.01862
Cite this paper
On quasi-convex smooth optimization problems by a comparison oracle
@inproceedings{gasnikov2024quasi,
title = {On quasi-convex smooth optimization problems by a comparison oracle},
author = {Alexander Gasnikov and Mohammad Alkousa and Aleksandr Lobanov and Yuriy Dorn and Fedor Stonyakin and Ilya Kuruzov and Sanjeev Singh},
booktitle = {Russian Journal of Nonlinear Dynamics},
year = {2024},
url = {https://arxiv.org/abs/2502.01862}
}