ОБЗОР МЕТОДОВ ОБУЧЕНИЯ ГЛУБОКИХ НЕЙРОННЫХ СЕТЕЙ

Research output: Contribution to journal › Article › peer-review

DOI

https://doi.org/10.14529/cmse170303
Final published version

Андрей Владимирович Созыкин

At present, deep learning is becoming one of the most popular approaches to building artificial intelligence systems for tasks such as speech recognition, natural language processing, and computer vision. The paper presents a historical overview of deep learning in neural networks. It describes the model of the artificial neural network and the learning algorithms for neural networks, including the error backpropagation algorithm, which is used to train deep neural networks. The development of neural network architectures is presented, including the neocognitron, autoencoders, convolutional neural networks, restricted Boltzmann machines, deep belief networks, long short-term memory, gated recurrent neural networks, and residual networks. Training deep neural networks with many hidden layers is impeded by the vanishing gradient problem. The paper describes approaches to solving this problem that make it possible to train neural networks with more than a hundred layers. An overview of popular deep learning libraries is presented. Nowadays, convolutional neural networks are used for computer vision tasks, while for sequence processing, including natural language processing, recurrent networks are the preferred solution, primarily long short-term memory networks and gated recurrent neural networks.

Translated title of the contribution	AN OVERVIEW OF METHODS FOR DEEP LEARNING IN NEURAL NETWORKS
Original language	Russian
Pages (from-to)	28-59
Number of pages	32
Journal	Вестник Южно-Уральского государственного университета. Серия: Вычислительная математика и информатика
Volume	6
Issue number	3
Publication status	Published - 2017

GRNTI

27.00.00 MATHEMATICS

Level of Research Output

VAK List

ID: 2138821