Spurious Valleys in the Error Surface of Recurrent Networks—Analysis and Avoidance |
| |
Authors: | Horn J, De Jesus O, Hagan MT |
| |
Affiliation: | Agilent Technol., High Freq. Technol. Center, Santa Clara, CA; |
| |
Abstract: | This paper gives a detailed analysis of the error surfaces of certain recurrent networks and explains some difficulties encountered in training recurrent networks. We show that these error surfaces contain many spurious valleys, and we analyze the mechanisms that cause the valleys to appear. We demonstrate that the principal mechanism can be understood through the analysis of the roots of random polynomials. This paper also provides suggestions for improvements in batch training procedures that can help avoid the difficulties caused by spurious valleys, thereby improving training speed and reliability. |
| |