1. Long short term memory;Hochreiter;Neural Comput.,1997
2. Untersuchungen zu Dynamischen Neuronalen Netzen;Hochreiter,1991
3. Learning long-term dependencies with gradient descent is difficult;Bengio;IEEE Trans. Neural Netw.,1994
4. Neural ordinary differential equations;Chen;Adv. Neural Inf. Process. Syst.,2018
5. Deep equilibrium models;Bai;Adv. Neural Inf. Process. Syst.,2019