1. Dara Bahri , Heinrich Jiang , Yi Tay , and Donald Metzler . 2021 . Scarf: Self-Supervised contrastive learning using random feature corruption . In International Conference on Learning Representations. Dara Bahri, Heinrich Jiang, Yi Tay, and Donald Metzler. 2021. Scarf: Self-Supervised contrastive learning using random feature corruption. In International Conference on Learning Representations.
2. James Bergstra , Rémi Bardenet , Yoshua Bengio , and Balázs Kégl . 2011. Algorithms for hyper-parameter optimization. Advances in neural information processing systems 24 ( 2011 ). James Bergstra, Rémi Bardenet, Yoshua Bengio, and Balázs Kégl. 2011. Algorithms for hyper-parameter optimization. Advances in neural information processing systems 24 (2011).
3. Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell 2020. Language models are few-shot learners. In Neural Information Processing Systems. Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell 2020. Language models are few-shot learners. In Neural Information Processing Systems.
4. End-to-End Object Detection with Transformers
5. Emerging Properties in Self-Supervised Vision Transformers