Papers

2023

LongNet: Scaling Transformers to 1,000,000,000 Tokens
Pre-Trained Image Processing Transformer
Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines
Voyager: An Open-Ended Embodied Agent with Large Language Models
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
A Neural Corpus Indexer for Document Retrieval
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information *
Giving BERT a Calculator *
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback *
How well do Large Language Models perform in Arithmetic tasks? *
ToolCoder: Teach Code Generation Models to use API search tools *
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs *
Small Models are Valuable Plug-ins for Large Language Models *
API-Bank: A Benchmark for Tool-Augmented LLMs *
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information *
ART: Automatic multi-step reasoning and tool-use for large language models *
TALM: Tool Augmented Language Models *
Tool Learning with Foundation Models *
Toolformer: Language Models Can Teach Themselves to Use Tools *
LoRA: Low-Rank Adaptation of Large Language Models
Scaling Transformer to 1M tokens and beyond with RMT
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Sparks of Artificial General Intelligence: Early experiments with GPT-4
(Ada) Human-Timescale Adaptation in an Open-Ended Task Space
* denotes literature review for ECU research

2022

(VPT) Video PreTraining: Learning to Act by Watching Unlabeled Online Videos
(OPT) Open Pre-trained Transformer Language Models
(LaMDA) Language Models for Dialog Applications
Attention Is All You Need
(Imagen) Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
(DALLE2) Hierarchical Text-Conditional Image Generation with CLIP Latents
(GATO) A Generalist Agent
Thinking Fast and Slow in Ai

2021

(Attribute2Font) Creating Fonts You Want From Attributes
(DALLE) Zero-Shot Text-to-Image Generation
(GPT3) Language Models are Few-Shot Learners
(MNIST) Backpropagation Applied to Handwritten Zip Code Recognition