Please Note! This page is still a work in progress.
Natural Language Processing (NLP)
Est. read time: 1 minute | Last updated: December 17, 2024 by John Gentile
Overview
Large Language Models (LLM)
- LLaMA: Open and Efficient Foundation Language Models - arXiv: Meta AI's open-source LLM.
- llama3 implemented from scratch
- llama.cpp: fast, low overhead inference of LLaMA in C/C++.
References
- Attention Is All You Need - arXiv: introduces the Transformer architecture, which is built on attention mechanisms
- Hugging Face: pre-trained NLP models & reference
- Training data-efficient image transformers & distillation through attention - Facebook AI
- The Illustrated Transformer: NLP walk-through
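
As a companion to the attention references above, here is a minimal sketch of scaled dot-product attention, `softmax(Q K^T / sqrt(d_k)) V`, as defined in "Attention Is All You Need". It uses plain Python lists so it runs without dependencies; the function names `softmax` and `scaled_dot_product_attention` are illustrative choices, not taken from any of the linked projects, and real implementations add batching, masking, and multiple heads.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of row vectors.

    queries: list of vectors of length d_k
    keys:    list of vectors of length d_k (one per value)
    values:  list of vectors of length d_v
    """
    d_k = len(keys[0])
    d_v = len(values[0])
    output = []
    for q in queries:
        # Dot product of this query with every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        # Turn the scores into weights that sum to 1.
        weights = softmax(scores)
        # Each output row is a weighted average of the value vectors.
        output.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(d_v)]
        )
    return output


# Hypothetical toy input: one query attending over two identical keys.
out = scaled_dot_product_attention(
    queries=[[1.0, 0.0]],
    keys=[[1.0, 1.0], [1.0, 1.0]],
    values=[[0.0, 2.0], [4.0, 6.0]],
)
print(out)
```

Because both keys in the toy input are identical, the query weights them equally and the result is the plain average of the two value vectors.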