A list of questions with short answers on AI and related topics.
These are questions I’ve had, or that I thought others who are exploring AI and the most recent advances in language models. What I hope these become are answers put into the simplest possible terms while backfilling what is often presumed to be common knowledge in the field.
Each section will be kept short, and then link to more details elsewhere.
- What is the transformer model?
Vector databases
- What is a vector database?
- What are word vectors?
- What is one-hot encoding?
Embeddings
- What are embeddings?
- How are embeddings stored?
- What does it mean to shrink an embedding?
Attention
- What is attention?
- What is an attention head?
- Why do we need multiple attention heads?
Mamba
- What is Mamba?
What is NLP?
- What is a token?