Completed
Chunking with RecursiveCharacterTextSplitter
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
LangChain Data Loaders, Tokenizers, Chunking, and Datasets - Data Prep
Automatically move to the next video in the Classroom when playback concludes
- 1 Data preparation for LLMs
- 2 Downloading the LangChain docs
- 3 Using LangChain document loaders
- 4 How much text can we fit in LLMs?
- 5 Using tiktoken tokenizer to find length of text
- 6 Initializing the recursive text splitter in Langchain
- 7 Why we use chunk overlap
- 8 Chunking with RecursiveCharacterTextSplitter
- 9 Creating the dataset
- 10 Saving and loading with JSONL file
- 11 Data prep is important