[YouTube] Practical Data Considerations for building Production-Ready LLM Applications

September 4th, 2023

簡單摘要:

RAG (Retrieval Augmented Generation) 主要講的是透過查詢資料，透過 LLM 做為結果的生成回覆。

準備資料: 讀取資料，切割成 chunk ，加上 Embedding ，放入 Vector DB 讀取資料: 輸入轉換成 Embedding 比對 Vector ，找出 Chunk ，放入 LLM 來生成回答

產品介紹： LlamaIndex https://www.llamaindex.ai/

支援多種 Loader: Llama hub
支援 Document Update:
- https://gpt-index.readthedocs.io/en/stable/core_modules/data_modules/index/usage_pattern.html#handling-document-update
  - Metadata Extraction
  - Document Management

Attitude is everything