Monday, May 13, 2024
Invoice Data Preprocessing for LLM
Data preprocessing is important step for LLM pipeline. I show various approaches to preprocess invoice data, before feeding it to LLM. This is quite challenging step, especially to preprocess tables.
Monday, May 6, 2024
You Don't Need RAG to Extract Invoice Data
Documents like invoices or receipts can be processed by LLM directly, without RAG. I explain how you can do this locally with Ollama and Instructor. Thanks to Instructor, structured output from LLM can be validated with your own Pydantic class.
Subscribe to:
Posts (Atom)