Andrej Baranovskij Blog
Blog about Oracle, Full Stack, Machine Learning and Cloud
Monday, May 27, 2024
Hybrid RAG with Sparrow Parse
To process complex layout docs and improve data retrieval from invoices or bank statements, we are implementing Sparrow Parse. It works in c...
Monday, May 20, 2024
Sparrow Parse - Data Processing for LLM
Data processing in LLM RAG is very important: it helps to improve data extraction results, especially for complex layout documents, with lar...
Monday, May 13, 2024
Invoice Data Preprocessing for LLM
Data preprocessing is an important step in an LLM pipeline. I show various approaches to preprocessing invoice data before feeding it to an LLM. This ...
Monday, May 6, 2024
You Don't Need RAG to Extract Invoice Data
Documents like invoices or receipts can be processed by an LLM directly, without RAG. I explain how you can do this locally with Ollama and Ins...
Monday, April 29, 2024
LLM JSON Output with Instructor RAG and WizardLM-2
With the Instructor library, you can implement simple RAG without a vector DB or dependencies on other LLM libraries. The key RAG components - good...
Monday, April 22, 2024
Local RAG Explained with Unstructured and LangChain
In this tutorial, I do a code walkthrough and demonstrate how to implement the RAG pipeline using Unstructured, LangChain, and Pydantic for ...
Monday, April 15, 2024
Local LLM RAG with Unstructured and LangChain [Structured JSON]
I use the Unstructured library to pre-process PDF document content into a cleaner format. This helps the LLM produce a more accurate response....