Say you have proprietary data in txt, pdf, doc, xls, etc., and you want to create a chatbot that learns it and summarizes it for you. Why don't you just download Llama or Mistral or something and write a Python script to feed in the data using LangChain or similar? I see people telling me they have to re-train or fine-tune the LLM on the proprietary data. That sounds like overkill. TC 160
Also, converting PDF to something like Markdown for searchability is a huge problem on its own (at least when I last checked, some months ago).
OP is mixing up concepts. Reasons below: 1. Limited context window. Proprietary data is huge; you can't shove the entire proprietary database into the context window. 2. Alternatively, you can convert the proprietary data into a vector database and retrieve only the relevant chunks per query. Many companies already do this. It's called RAG (retrieval-augmented generation).
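To make the RAG idea concrete, here's a minimal toy sketch of the retrieval step in plain Python. The documents, the bag-of-words "embedding," and the `retrieve` helper are all hypothetical stand-ins (real systems use neural embedding models and a proper vector store like the ones LangChain wraps); the point is just that only the best-matching chunk gets put into the LLM prompt, not the whole database.

```python
import math
from collections import Counter

def embed(text):
    # Toy embedding: bag-of-words term counts.
    # Real RAG uses a neural embedding model instead.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical proprietary docs, "indexed" as vectors ahead of time.
docs = [
    "Vacation policy: employees get 20 paid days off per year.",
    "Expense policy: meals over 50 dollars need manager approval.",
]
index = [(doc, embed(doc)) for doc in docs]

def retrieve(query, k=1):
    # Return the k chunks most similar to the query.
    q = embed(query)
    ranked = sorted(index, key=lambda p: cosine(q, p[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]

# Only the retrieved chunk goes into the prompt, not the whole database.
question = "how many vacation days do I get?"
context = retrieve(question)[0]
prompt = f"Answer using this context:\n{context}\n\nQuestion: {question}"
```

This is why no fine-tuning is needed for the use case in the question: the model stays frozen, and the proprietary knowledge arrives at inference time through the retrieved context.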
Ahh, so that's what RAG is. Keep hearing the phrase a lot, thx. Gonna look into it.