overview for happy

table extraction from pdf in c/localllama@poweruser.forum

happy_dreamer10@alien.top 1 point 11 months ago

Thanks :) I don't want to go through the training process. Currently I'm converting it to LaTeX format, which is working pretty well.

table extraction from pdf in c/localllama@poweruser.forum

happy_dreamer10@alien.top 1 point 11 months ago (1 child)

Can it extract tables automatically? Including ones with merged cells?

table extraction from pdf in c/localllama@poweruser.forum

happy_dreamer10@alien.top 1 point 11 months ago (1 child)

Thanks, will check it out. Have you tried it?

table extraction from pdf (alien.top)

submitted 11 months ago by happy_dreamer10@alien.top to c/localllama@poweruser.forum

9 comments

Does anyone know a robust open-source library for extracting tables from PDFs? Even an OCR library is fine.

P.S. I have already tried tabula, camelot, img2table, unstructured.io, and most of the document loaders in LangChain; none of them are even 95% robust.
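
For anyone wanting to compare libraries against that 95% bar, here is a rough sketch of one way to put a number on it: cell-level accuracy against a hand-checked table. This is only an assumption about what "robust" could mean here. The names `normalize` and `cell_accuracy` are made up for illustration and are not part of any of the libraries above, and the sample tables are invented.

```python
from typing import List, Optional

# A table is a list of rows; a cell is a string, or None when the extractor
# returned nothing for that position (common around merged cells).
Table = List[List[Optional[str]]]


def normalize(cell: Optional[str]) -> str:
    """Collapse runs of whitespace and treat None as an empty string."""
    return " ".join((cell or "").split())


def cell_accuracy(extracted: Table, expected: Table) -> float:
    """Fraction of expected cells found at the same row/column position.

    Hypothetical metric: missing rows or columns in `extracted` are compared
    as empty cells, so they only match expected cells that are also empty.
    """
    total = sum(len(row) for row in expected)
    if total == 0:
        raise ValueError("expected table has no cells")

    matches = 0
    for r, expected_row in enumerate(expected):
        got_row = extracted[r] if r < len(extracted) else []
        for c, expected_cell in enumerate(expected_row):
            got = got_row[c] if c < len(got_row) else None
            if normalize(got) == normalize(expected_cell):
                matches += 1
    return matches / total


# Invented sample data: the extractor dropped one of six cells.
expected = [["Name", "Qty"], ["apple", "3"], ["pear", "5"]]
extracted = [["Name", "Qty"], ["apple ", "3"], ["pear", None]]
print(f"cell accuracy: {cell_accuracy(extracted, expected):.2%}")
```

Averaging this over a handful of hand-labelled pages per library would give a comparable score, though it is strict about position, so one shifted column counts every cell in that row as wrong.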