happy_dreamer10

joined 9 months ago
[–] happy_dreamer10@alien.top 1 points 9 months ago

thanks :) i dont want to go through training process , currently i m converting it to latex format which is working pretty fine.

[–] happy_dreamer10@alien.top 1 points 9 months ago (1 children)

can it extract tables automatically ? that too one with merged cells ?

[–] happy_dreamer10@alien.top 1 points 9 months ago (1 children)

thanks will check it out . have you tried it ?

 

anyone knows some robust open source library for extracting tables from pdf , even ocr library is fine

P.S- i have already tried tabula ,camelot , ing2table, unstructured.io and most of the document loader in langchain , none of them are even 95% robust