hkaren
2014-12-30 07:00:46 UTC
Hello everyone.
I am using LocationTextExtractionStrategy for extracting a text from PDF.
In most of all cases it works fine.
However, there are some cases when the iText library returns wrong X
coordinate (disParallelStart/disParallelEnd) for a word.
For Instance an original line from a PDF file is:
01/01/2008 *44566020 TVA A* RECU FFG FFG3801024 004 FFG LF42032
But when I extract it I get the following line
01/01/2008 RECU FFG FFG3801024 004 FFG LF42032 *44566020 TVA A*
As it's visible the *"44566020 TVA A"* part goes to the end of a line after
extraction.
How to solve this problem?
Thanks
Karen
--
View this message in context: http://itext-general.2136553.n4.nabble.com/Wrong-extraction-of-X-disParallelStart-disParallelEnd-coordinate-tp4660642.html
Sent from the iText - General mailing list archive at Nabble.com.
------------------------------------------------------------------------------
Dive into the World of Parallel Programming! The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net
_______________________________________________
iText-questions mailing list
iText-***@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php
I am using LocationTextExtractionStrategy for extracting a text from PDF.
In most of all cases it works fine.
However, there are some cases when the iText library returns wrong X
coordinate (disParallelStart/disParallelEnd) for a word.
For Instance an original line from a PDF file is:
01/01/2008 *44566020 TVA A* RECU FFG FFG3801024 004 FFG LF42032
But when I extract it I get the following line
01/01/2008 RECU FFG FFG3801024 004 FFG LF42032 *44566020 TVA A*
As it's visible the *"44566020 TVA A"* part goes to the end of a line after
extraction.
How to solve this problem?
Thanks
Karen
--
View this message in context: http://itext-general.2136553.n4.nabble.com/Wrong-extraction-of-X-disParallelStart-disParallelEnd-coordinate-tp4660642.html
Sent from the iText - General mailing list archive at Nabble.com.
------------------------------------------------------------------------------
Dive into the World of Parallel Programming! The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net
_______________________________________________
iText-questions mailing list
iText-***@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php