[coptic]Namenra] tyrou> My oun ouon ouontaf `n[/coptic](DJVU-TO-PDF-CONVERTER)? Does anyone have "DJVU-TO-PDF-CONVERTER"? [coptic]oujai qen `P=[=c[/coptic]
dear ophadece, in thinking about this very technical question I got this simple idea: have you tried to print a djvu page to a pdf virtual printer driver?
Document live links may be lost and some image formats/resolutions may change. It is sometimes better to print every page alone.
[coptic]Jon_C `cnau so pamenrit> ]sep`hmot `ntotk emasw - aijemc ouoh acerhwb alla pi`problyma `mmauatf pe ]`skw] an hancaji euvwleb ebol[/coptic] Thank you very much - I found it and it worked but the only problem is that I cannot search for separate words [coptic]oujai qen `P=[=c[/coptic]
Now if the original file is a collection of scanned pages (they are thus images i.e., not text that can be searched) then the converted pdf version will be kept as images. Otherwise I suggest you look for a missed option in the virtual printer properties.
[coptic]JonC `cnau so> ]sem`hmot `ntotk emasw - ;ai te etcohi qen oujwk `nje pejak nyi[/coptic] Thank you very much - it is abolutely right what you told me [coptic]oujai qen `P=[=c[/coptic]
[coptic]Jon-C `cnau so Pamenrit> My `k`s]tott ouoh ouwrp nyi `noumour e;be ou[/coptic](OCR)[coptic] `mprogramma `ntek]co[ni ejen pwc ]naer,rac;e?[/coptic] Can you help me and send me the link for an OCR program giving me advice on how I use it? [coptic]oujai qen `P=[=c[/coptic]
You may like to try this prog above. I once used another prog named ABBYY FineReader OCR - you can also Google for Iris software.
The page is scanned before or via the OCR program, which tries to locate text in the image then reads it, photos (and sometimes tables) to reconstruct it into a typed one with regular editable search able text, perhaps also placed photos, tables and keep the page formatting.
Ideally the scanned page must be clear and of moderate to best resolution esp with bad prints, sometimes it needs some photo editing to enhance it (brightness/contrast, clear background, etc.) - but sometimes a too high resolution does mislead the OCR prog.
When a page is too complex, you can help the prog by making selection(s) using the mouse around area(s) to OCR.
[coptic]Jon pamenrit> ]sep`hmot `ntotk emasw. ai]qack oun emasw ouoh qen `pqae aieratsasni petaiouws[/coptic] I thank you very much. I put you out very much too and in the end I failed to get what I wanted [coptic]oujai qen `P=[=c[/coptic]
I'd still like to help you.... Can you send me a sample of the file so I try things on it (or be very descriptive about this)? You can also send me the details in a PM.
[coptic]Jon pamenrit> ]sep`hmot `ntotk e;be tek`cpody qen pekjin[ont e]tott[/coptic] Thank you for your enthusiasm in you trying to help me [coptic]pijwm `nte Krwm `k`sjemf ejen paimour[/coptic]: http://www.metalog.org/files/crum.html The file of Crum you can find on that link: http://www.metalog.org/files/crum.html [coptic]vai pe `nhouo `nnis] `njwm e;be vai ai`souwrpf an nak[/coptic] It is a very big file that is why I couldn't send it to you [coptic][ont `nqytf ouoh ajoc nyi `ntekouoi[/coptic] Try with it and tell me about your progress [coptic]oujai qen `P=[=c[/coptic]
[coptic]Jon pamenrit> qen oume;myi pilogicmoc `mmauatf pe je ai[iselet qen ourompi etqae ouoh ]ouws an e]hymi e;be nimys `n,ai qen pima`nnat - ]helpic je `kka] eroi[/coptic] In fact the only reason is that I married last year and I don't want to pay for many things on the internet - I hope you understand me [coptic]`ksan]tott> ebolqen pek`hmot joc nyi `mpwc ouoh ]na[ont eerhwb ;yet]`s`iri `n]totk[/coptic] If you can help me please tell me how and I will try doing what I can to help you [coptic]]sep`hmot `ntotk emasw[/coptic] I thank you very much [coptic]oujai qen `P=[=c[/coptic]
Comments
in thinking about this very technical question I got this simple idea: have you tried to print a djvu page to a pdf virtual printer driver?
Document live links may be lost and some image formats/resolutions may change.
It is sometimes better to print every page alone.
http://www.pdfonline.com/easypdf/
http://www.pdfmachine.com/genp/overview.html
http://www.download.com/Adobe-PDF-Printer-Driver-Plug-in/3000-2296_4-10018717.html for Mac OS
GBU
]sep`hmot `ntotk emasw - aijemc ouoh acerhwb alla pi`problyma `mmauatf pe ]`skw] an hancaji euvwleb ebol[/coptic]
Thank you very much - I found it and it worked but the only problem is that I cannot search for separate words
[coptic]oujai qen `P=[=c[/coptic]
You do use this WinDjView?
http://windjview.sourceforge.net/
Now if the original file is a collection of scanned pages (they are thus images i.e., not text that can be searched) then the converted pdf version will be kept as images. Otherwise I suggest you look for a missed option in the virtual printer properties.
GBU
]sem`hmot `ntotk emasw - ;ai te etcohi qen oujwk `nje pejak nyi[/coptic]
Thank you very much - it is abolutely right what you told me
[coptic]oujai qen `P=[=c[/coptic]
What is useful then is to try an OCR program (may detect the text out of a scanned page image).
GBU
My `k`s]tott ouoh ouwrp nyi `noumour e;be ou[/coptic](OCR)[coptic] `mprogramma `ntek]co[ni ejen pwc ]naer,rac;e?[/coptic]
Can you help me and send me the link for an OCR program giving me advice on how I use it?
[coptic]oujai qen `P=[=c[/coptic]
http://www.softi.co.uk/freeocr.htm
this is freeware, that does NOT mean bad but rarely may be less functionality than a high end
You may like to try this prog above. I once used another prog named ABBYY FineReader OCR - you can also Google for Iris software.
The page is scanned before or via the OCR program, which tries to locate text in the image then reads it, photos (and sometimes tables) to reconstruct it into a typed one with regular editable search able text, perhaps also placed photos, tables and keep the page formatting.
Ideally the scanned page must be clear and of moderate to best resolution esp with bad prints, sometimes it needs some photo editing to enhance it (brightness/contrast, clear background, etc.) - but sometimes a too high resolution does mislead the OCR prog.
When a page is too complex, you can help the prog by making selection(s) using the mouse around area(s) to OCR.
GBU
edited:
I think this one has the best versatility
http://www.irislink.com/c2-670-225/Readiris-Pro-11-Corporate-Middle-East---Features.aspx
]sep`hmot `ntotk emasw. ai]qack oun emasw ouoh qen `pqae aieratsasni petaiouws[/coptic]
I thank you very much. I put you out very much too and in the end I failed to get what I wanted
[coptic]oujai qen `P=[=c[/coptic]
I'd still like to help you....
Can you send me a sample of the file so I try things on it (or be very descriptive about this)?
You can also send me the details in a PM.
And you're much welcome!
GBU
]sep`hmot `ntotk e;be tek`cpody qen pekjin[ont e]tott[/coptic]
Thank you for your enthusiasm in you trying to help me
[coptic]pijwm `nte Krwm `k`sjemf ejen paimour[/coptic]: http://www.metalog.org/files/crum.html
The file of Crum you can find on that link: http://www.metalog.org/files/crum.html
[coptic]vai pe `nhouo `nnis] `njwm e;be vai ai`souwrpf an nak[/coptic]
It is a very big file that is why I couldn't send it to you
[coptic][ont `nqytf ouoh ajoc nyi `ntekouoi[/coptic]
Try with it and tell me about your progress
[coptic]oujai qen `P=[=c[/coptic]
At first sight the English section seems to be the easiest to work on.
Proposed in electronic format (at top of web page):
www.logos.com/products/prepub/details/2529
Have you checked this?
I'll reply back here when I have a good solution.
GBU
qen oume;myi pilogicmoc `mmauatf pe je ai[iselet qen ourompi etqae ouoh ]ouws an e]hymi e;be nimys `n,ai qen pima`nnat - ]helpic je `kka] eroi[/coptic]
In fact the only reason is that I married last year and I don't want to pay for many things on the internet - I hope you understand me
[coptic]`ksan]tott> ebolqen pek`hmot joc nyi `mpwc ouoh ]na[ont eerhwb ;yet]`s`iri `n]totk[/coptic]
If you can help me please tell me how and I will try doing what I can to help you
[coptic]]sep`hmot `ntotk emasw[/coptic]
I thank you very much
[coptic]oujai qen `P=[=c[/coptic]