![]() Hence, it is not the best tool to extract the text and the images from a PDF.Īnyway, the script will ask if you want a reflowable text ePub or a fixed layout ePub if you install the Calibre software ( apt install calibre) or if you use the image dodeeric/pdf2epubex:calibre (image much bigger than dodeeric/pdf2epubex). pdf2htmlEX is THE tool to maintain the original layout. ![]() This script is converting a PDF to a fixed layout ePub. More about fixed layout (FXL) ePub version 3 specifications (IDPF / W3C): Fixed Layouts (EPUB Content Documents 3.2) and Fixed-Layout Properties (EPUB Packages 3.2). Please also note that there is a Web browser reader available. Please note that: a) the ePub file has to pass a pre-check to be able to be hosted in the Google cloud b) if you upload a PDF, all pages (text + images) will be converted into images (the text and vector images are rasterized, and no hidden text layer will be added: it means no text search or copy/paste is possible). ![]() You can also upload eBooks from the Google Play Books web interface (see the Upload files button on the top right corner). The uploaded eBooks (PDF or ePub) will be available on all devices using the same Google account. To use Google Play Books, you have to go to Settings, then set Enable uploading. A smartphone is not adapted most of the time because of the too small screen size.Ī lot of ePub reader apps exist (to read reflowable text ePub and fixed layout ePub) available on different platforms (Android, iOS, Windows, MacOS, or Linux): Google Play Books, BookShelf, PocketBook, Adobe Digital Editions, Apple Books (only on iOS formely known as Apple iBooks), etc.Īmazon Kindle does not support the standard ePub format (they have their own format which is based on the ePub format). To read a fixed layout ePub, the best device is a tablet (Android or iOS/iPad). It is available on Amazon and on Googgle Play Books. The script is based on the method described in my book published in 2014: Fixed Layout ePub: A Practical Guide to Publish eBooks from PDF Files. JPG (bitmap format, compression with loss):.PNG (bitmap format, lossless compression):.Vector image quality in different formats (zoom of 500 %): (248 pages, lot of bitmap images in the PDF) (49 pages, bitmap and vector images in the PDF) (24 pages, only vector images in the PDF) ePub written in bold: the recommended ePub version.This does not mean the ePub will not be displayed properly in most ePub readers. Hashtag in parentheses: the ePub file does not pass the epub check validation using version epub 3.2 rules (commands not allowed in some svg files).Number in parentheses: the size of the file in MB.Sometime, ePub is referred as "website in a box". svg) that's in fact what basicaly the pdf2epubEX script does before wripping all the files in one ePub container file (.epub). pdf2htmlEX can also put all that content in different files (.html. In the examples below, the HTML version is one big file including everything (all the pages with HTML5, CSS, JS, fonts and images fonts and images are coded in Base64, which can make the file quite big). The ePub cover image will be made from the first page of the PDF file (png format). For eBooks with mainly vector images, it is better to chose PNG (lossless compression). if you chose svg (vector and bitmap format), the vector images of the PDF will remain in vector format, but: a) you cannot chose the resolution of the bitmap images (it is the one from the PDF) b) the bitmap images will be included in the svg files (Base64 coded) c) this format is not always correctly rendered by eBook readers d) the generated epub file is not always passing the epub check.Ī vector image can be as simple as a line, a rectangle, a table frame, a colored background, etc.įor eBooks with a lot of bitmap images, it is better to chose JPG (compression with loss) to not have a file too big.if you chose png or jpg (bitmap formats), the vector images of the PDF will be converted in bitmap images (rasterized).If you want, you can hit ENTER to all the questions. Title, author, publisher, year, language, ISBN number, subject.Resolution of the images in the epub in dpi (e.g.: 150 or 300).Format of the images in the epub (png, jpg or svg).Once you launch pdf2epubEX, some information will be displayed like the book/PDF width and height (in inches and cm), then some questions will be asked like: Remark: use the dodeeric/pdf2epubex:original Docker image to use the original version of pdf2htmlEX (coolwanglu). Docker run -ti -rm -v `pwd`:/temp dodeeric/pdf2epubex pdf2htmlEX myfile.pdf
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |