Archive for the ‘PDF Library’ category

Memory and speed related improvements in Docotic.Pdf 5.4

Hi,

We have just released a new version of the Docotic.Pdf library on our site and on NuGet.

The main changes in Docotic.Pdf 5.4 reduce memory consumption. We also worked very hard to make opening existing documents faster.

To achieve these improvements, we had to mark some methods and properties obsolete. No functionality was lost, because we added an alternative for everything we marked obsolete. Please review the changes and migrate to the alternative methods. We will remove the obsolete methods and properties in the next release of the library.

In brief, random access to images, widgets, controls, patterns, color profiles, actions, fonts and views at the document level should no longer be used. The same is true for images at the page level. Instead, please access all of these sequentially using the new enumerating methods.

We added the ability to remove duplicate fonts. This should help compress PDF documents even better. Please let us know how this new feature works for you.

As usual, text processing got better in the new version. We fixed copying of text with invalid or incomplete encoding information, and you no longer need to use fallback fonts to copy existing text. Take a look at the updated Copy text, paths and images sample code.

As always, we improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.4 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Docotic.Pdf 5.3 Breaking changes, improvements and bug fixes

Hello,

We have released Docotic.Pdf 5.3 on our site and on NuGet.

This release contains some breaking changes. These changes probably won’t affect every customer, but please read about them anyway.

The first breaking change is that the MediaBox, CropBox, BleedBox, TrimBox and ArtBox properties of the PdfPage class now return a PdfBox structure, which is expressed in PDF’s default user space coordinate system. In default user space, the positive Y axis extends vertically upward. Previously, these properties returned a PdfRectangle structure, which is expressed in the PdfPage coordinate system. In the PdfPage coordinate system, the origin is located at the upper-left corner of the page and the positive Y axis extends vertically downward.
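To make the difference between the two coordinate systems concrete, here is a minimal sketch in Python. It is not the library's API: the function name and the tuple layouts are hypothetical, and it assumes an unrotated page whose lower-left corner sits at the default user space origin.

```python
def top_left_rect_to_user_space(left, top, width, height, page_height):
    """Convert a rectangle measured from the upper-left corner (Y grows downward)
    into (left, bottom, right, top) edges measured from the lower-left corner
    (Y grows upward).

    Assumption: the page is not rotated and its lower-left corner is at (0, 0).
    """
    bottom_edge = page_height - (top + height)
    top_edge = page_height - top
    return (left, bottom_edge, left + width, top_edge)


# A 100 x 50 rectangle placed 20 points below the top edge of a 792-point-tall page.
box = top_left_rect_to_user_space(10, 20, 100, 50, page_height=792)
print(box)  # (10, 722, 110, 772)
```

Only the vertical values change. Code that reads these properties and then works in the top-left-origin system would need the reverse flip, again assuming the page box starts at the origin.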

Other breaking changes are also related to coordinates. These changes affect properties and methods of the PdfDocumentView class.

Please review the complete list of breaking changes in the Version History document.

This release brings improvements to text extraction speed. Docotic.Pdf 5.3 opens documents faster. And it draws documents with blend modes and soft masks faster, too.

We added two small features: now you can make text boxes scrollable and you can set mapping names for controls.

As always, we improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.3 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Improved support for text markup annotations in Docotic.Pdf 5.2

Hello,

We released Docotic.Pdf 5.2 on our site and on NuGet.

In the new version we added the ability to create text markup annotations. You can create highlight, jagged underline, strikeout and underline annotations. Take a look at the Text markup annotations sample for more detail. In addition, it is now possible to modify existing text markup annotations. For that, we added the SetTextBounds method and the Color and Contents properties to the PdfTextMarkupAnnotation class.

The new version can import 16-bit TIFF images without any loss. It can also extract 16-bit images from existing documents.

To give our customers more diagnostic tools, we added the PdfDocument.Error event. You can subscribe to the event and receive notifications when the library detects an error in the PDF document structure or can’t process the PDF properly for some reason. It’s a good idea to send all documents for which this event occurs to Bit Miracle.

There are other changes. Read about all new features and improvements in Docotic.Pdf 5.2 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Docotic.Pdf and .NET 2

Hi,

We are starting to move away from .NET 2.

Almost 12 years have passed since version 2.0 was released, and a lot has changed since then. Microsoft keeps improving the framework and the C# language. With all the new features and improvements, we can no longer justify using the outdated version of the framework.

We will be happy to hear your opinion about the following roadmap, so please feel free to contact us using e-mail or the support form.

The roadmap is as follows:

  1. Docotic.Pdf 5.1, released on February 11th, will be the last version with full .NET 2 support.
  2. Next versions will target .NET 4.0 or newer and .NET Standard 1.3/.NET Core/ASP.NET Core.
  3. We will continue to ship the .NET 2 version of the library for a year (until February 2018), but that version won’t include any new features, only bug fixes.
  4. Customers using the .NET 2 version of the library will receive fixes for found bugs until February 2018.

So, basically, we will fix bugs in the .NET 2 version for one more year. We will drop all support for .NET versions earlier than .NET 4.0 in February 2018.

We believe most of our customers already use .NET 4 or newer, so the move won’t bring trouble to anyone.


Docotic.Pdf 5.1 with a brand new JPEG 2000 codec

Hello,

We have published Docotic.Pdf 5.1 on our site and on NuGet.

The new version comes with a brand new JPEG 2000 codec. The codec is much better than the one we had in previous versions of the library. The decoder part of the new codec allowed us to fix some nasty bugs that affected drawing of documents with particular JPEG 2000 images.

The good news is that our JPEG 2000 codec now has an encoder part, too. We added PdfImage.RecompressWithJpeg2000() methods, so you can recompress images using JPEG 2000 compression, either lossless or with a specified compression ratio. You can also compress images with JPEG 2000 when adding them to a document. PdfImageCompression.Jpeg2000 is now properly supported in that use case.

Just as some of our customers asked, we added custom exception classes for specific error groups. All the new exception classes are descendants of the PdfException class, so the change is not a breaking one. Here is the list of the new classes: CannotDecryptPdfException, CannotParseFdfException, CannotShowTextException, IncorrectPasswordException, UnexpectedStructureException, UnsupportedFontException, UnsupportedImageException, UnsupportedMetadataException.
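The reason this change is not a breaking one can be shown with a small language-neutral sketch, written here in Python rather than C#. The classes below are stand-ins that only borrow the names from the list above; they are not the library's types, and the `describe` helper is hypothetical.

```python
class PdfException(Exception):
    """Stand-in for the base exception class that existing code already catches."""


class IncorrectPasswordException(PdfException):
    """Stand-in for one of the new, more specific exception classes."""


class UnsupportedFontException(PdfException):
    """Stand-in for another new, more specific exception class."""


def describe(error):
    """Raise the given error and report which handler caught it."""
    try:
        raise error
    except IncorrectPasswordException:
        # New code can react to one specific error group.
        return "ask for another password"
    except PdfException:
        # Handlers written for the base class keep catching everything else.
        return "generic PDF failure"


print(describe(IncorrectPasswordException()))
print(describe(UnsupportedFontException()))
```

A handler written for the base class before the new classes existed still catches every one of them, so existing error handling keeps working unchanged.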

In Docotic.Pdf 5.1 we added the ability to draw a whole PDF document to a multipage TIFF image. The PdfDocument.SaveAsTiff() methods produce one TIFF from all pages of a PDF document. The produced TIFF will consist of multiple pages (frames).

With the latest version you can extract visible text only. Please take a look at the new PdfTextExtractionOptions.SkipInvisibleText property. Use PdfDocument.GetText(PdfTextExtractionOptions) or PdfPage.GetText(PdfTextExtractionOptions) methods to extract visible text only.

There are other changes. Read about all new features and improvements in Docotic.Pdf 5.1 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Docotic.Pdf 5 now includes a version for .NET Standard Library and the ability to extract vector paths

Hi Everyone,

We have just released Docotic.Pdf 5.0.

This release includes a version for .NET Standard Library 1.3. So, starting from 5.0, Docotic.Pdf can be used in .NET Core and Universal Windows Platform apps and services. This is in addition to .NET 2 and later frameworks, of course.

Our customers asked us to add the ability to extract vector paths from documents. And here it is. By the way, you can extract clip regions, too. Take a look at the Extract text, paths and images sample to get an idea about how it works.

Version 5 adds the ability to open and create PDF documents encrypted with AES-256. This should come in handy if you need stronger security algorithms in your workflow.

We also fixed a number of issues related to drawing and extraction of images. And, as always, text extraction was improved.

There are other changes. Read about all new features and improvements in Docotic.Pdf 5.0 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Support for PDF layers, improvements in forms handling and more precise drawing

Hello everyone,

I am pleased to announce that Docotic.Pdf 4.9 is out on our site and on NuGet.

In the latest version we added support for layers (also known as optional content groups). The library can now create layers, enumerate existing layers, and add widgets, controls and XObjects to layers. Check the Layers group of samples for examples of what was added.

Starting from Docotic.Pdf 4.9 the library can draw documents with soft masks properly. And we also improved support for blend modes and transparency groups. By the way, the new version can also remove transparency groups from PDF documents. This feature is important if you are converting existing documents to PDF/A.

We significantly improved forms handling in the library. The library does a better job of parsing controls and their associated actions. We fixed issues related to drawing of forms and changing properties of existing controls. And we didn’t miss the opportunity to fix some bugs in form filling and flattening.

There are a number of fixes for text extraction and drawing. They include fixes for vertically written, stroked or rotated text.

We also fixed image-related issues. The new version handles JPEG, JBIG2 and PNG images (including images in CMYK color space) better.

There are quite a lot of other changes. Read about all new features and improvements in Docotic.Pdf 4.9 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Form flattening and other new features in Docotic.Pdf 4.8

Hi!

We released a new version of the Docotic.Pdf library on our site and on NuGet.

This version adds the ability to flatten PDF forms. Flattening locks controls from editing. Flattening can significantly reduce file size if the document is then saved with the PdfSaveOptions.RemoveUnusedObjects option set to true. To flatten form fields in a PDF file, please use the PdfDocument.FlattenControls() method.

We also added support for list boxes (there is a Listboxes sample illustrating how to create them). The PdfCheckbox class got a new ExportValue property. You can use this property to retrieve the export value of a checkbox.

Docotic.Pdf 4.8 fixes a number of issues related to processing of controls and widgets. The new version parses, draws and fills PDF forms better than before.

The new version fixes some issues related to extraction of JPEG and CCITT (Fax) images. Text drawing and text extraction also got better. And the library is now better prepared to work with PDFs that are not completely correct.

Read about all new features and improvements in Docotic.Pdf 4.8 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Better image and text processing in Docotic.Pdf 4.7

Hello!

We released a new version of the Docotic.Pdf library on our site and on NuGet.

The new version adds the ability to specify where the library should keep its intermediate data. You can tell the library to keep all the data in memory, or to keep smaller streams in memory and other streams in temporary files. The latter is the default option. You can also create your own stream provider and tell the library to use it. You can read more about this in the documentation for the IPdfStreamProvider interface and the PdfConfigurationOptions class.
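The idea of keeping smaller streams in memory and other streams in temporary files can be sketched as follows. This is a conceptual illustration in Python, not the IPdfStreamProvider interface: the function name, the `expected_size` parameter and the 64 KB limit are assumptions made for the example, and the library's actual criteria are not described here.

```python
import io
import tempfile


def create_intermediate_stream(expected_size, memory_limit=64 * 1024):
    """Return an in-memory stream for small data and a temporary file otherwise.

    Assumption: the caller knows the approximate size in advance; the 64 KB
    limit is an arbitrary value chosen for this sketch.
    """
    if expected_size <= memory_limit:
        return io.BytesIO()
    # The temporary file is deleted automatically when it is closed.
    return tempfile.TemporaryFile()


small = create_intermediate_stream(1024)
large = create_intermediate_stream(10 * 1024 * 1024)

# Both objects support the same write/seek/read calls, so the calling code
# does not need to know where the bytes are actually stored.
for stream in (small, large):
    stream.write(b"intermediate data")
    stream.seek(0)
    print(stream.read())
    stream.close()
```

A custom provider would follow the same pattern: it decides where each stream lives, while the rest of the code only sees a stream.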

We fixed a number of issues related to image processing. The library now draws and extracts images better, uses less memory to recompress them, and is better prepared to handle images that are not exactly correct.

Docotic.Pdf 4.7 fixes some text-related issues and, thanks to some of our customers, we were able to improve support for CJK text drawing and extraction.

This version also does a better job of drawing documents with transparency groups and documents that use blend modes.

And as always, we improved the library so it can work with more of those not completely correct PDFs created by unknown generators.

Read about all new features and improvements in Docotic.Pdf 4.7 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.


Improved memory management in Docotic.Pdf 4.6

Hello!

The new version of the Docotic.Pdf library has been published on our site and on NuGet.

A substantial number of the improvements in Docotic.Pdf 4.6 are related to memory management. The latest version uses much less memory to open or draw PDF documents. Extraction of JPEG 2000 and JBIG2 images also uses less memory in Docotic.Pdf 4.6.

The new version also contains improvements to text extraction, text rendering and font processing features. Version 4.6 handles Japanese text better than any previous version.

We also dedicated some time to rendering features of the library. The latest release draws text, patterns, XObjects, and forms better. Just better.

Read about all new features and improvements in Docotic.Pdf 4.6 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.
