Latest news

Docotic.Pdf 5.7 extracts text better

Hi,

We have released Docotic.Pdf 5.7 on our site and on NuGet.

In this version we added ability to extract text as vector paths. For this we added PdfPage.GetObjects(PdfObjectExtractionOptions) method and PdfObjectExtractionOptions.ExtractTextAsPath property. Please take a look at them.

We also improved extraction of text with vertical writing mode. And fixed some bugs related to text extraction.

There are new features and improvements related to forms. Now you can flatten individual form fields using PdfControl.Flatten() method. The PdfDocument.GetControl method now performs searches not only by control name but also by control full name. Thanks to our customers, we fixed some forms filling related bugs.

As our users suggested, in the new version we added ability to extract file specifications associated with rich media annotations. Take a look at the new PdfRichMediaAnnotation class. And ability to extract raw contents of XMP metadata using one of the new XmpMetadata.Extract() methods.

As always, we improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.7 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

AES-256 and speed improvements in Docotic.Pdf 5.6

Hello,

We have released Docotic.Pdf 5.6 on our site and on NuGet.

The new version adds support for documents encrypted with AES-256. This encryption algorithm is usually found in PDF 2.0 documents. Now Docotic.Pdf can read such documents.

In version 5.6 we also paid a lot of attention to memory and speed improvements. The new version of the library uses less memory when opening documents. It also opens them faster.

The new version brings ability to cache form XObjects. This cache can improve speed of processing. The cache can either increase or decrease memory consumption depending on the contents of the PDF. We suggest you take a look at the PdfCache class and the PdfConfigurationOptions.XObjectCache property. After that you could try and see for yourself if enabling the cache helps in your case. It will be great if you tell us about your experience with the cache.

We made some improvements for documents that use blend modes. Now Docotic.Pdf draws them faster.

There are new properties in PdfEmbeddedFile and PdfFileSpecification classe. These properties were added to help creating Factur-X / ZUGFeRD invoices or similar documents with Docotic.Pdf library.

As always, we improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.6 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Welcome Jpeg2000.Net

Hello,

I am pleased to announce that we have just released first version of Jpeg2000.Net.

Jpeg2000.Net is our new royalty-free library for JPEG 2000 file format. The library can compress and decompress JPEG 2000 images. It supports lossy and lossless compression.

With help of the library you can decode only part of an image. Or you can decode only specified number of quality layers. There are other advanced options available.

The library is 100% managed, without unsafe blocks and without external dependencies. It is available for WinForms, WPF and ASP.NET. A version for .NET Standard Library / .NET Core / ASP.NET Core is also available.

We encourage you to download and try the first version of Jpeg2000.Net. The library is also available on NuGet.

Please tell us your thoughts about the library using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Docotic.Pdf 5.5 brings improvements in version for .NET Standard, new features and more

Hi,

The new version of Docotic.Pdf library is now available on our site and on NuGet.

Version for .NET Standard 2.0 now has all the same features as in the version for .NET 4. It means that PDF to image conversion, extraction of images with masks, scaling and resizing of images, creation of image masks, loading of fonts from the collection of system fonts are now supported in the version for .NET Standard 2.0.

Some of our customers suggested us to provide a way to save PDF documents to bitonal TIFFs. Such images are also known as black-and-white or monochrome. For documents with mostly text content monochrome TIFFs can save a lot of space without affecting visual quality. We heard what our customers wanted and added the ability to save whole PDF files or individual PDF pages as black and white TIFFs. There is a new sample code for the feature.

We clarified and extended interface of the PdfDrawOptions class. It includes marking a few properties obsolete. Please check the Version History document for more info about what was marked obsolete.

As always, we worked on speed improvements. Text extraction got better. We improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.5 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Memory and speed related improvements in Docotic.Pdf 5.4

Hi,

We have just released new version of Docotic.Pdf library on our site and on NuGet.

Main changes in Docotic.Pdf 5.4 are memory consumption improvements. We also worked very hard to improve the speed of opening existing documents.

In order to achieve the improvements, we had to mark some methods and properties obsolete. No functionality loss happened, since we added alternative methods for everything we marked obsolete. Please review the changes and migrate to the alternative methods. We will remove obsolete methods and properties in the next release of the library.

Briefly speaking, random access to images, widgets, controls, patterns, color profiles, actions, fonts and views on document level should no longer be used. The same is true for images on page level. Instead, please access all these in sequential way using new enumerating methods.

We added ability to remove font duplicates. This should help to compressing PDF documents even better. Please let us know how this new feature works for you.

As usual, text processing become better in the new version. We fixed copying of text with invalid or incomplete encoding information. And you no longer need to use fallback fonts to copy existing text. Take a look at the updated Copy text, paths and images sample code.

As always, we improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.4 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Docotic.Pdf 5.3 Breaking changes, improvements and bug fixes

Hello,

We have released Docotic.Pdf 5.3 on our site and on NuGet.

This release contains some breaking changes. These changes probably won’t affect each and every of our customers but please read about the changes anyway.

The first breaking change is: MediaBox, CropBox, BleedBox, TrimBox and ArtBox properties of the PdfPage class now return PdfBox structure, which is expressed in PDF’s default user space coordinate system. In default user space the positive Y axis extends vertically upward. Previously these properties returned PdfRectangle structure, which is expressed in PdfPage coordinate system. In PdfPage coordinate system origin is located at the upper-left corner of the page, positive Y axis extends vertically downward.

Other breaking changes are also related to coordinates. These changes affect properties and methods of PdfDocumentView class.

Please review complete list of breaking changes in the Version History document.

This release brings improvements to text extraction speed. Docotic.Pdf 5.3 opens documents faster. And it draws documents with blend modes and soft masks faster, too.

We added two small features: now you can make text boxes scrollable and you can set mapping names for controls.

As always, we improved support for broken and incorrect documents. And we fixed some bugs of our own.

Read about all new features and improvements in Docotic.Pdf 5.3 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Improved support for text markup annotations in Docotic.Pdf 5.2

Hello,

We released Docotic.Pdf 5.2 on our site and on NuGet.

In the new version we added ability to create text markup annotations. You can create highlight, jugged underline, strikeout and underline annotations. Take a look at the Text markup annotations sample for more detail. In addition, now it is possible to modify existing text annotations. We added SetTextBounds method and Color and Contents properties to PdfTextMarkupAnnotation class for that.

The new version can import 16 bit TIFF images without any loss. And it can extract 16 bit images from existing documents.

To provide our customers with more diagnostic means, we added PdfDocument.Error event. You can subscribe to the event and receive notifications when the library detects an error in the PDF document structures and/or when the library can’t process the PDF properly for some reason. It’s a good idea to send all documents for which this event occurs to Bit Miracle.

There are other changes. Read about all new features and improvements in Docotic.Pdf 5.2 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Docotic.Pdf and .NET 2

Hi,

We are starting to move away from .NET 2.

Almost 12 years passed since version 2.0 was released and a lot have changed since then. Microsoft keeps improving the framework and the C# language. And with all the new features and improvements we no longer can justify the use of the outdated version of the framework.

We will be happy to hear your opinion about the following roadmap, so please feel free to contact us using e-mail or the support form.

The roadmap is as follows:

  1. Docotic.Pdf 5.1 released on February 11th will be the last version with full .NET 2 support
  2. Next versions will target 4.0 or newer and .NET Standard 1.3/.NET Core/ASP.NET Core
  3. We will continue to ship NET 2 version of the library for a year (till February 2018) but the version won’t include any new features, only bug fixes.
  4. Customers using .NET 2 version of the library will receive fixes for found bugs until February 2018.

So, basically, we will fix bugs in .NET 2 version for a year more. We will drop any support for .NET versions earlier than .NET 4.0 in February 2018.

We believe most of our customers already use .NET 4 or newer, so the move won’t bring trouble to anyone.

Posted in

Docotic.Pdf 5.1 with brand new JPEG 2000 codec

Hello,

We have published Docotic.Pdf 5.1 on our site and on NuGet.

The new version comes with brand new JPEG 2000 codec. The codec is much better than the one we had in previous versions of the library. The decoder part of the new codec allowed us to fix some nasty bugs which affected drawing of documents with particular JPEG 2000 images.

The good news is we now have encoder part in our JPEG 2000 codec, too. We added PdfImage.RecompressWithJpeg2000() methods, so you can recompress images using JPEG 2000 compression either lossless or using specified conversion ratio. You can also recompress images with JPEG 2000 when adding them to document. PdfImageCompression.Jpeg2000 is now properly supported in such a use case.

Just as some of our customers asked, we added custom exception classes for specific error groups. All the new exception classes are descendants of the PdfException class, so the change is not a breaking one. Here is the list of the new classes: CannotDecryptPdfException, CannotParseFdfException, CannotShowTextException, IncorrectPasswordException, UnexpectedStructureException, UnsupportedFontException, UnsupportedImageException, UnsupportedMetadataException.

In Docotic.Pdf 5.1 we added ability to draw whole PDF document to multipage TIFF image. The PdfDocument.SaveAsTiff() methods produce one TIFF from all pages of a PDF document. The produced TIFF will consist of multiple pages (frames).

With the latest version you can extract visible text only. Please take a look at the new PdfTextExtractionOptions.SkipInvisibleText property. Use PdfDocument.GetText(PdfTextExtractionOptions) or PdfPage.GetText(PdfTextExtractionOptions) methods to extract visible text only.

There are other changes. Read about all new features and improvements in Docotic.Pdf 5.1 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in

Docotic.Pdf 5. Now includes version for .NET Standard Library and ability to extract vector paths.

Hi Everyone,

We have just released Docotic.Pdf 5.0.

This release includes version for .NET Standard Library 1.3. So, starting from 5.0, Docotic.Pdf can be used in .NET Core and Universal Windows Platform apps and services. This is in addition to .NET 2 and later frameworks, of course.

Our customers asked us to add the ability to extract vector paths from documents. And here it is. By the way, you can extract clip regions, too. Take a look at the Extract text, paths and images sample to get an idea about how it works.

Version 5 adds ability to open and create PDF documents encrypted with AES-256. This should come in handy if you need stronger security algorithms in your workflow.

We also fixed number of issues related to drawing and extraction of images. And as always, text extraction was also improved.

There are other changes. Read about all new features and improvements in Docotic.Pdf 5.0 in the Version History document.

We encourage you to download and try the new version. This version is also available on NuGet.

Please tell us your thoughts about the new version using e-mail or via the support form. Don’t hesitate to write us your questions, suggest features or ask for help.

Posted in