KeyView Viewing SDK

23.3.0

New in this Release

  • KeyView format detection has been extended, with support for 61 additional file formats. Identifying a larger range of the formats present in the enterprise lets you make better decisions about how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • The third-party sqlite library was upgraded to version 3.42.0.

  • The third-party libical library was upgraded to version 3.0.16.

    NOTE: This change was implemented in KeyView 23.2, but was not included in the release notes.

  • The third-party Apache Arrow library was upgraded to version 11.0.0. This change includes upgrades to the following dependencies: 

    • boost was upgraded to version 1.75.0

    • brotli was upgraded to version 1.0.9

    • jemalloc was upgraded to version 5.3.0

    • re2 was upgraded to version 2022-06-01

    • thrift was upgraded to version 0.16.0

    • utf8proc was upgraded to version 2.7.0

    • Zstandard was upgraded to version 1.5.2

    NOTE: This change was implemented in KeyView 23.2, but was not included in the release notes.

Resolved Issues

  • (Security update) KeyView depended on outdated LZMA code for detection of 7z encryption. This dependence has been removed.

  • When using the pdfsr reader to process PDFs that contained fonts with certain predefined encodings, spaces sometimes appeared in the wrong places in output text.

  • When using the pdfsr reader to process PDFs, output text was sometimes broken onto additional new lines or missed characters.

  • KeyView leaked memory when filtering some Apple iWork Keynote (.key) (IWPG_Fmt) files.

  • KeyView incorrectly identified some attachments to Outlook .msg files as being inline pictures rather than non-inline attachments.

  • Changing the global locale could cause KeyView to fail to process some file formats.

  • When attempting to filter NIST_ITL_Fmt files, KeyView returned KVERR_FormatNotSupported.

  • Some child windows were not displayed in the viewing window. For example, this issue meant that mail bodies were not displayed and attachments could not be opened.

  • When viewing Microsoft Excel documents (XLSX) that contained merged cells, KeyView could exit unexpectedly.

  • When processing HTML files that specify an ISO-2022 character encoding, KeyView output some entities as unprocessed strings, rather than as the correct Unicode characters.

23.2.0

New in this Release

  • KeyView format detection has been extended, with support for 56 additional file formats. Identifying a larger range of the formats present in the enterprise lets you make better decisions about how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • KeyView has been simplified so that it is much easier to map file formats to readers. File formats no longer have an associated "category". The KeyView configuration files such as formats.ini, formats_e.ini, and kvsdk.ini now identify file formats using the same file format numbers that are returned by format detection.

  • KeyView can now extract all platform-specific embedded files from PDF_Fmt documents.

  • KeyView can now process certain OpenOffice Text files that it would previously reject, and it can process image alt text in OpenOffice Text as hidden text.

  • KeyView now supports TIFF (TIFF_Fmt) files that use WebP compression.

  • Handling of Arabic diacritics (tashkil) has been significantly improved when using the pdfsr reader to process PDFs.

  • Text ordering has been improved when using the pdfsr reader to process PDFs.

  • The FreeType third-party library has been upgraded to version 2.12.1.

  • The ODA third-party library has been upgraded to version 2023.12.

  • The zlib third-party library has been upgraded to version 1.2.13.

  • The libxml2 third-party library has been upgraded to version 2.10.3.

  • The expat third-party library has been upgraded to version 2.5.0.

  • The ICU third-party library has been upgraded to version 72.1.

  • The openssl third-party library has been upgraded to version 3.0.8.

  • The libde265 third-party library has been upgraded to version 1.0.11.

  • The XMP-Toolkit third-party library has been upgraded to version 2022.06.

  • The wavpack third-party library has been upgraded to version 5.6.0.

  • The sqlite third-party library has been upgraded to version 3.41.0.

Resolved Issues

  • (Security update) The third-party libtiff library has been upgraded to version 4.5.0 to resolve known vulnerabilities, including CVE-2022-2056, CVE-2022-2057, CVE-2022-2058, CVE-2022-3452, CVE-2022-3570, CVE-2022-3597, CVE-2022-3598, CVE-2022-3599, CVE-2022-3626, and CVE-2022-3627.

  • (Security update) The third-party protobuf library has been upgraded to version 3.21.12 to resolve known vulnerabilities, including CVE-2022-1941.

  • (Security update) The libjpeg third-party library has been upgraded to version 9e to resolve potential vulnerabilities.

  • (Security update) The libwebp third-party library has been upgraded to version 1.3.0.

  • KeyView could truncate long sections of text in PDF_Fmt documents.

  • KeyView did not retrieve the Image Width, Image Height, and Bits Per Pixel fields in summary information from Tagged Image File Format (TIFF_Fmt) files.

  • KeyView did not correctly display TIFF documents that contained images in multiple colour spaces.

  • For some Microsoft Excel (XLSX) files with many cells that use Rich Data Types, KeyView output the names of those types incorrectly, using a number instead of a type name.

  • In word processor formats such as Microsoft Word (DOCX), embedded images with text wrapping around them were sometimes rendered above the text, obstructing it. To avoid this, the Viewing SDK now places images at the start of paragraphs to ensure that the text is readable, making the output more consistent with HTML Export.

  • KeyView could skip some user-defined properties in summary information for some OLE-based file formats, such as MS_Project_2007_Fmt.

  • When using the pdfsr reader for text in right-to-left languages, diacritic characters were sometimes not extracted correctly.

  • Some PDF files took longer to process in version 12.13.0 of the SDK than in version 12.12.0.

  • KeyView could exit unexpectedly when processing some Microsoft Visio (.vsd) files.

  • KeyView missed text from some Microsoft Visio (.vsd) files.

  • Some base-64 encoded attachments to ICS files were extracted incorrectly.

  • KeyView could report duplicate metadata from Tagged Image File Format (TIFF) files with multiple pages.

  • KeyView could output incorrect metadata names for some PDF files.

  • KeyView could omit metadata entries for some PDF files.

  • When using the pdfsr reader to process PDFs that contained right-to-left (RTL) text, some text at the top of the file was not included in the output.

  • KeyView could process some CSV files incorrectly, so that fields were output in the wrong columns.

  • The reader kpPDF2rdr was not thread-safe.

  • The reader kpPDF2rdr failed to release memory each time a file was processed.

  • The reader kpPDF2rdr could output incorrect values for page width and height.

Deprecated Features

The following features are deprecated and might be removed in a future release.

Category: Readers

Deprecated Since: 23.2.0

Deprecated Feature: The following readers have been deprecated:

  • cebsr

  • lwpsr