NewswireToday - /newswire/ -
Richmond, VA, United States, 2012/08/08 - Compiled Services, the developers of litigation support and e-discovery software ReadySuite, announces the software is now incorporating BeyondRecognition’s technology to improve the mass redaction feature within both PDFs and TIFFs.
The new feature, which is now available to download in ReadySuite, is called BeyondRedaction.
BeyondRedaction in ReadySuite works by reading the word dictionary created by BeyondRecognition’s Global Glyph Catalog and applying positional logic to seek out each user-designated word or pattern as it appears on every page of every document in the review set. Redactions can be specified on a per-word basis or based on a pattern, such as social security and credit card numbers, using custom regular expressions. Further, a redaction log is automatically created logging the reason for the redaction, if specified, which can be saved into a format compatible with Microsoft Excel.
“This new redaction feature in ReadySuite will save users hours of manual redaction time and will increase the accuracy of redactions,” said Justin Blessing, Director of Product Development at Compiled Services. “By reading from the Global Glyph Catalog in BeyondRecognition, and given its 99.5%+ word accuracy, ReadySuite is able to mass redact TIFF and PDF files based on words and patterns specified by the user.”
Users can expect to process documents at a rate of 150,000 pages per hour in TIFF format, and 30,000 pages per hour for PDF documents. The text elements associated with the image redactions are also removed either from the associated text file (in TIFF) or from the text layer of the PDF document. This methodology assures the user that no redacted information can “leak” out and completely obviates the need to re-OCR redacted documents.
ReadySuite, developed by Compiled Services, is a bundled suite of specialized litigation software for handling various document related tasks. Included in the suite are modules for reviewing, validating, merging, and manipulating load files, converting among multiple image formats such as TIF and PDF, applying or removing endorsements from image sets, generating searchable text and PDF files using OCR, and batch printing image sets to high capacity printers.
About Compiled Services
Since 2008, Compiled Services (compiledservices.com) has created specialized software tools to aid in meeting tight litigation deadlines while adhering to strict quality standards. ReadySuite, the bundled suite of these tools, gives tech-savvy e-discovery experts the ability to perform a number of quality control, load file manipulation and image processing steps to data involved in litigation or regulatory matters.
BeyondRecognition (BeyondRecognition.net) has developed unique character, word, and document attribute recognition and extraction capabilities for analyzing image-based documents. Its glyph clustering and cataloging approach enables rapid, globally-editable text recognition with accuracy rates far beyond traditional OCR. BeyondRecognition also clusters documents based on visual similarity and permits location-based, cluster-specific data element extraction for coding or abstracting data elements from the documents. Clustering by document type permits prioritized data element extraction using the powerful graphical user interface to highlight zones, and to write and instantly test and verify extraction rules.
Although nominally a “startup,” the principal technologists at BeyondRecognition have been working in the fields of document conversion, electronic evidence forensics and processing for decades. CEO John Martin was previously a founder of Cricket Technologies and RedFile LLC.