Skip to main content

Get Full Text

AutomatR.Nanonets.Activities.GetFullText

The "Get Full Text" activity in AutomatR is part of the Nanonets Full Text OCR activities package, allowing you to extract full text from images or PDF files using the Nanonets OCR model. This activity supports processing multiple files simultaneously and provides the extracted text as a list of FullOCRProperties for further automation.

Properties

NameDescription
Input
File PathSpecifies the file paths (PDF or image files) from which to extract full text. Provide multiple file paths separated by commas. String variables containing the file paths.
Misc
Display NameProvides a customizable name for the activity displayed in the workflow. The display name enhances clarity and organization within the automation project. String variables containing the desired display name.
Optional
DelaySpecifies the amount of time (in seconds) to wait before executing the "Get Full Text" activity. This can be useful for handling synchronization issues. Integer variables containing the delay duration. Ex.: If the amount of time is 1000 milliseconds or 1 sec, i.e., 1.
Output
ResultOutputs the full text extracted from the input files in the form of a list of FullOCRProperties. Variables of type list to store the extracted text for each file.

How to use:

  1. Drag and drop the "Get Full Text" activity onto the workflow.
  2. Configure the properties by specifying the file paths (PDF or image files) from which to extract full text.
  3. Optionally, configure the delay.
  4. Execute the workflow to extract full text from the specified files using the Nanonets OCR model.

Example: Consider an example where the "Get Full Text" activity is used to extract full text from two files, an image file ("document.jpg") and a PDF file ("report.pdf"):

Get Full Text:
File Path: "/path/to/document.jpg, /path/to/report.pdf"
Delay: 3
Result: extractedTextList

In this example, the activity extracts full text from the specified image and PDF files with a delay of 3 seconds. The extracted text is stored in the list variable "extractedTextList" for further handling in the workflow.