Get Full Text
AutomatR.Nanonets.Activities.GetFullText
The "Get Full Text" activity in AutomatR is part of the Nanonets Full Text OCR activities package, allowing you to extract full text from images or PDF files using the Nanonets OCR model. This activity supports processing multiple files simultaneously and provides the extracted text as a list of FullOCRProperties
for further automation.
Properties
Name | Description |
---|---|
Input | |
File Path | Specifies the file paths (PDF or image files) from which to extract full text. Provide multiple file paths separated by commas. String variables containing the file paths. |
Misc | |
Display Name | Provides a customizable name for the activity displayed in the workflow. The display name enhances clarity and organization within the automation project. String variables containing the desired display name. |
Optional | |
Delay | Specifies the amount of time (in seconds) to wait before executing the "Get Full Text" activity. This can be useful for handling synchronization issues. Integer variables containing the delay duration. Ex.: If the amount of time is 1000 milliseconds or 1 sec, i.e., 1. |
Output | |
Result | Outputs the full text extracted from the input files in the form of a list of FullOCRProperties . Variables of type list to store the extracted text for each file. |
How to use:
- Drag and drop the "Get Full Text" activity onto the workflow.
- Configure the properties by specifying the file paths (PDF or image files) from which to extract full text.
- Optionally, configure the delay.
- Execute the workflow to extract full text from the specified files using the Nanonets OCR model.
Example: Consider an example where the "Get Full Text" activity is used to extract full text from two files, an image file ("document.jpg") and a PDF file ("report.pdf"):
Get Full Text:
File Path: "/path/to/document.jpg, /path/to/report.pdf"
Delay: 3
Result: extractedTextList
In this example, the activity extracts full text from the specified image and PDF files with a delay of 3 seconds. The extracted text is stored in the list variable "extractedTextList" for further handling in the workflow.