Read PDF Table

AutomatR.DefaultActivities.PDF.ReadPDFTable

The "Read PDF Table" activity in AutomatR is part of the PDF activities package, allowing you to extract tables from PDF documents. This activity simplifies the process of reading and processing tabular data from PDF files in automation workflows.

Properties

Name	Description
*Input*
Each Page Has Headers	When selected, each page containing the table has headers. Boolean variables to indicate whether each page has headers.
File Path	The path of the PDF from which you want to extract the table. String variables containing the file path.
Has Headers	When selected, the first row of the table is treated as a header and will not be copied. When not selected, the first row is copied. This option is selected by default. Boolean variables to determine whether the table has headers.
Header Filter	Header of the table you want to read. Use commas to separate multiple headers. String variables containing the header(s).
Header Orientation	Select the direction of headers in the table. Choose 'Horizontal' if columns are present in a row, and 'Vertical' if present in a column.
Index	Specify the index of the table you want to read. Integer variables containing the table index.
Table in Multiple Pages	When selected, the table is spread across multiple pages in the PDF. Boolean variables to indicate whether the table spans multiple pages.
*Misc*
Display Name	Provides a customizable name for the activity displayed in the workflow. The display name enhances clarity and organization within the automation project. String variable or argument containing the desired display name.
*Optional*
Delay	Specifies the wait time (in seconds) before starting the activity. This is useful for handling synchronization issues. Integer variables containing the delay duration (e.g., 5 for 5 seconds).
*Output*
Result	Returns the DataTable with specified Header Filter/Orientation, Index, Has Headers criteria in the given File Path. DataTable variables to store the extracted table.

How to use:

Drag and drop the "Read PDF Table" activity onto the workflow.
Configure the properties by specifying the file path, table index, header filter, and other relevant details.
Optionally, customize the delay duration.
Execute the workflow to extract the specified table from the PDF.

Example:

Consider an example where the "Read PDF Table" activity is used to extract a table from a PDF file:

Read PDF Table:
  Delay: 2
  File Path: "C:\Documents\Sample.pdf"
  Index: 1
  Header Filter: "Name, Age, Address"
  Has Headers: true
  Header Orientation: Horizontal
  Table in Multiple Pages: false
  Each Page Has Headers: false
  Result: extractedTable

In this example, the activity reads the table with the specified headers ("Name, Age, Address") from the PDF file located at "C:\Documents\Sample.pdf". The extracted table is stored in the DataTable variable "extractedTable" for further processing in the workflow.

Read PDF Table

Properties​

How to use:​

Example:​

Properties

How to use:

Example: