Get Text from PDF

A special toolkit implemented in Wizlink® extracts a plain text from .pdf file and then parses it. A PDF Viewer shows retrieved text in a form of an array (one word in a cell). Numbers of columns and rows in the array start from 0.

Open PDF Viewer from Tools in main menu bar. Click the Load PDF option and choose the pdf file. Click Open. Wait patiently as it will take some time to parse the document.

You can click a cell in an array and PDF Viewer shows numbers of its row and column.

Put full path to the pdf file into Input part of Properties, declare variables for output part of Properties (where needed: NumberOfRows, PDF Content, Rows, RowsCounts).

Parameter nameParameter TypeIn/OutDescription
Alternative Scenario If FailedBooleanInputcheckbox; default is "on"
DisplayNameLiteralInputyou can change the name of activity for greater clarity of the scenario
File PathStringInputfull path to the file
NumberOfRowsInt32Outputnumber of all rows in a table
PDF ContentString[][]OutputArray of text values retrieved from apllication
ResultBooleanOutputreturns the Boolean value: it will return True if no error is in activity, otherwise it will return False
Result CodeInt32OutputResult Code will return 0 if there is no error in activity, otherwise in most cases it will return 1, the extended result code will show only in several chosen activities
RowsString[]Outputvar(number of row): content of all cells in a given row
RowsCountsInt32[]Outputvar(number of row): number of columns with the data in a given row