This will be problematic when working with pdf that has descriptions with spacing. No way to determine how to split
Example:
Original copied text from PDF:
Pdf read output (L4)
“FBC BUILDING SOCIETY 600.00 0.00 0.00 3.00 0.00 603.00”
Pdf read output (L5)
“FBC BUILDING SOCIETY 600.00 0.00 0.00 3.00 0.00 603.00”