Jump to content

Welcome to Geeks to Go - Register now for FREE

Geeks To Go is a helpful hub, where thousands of volunteer geeks quickly serve friendly answers and support. Check out the forums and get free advice from the experts. Register now to gain access to all of our features, it's FREE and only takes one minute. Once registered and logged in, you will be able to create topics, post replies to existing threads, give reputation to your fellow members, get your own private messenger, post status updates, manage your profile and so much more.

Create Account How it Works
Photo

Convert pdf to excel using adobe acrobat


  • Please log in to reply

#1
xcel pro

xcel pro

    New Member

  • Member
  • Pip
  • 2 posts
Hi all
I have tried many different ways to convert pdf to excel
the best seems to be to highlight the data in adobe acrobat usint the table/fomatted text tool >> right click >> save as ANSI.txt >> open with Excel

Hoever, will only let me highlight a singel page in the document at a time
My current document is 200 pages.

Does anyone know how to beat this limitation or create a batch process?

btw, I have tried these utilities
Ghostscript
Able2Extract
pdf2txt

They all failed at negative numbers formatted as such
( 200,000) The right parend is output to Col A
The remainder is output to col B

In addition, labels such as TJE 098789 Roth are output to three columns instead of 1

TIA
xcel pro
  • 0

Advertisements


#2
greyknight17

greyknight17

    Malware Expert

  • Visiting Consultant
  • 16,560 posts
Welcome to GTG.

See if this program will do the job for you.
  • 0

#3
xcel pro

xcel pro

    New Member

  • Topic Starter
  • Member
  • Pip
  • 2 posts
Thank Grey -

I forgot to add Amber to the list of pdf utilities I have tried
It outputs pretty much same as Able to Extract
I can probaly program around text
But negative values on the pdf as ( 200,000)
Are reading into excel as ( in col A and 200,000) in Col B
This of course just example
The right parend is all over the place

Do you know VBScripting?
Or have a resource?

One of my employees wrote a script that uses pdf2txt to convert the
pdf's to txt files Then extracts certain items from the txt file and outputs to
a tsv file.

Problems is it is not working 100% correctly and have lost contact with
the employee.

Output:

Unit Number || Label Descrip || Accnt Nmbr || Amount

xxxxxxxx || travel-airfare || 6542.0001 || 900.02

etc.

There are many lines of data in the file
I only need to output last line of each account at "Account Total"

The script works closely, but not perfectly
Sometimes some values are missed - randomly.

Report initiates as .pdf and is converted to .txt
All .txt reports are mereged into 1 report
The 1 report is then "stripped" based on the above parameters

Attachments:
mergefile.zip This is the source file. When I convert from pdf to txt I lose a little structure. I tried marking it up to strip with DataImport, but the columns move around a bit with each different profit center

FAR-Account_V3.txt this is the .vbs file, I simply renamed the extension
It will prompt for input path and file and output and file.

Thanks
-xcel

Attached Files


  • 0

#4
greyknight17

greyknight17

    Malware Expert

  • Visiting Consultant
  • 16,560 posts
I'm not a big VBS person, much better with VB instead. But I probably won't have the skills to do this myself.

You can, however, try our Programming Forum here and see if the helpers there can better assist you. If necessary give them these same attachments with the source and the output that you like. They should be able to create a vbs script file for you.
  • 0






Similar Topics

0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users

As Featured On:

Microsoft Yahoo BBC MSN PC Magazine Washington Post HP