PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 02-20-2004, 04:46 PM   #1
philippeguerind
Green Mole
 
Join Date: Feb 2004
Location: FRANCE
Posts: 5
indexing pdf

Hi from France,
You will excuse my english. I can't make phpdig indexing pdf files.
I put the following lines into the config.php file.

define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','./pdftotext');
define('PHPDIG_OPTION_PDF','');
define('PHPDIG_PDF_EXTENSION','.txt');

as pdftotext.exe is located at the root. The indexing works perfectly with html files even with ASCII files but not with pdf files. My web site is located on Lycos server.
I uploaded pdftotext.exe at the root, then set permissions to 755. When I run phpdir from the administration panel asking to dig a pdf file indicating the full path, I get a green sign in front indicating the file is indexed. When I search for any word inside the pdf file I get no record.
What could I try? I have been looking at this Forum for weeks before posting. Now I have no more ideas.
Thanks for helping a novice.
Philippe.
philippeguerind is offline   Reply With Quote
Old 02-20-2004, 05:04 PM   #2
tomas
Orange Mole
 
Join Date: Feb 2004
Posts: 47
hi philippe,

is your server running on windows or unix/linux ?

tomas
tomas is offline   Reply With Quote
Old 02-20-2004, 05:21 PM   #3
philippeguerind
Green Mole
 
Join Date: Feb 2004
Location: FRANCE
Posts: 5
Thank's. Lycos Servers are running Unix. I use 1.6.4 phpdig version.
Philippe
philippeguerind is offline   Reply With Quote
Old 02-20-2004, 05:27 PM   #4
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. Does Lycos allow commands such as exec to run on its servers?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 02-20-2004, 05:36 PM   #5
philippeguerind
Green Mole
 
Join Date: Feb 2004
Location: FRANCE
Posts: 5
Hi, I don't know. I just asked their support service by posting a thread. I'm waiting for the answer ...
Philippe
philippeguerind is offline   Reply With Quote
Old 02-20-2004, 05:49 PM   #6
philippeguerind
Green Mole
 
Join Date: Feb 2004
Location: FRANCE
Posts: 5
If running exec is not allowed, is there any wy I could run pdftotext onto my PC as a shell ?
Philippe
philippeguerind is offline   Reply With Quote
Old 02-20-2004, 06:38 PM   #7
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. Perhaps check at the following link for a version that would work with your PC:

http://www.foolabs.com/xpdf/download.html
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 02-20-2004, 06:56 PM   #8
tomas
Orange Mole
 
Join Date: Feb 2004
Posts: 47
hello,

philippe - try this setting:
define('PHPDIG_PDF_EXTENSION','');

run spider and take a look into text_content directory -
are there temp-files? are they empty?

after this test reset to:
define('PHPDIG_PDF_EXTENSION','.txt');

what is your servers php-version?

tomas
tomas is offline   Reply With Quote
Old 02-20-2004, 07:02 PM   #9
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
OT: Thanks tomas for helping!
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 02-20-2004, 07:24 PM   #10
tomas
Orange Mole
 
Join Date: Feb 2004
Posts: 47
hello again philippe,

in your first post you wrote "pdftotext.exe" -
it seems that you installed the dos-version on an unix-server???

the unix download is:
http://www.foolabs.com/xpdf/download.html
x86, Linux (glibc 2.2, staticly linked to Motif, t1lib, and FreeType 2):
xpdf-3.00-linux.tar.gz (4544077 bytes)

tomas

Last edited by tomas; 02-20-2004 at 07:28 PM.
tomas is offline   Reply With Quote
Old 02-21-2004, 06:58 AM   #11
philippeguerind
Green Mole
 
Join Date: Feb 2004
Location: FRANCE
Posts: 5
I wasn't using the unix version of pdftotext.
Now I do. is the line below is correct? as www is my root. It still doesn't work but I still go on ...

define('PHPDIG_PARSE_PDF','./usr/local/bin/pdftotext');
Philippe
philippeguerind is offline   Reply With Quote
Old 02-21-2004, 11:50 AM   #12
tomas
Orange Mole
 
Join Date: Feb 2004
Posts: 47
hi philippe,

i don't think so -
please try this:
1) upload: pdftotext binary into the same folder where phpdig is
2) set: 755 permissions for pdftotext and admin/temp
3) set: define('PHPDIG_PARSE_PDF','/path/to/pdftotext');
4) set: define('PHPDIG_PDF_EXTENSION','');

run spider and take a look into text_content directory -
are there temp-files? are they empty?

kind regards
tomas
tomas is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Indexing PDF dlaperle Troubleshooting 1 03-21-2007 08:00 PM
Problem with PDF indexing Phantom External Binaries 2 07-25-2005 03:26 AM
indexing pdf Hoek External Binaries 9 02-25-2004 03:42 AM
PDF indexing lelandv External Binaries 15 12-08-2003 05:23 PM
PDF indexing aryan External Binaries 11 11-27-2003 08:51 AM


All times are GMT -8. The time now is 12:57 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.