Please post all pre-sales questions about any of our products in this forum.

Edocman/OS pdf Indexer and indexing pdf files

  • Alan Henness
  • Topic Author
  • Premium Member
10 years 2 months ago #59469 by Alan Henness
Edocman/OS pdf Indexer and indexing pdf files was created by Alan Henness
Hi

I am looking for something that will index PDF files so they are searchable using Joomla's standard search. I have tried JiFile, but I'm running into problems on my demo website on demojoomla.com: fopen fails with an error about too many open files. The author of JiFile has been very helpful, but it's a limitation of the server and not of JiFile. The problem may not happen when the site is finished and transferred to a production server, but I don't know.

However, I came across EDocman (or maybe OS PDF Indexer) and it looks like this may be a better solution for me.

Can anyone tell me if I might get the same fopen problem as I had with JiFile?

Thanks!

Alan

Please Log in or Create an account to join the conversation.

10 years 2 months ago #59474 by Tuan Pham Ngoc
Hi Alan

To be honest, I don't know. How many files do you want to index? With EDocman, a file is only indexed when you upload the document, so files are indexed one by one and you should not have any issue.

With OS PDF Indexer, we index multiple files each time. However, the work is split into separate requests: after one file is indexed, the system waits a few seconds and then makes a new request to index the next file, so it should work well, too.
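
The one-file-at-a-time idea described above can be sketched roughly as follows. This is only an illustration in Python, not OS PDF Indexer's actual code (the extension is a Joomla component and handles each file in its own request rather than in one loop). The names `index_sequentially`, `index_file`, `pause_seconds` and `sleep` are all hypothetical.

```python
import time
from typing import Callable, Iterable, List


def index_sequentially(
    paths: Iterable[str],
    index_file: Callable[[bytes], None],
    pause_seconds: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Index files one at a time, pausing between consecutive files.

    Hypothetical helper for illustration; `index_file` stands in for
    whatever extracts and stores the text of a single document.
    """
    indexed: List[str] = []
    for position, path in enumerate(paths):
        if position > 0:
            # Give the server a short rest before the next file.
            sleep(pause_seconds)
        # The handle is closed when the with-block exits, so at most
        # one input file is open at any moment.
        with open(path, "rb") as handle:
            index_file(handle.read())
        indexed.append(path)
    return indexed
```

Because each file is opened, processed and closed before the next one starts, the number of simultaneously open input files stays at one regardless of how many documents are queued.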

No one can know in advance whether it will work well for you. You will have to try it. Just purchase it and try; if it doesn't work, ask us for a refund. It is as simple as that :)

Tuan

10 years 2 months ago #59512 by Alan Henness
Thanks for the quick reply, Tuan.

It's not the number of files I want to index that seemed to be the problem. There were only around 160, and even when I tried to index just a few, I got the same fopen error. What I was told was:

The fopen problem is a system problem. JiFile uses Zend Search Framework, which opens many files to write the index, and in many cases the system does not allow PHP to open that many files.


So it seems the problem is the way it opens numerous files to store the indexes, not the number of PDF files it's indexing.
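
For anyone wanting to see what this kind of limit looks like, here is a minimal sketch in Python (used purely for illustration; JiFile itself is PHP). It assumes a Unix-like server and reads the per-process open-file limit with the standard `resource` module. The helper names and the `reserved` margin are hypothetical, and the limit that PHP runs under on a shared host may differ from the one a shell or Python process sees.

```python
import resource


def open_file_limits():
    """Return the (soft, hard) per-process limits on open file descriptors."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def fits_within_limit(files_needed, soft_limit, reserved=16):
    """Rough check: can `files_needed` files be open at the same time?

    `reserved` is an assumed allowance for descriptors the process already
    holds (stdin/stdout, sockets, logs); the real figure varies.
    """
    if soft_limit == resource.RLIM_INFINITY:
        return True  # no limit configured
    return files_needed + reserved <= soft_limit


if __name__ == "__main__":
    soft, hard = open_file_limits()
    print("soft limit:", soft, "hard limit:", hard)
    print("room for 200 open index files:", fits_within_limit(200, soft))
```

An indexer that keeps many index segment files open at once can hit the soft limit even when only a handful of PDFs are being indexed, which matches what was seen here.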

I already have the files uploaded, so I'll buy OS PDF Indexer and see if that works - I'll let you know later today!

Thanks again.

Alan

10 years 2 months ago #59529 by Tuan Pham Ngoc
OK Alan. As OS PDF Indexer doesn't need to use Zend Search Framework, I think it won't consume many server resources and it should work well.

But you will need to try it first to find out.

Regards,

Tuan

10 years 2 months ago #59533 by Alan Henness
Tuan

Bought it, installed it and indexed my files. It seemed to work OK and the document management page shows all the files, but when I search for something in one of the PDFs, it returns no results.

If I click on one in the document management page, the doc content is:

/home/j181f101/public_html/components/com_docindexer/lib/binaries/linux/pdftotext: /lib64/libc.so.6: version `GLIBC_2.11' not found (required by /home/j181f101/public_html/components/com_docindexer/lib/binaries/linux/pdftotext)


Is something missing? Do I need to do something else?

Thanks.

Alan

10 years 2 months ago #59534 by Alan Henness
Ah, wait. The plugin wasn't published... trying again!

10 years 2 months ago #59544 by Alan Henness
Still getting the same error!

/home/j181f101/public_html/components/com_docindexer/lib/binaries/linux/pdftotext: /lib64/libc.so.6: version `GLIBC_2.11' not found (required by /home/j181f101/public_html/components/com_docindexer/lib/binaries/linux/pdftotext)


Any idea what is wrong?

Thanks.

Alan

10 years 2 months ago #59551 by Tuan Pham Ngoc
Hi Alan

It seems the library is not compatible with the OS your server is using. But we have a workaround: an older version of the library that works.
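
The error text itself says which C library version the bundled pdftotext binary wants. As a rough illustration (Python, hypothetical helper names, not part of OS PDF Indexer), the required version can be pulled out of such a message and compared with the version a server provides:

```python
import re
from typing import List, Tuple

# Matches version tags such as GLIBC_2.11 inside a loader error message.
_GLIBC_TAG = re.compile(r"GLIBC_(\d+(?:\.\d+)+)")


def required_glibc_versions(message: str) -> List[Tuple[int, ...]]:
    """Return the distinct glibc versions named in `message`, sorted ascending."""
    found = {
        tuple(int(part) for part in version.split("."))
        for version in _GLIBC_TAG.findall(message)
    }
    return sorted(found)


def is_satisfied(required: Tuple[int, ...], installed: Tuple[int, ...]) -> bool:
    """True if the installed glibc is at least the required version."""
    return installed >= required
```

Applied to the message in this thread, `required_glibc_versions` yields `[(2, 11)]`, so a server whose glibc is older than 2.11 cannot run that binary. On a Linux machine with Python available, `platform.libc_ver()` is one way to see the installed version, although it may return empty strings on systems that do not use glibc.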

So please submit a support ticket with a super admin account and an FTP account for your site, so that we can install it, check permissions and so on, and make sure it works for you.

Tuan

10 years 2 months ago #59584 by Alan Henness
Tuan

Not sure I can. I'm using Siteground's free demo webspace at demojoomla.com/ and I suspect I won't have FTP access. I've had a look, but I can't see much information on what features they are offering.

This is just a temporary website anyway - I will move it to production webspace when I've finished designing it. What I may do is move it to some of my own spare webspace in the meantime and check that the Indexer works there (which I'm sure it will).

I'll let you know, but thanks for your help.

Alan

10 years 2 months ago #59601 by Alan Henness
Tuan

I asked Siteground about the error and they replied:

The libraries that are used for the creation of the plugin are not supported by the OS (Operating System). The libc.so library that we use is not compatible with the plugin, and this is why you will not be able to use this plugin.

Unfortunately we will not be able to update or perform any custom modification to the server as this may cause issues with all the other users on the server that use the current version of this library.

I've also discovered I have full cPanel access, so I'll create an FTP account and raise a support ticket with the details and login credentials.

Moderators: Tuan Pham Ngoc, Giang Dinh Truong, Mr. Dam