I followed the instructions above for a Linux system, and still am having trouble uploading PDFs larger than about 1Mb.
Both Apache and PHP are set up to accept uploads up to 25 Mb.
I changed the use of pstotext to pdftotext but still, after I "send for publishing" the article containing the PDF dile, either the browser says "done" on the submitting screen itself, i.e., I don't get to the next screen, and the file is not uploaded, or I get a "A database transaction in Exponential failed...." message.
In order to see if the problem was related to the use of either pstotext or pdftotext, I timed the conversion of one of such problematic PDF files into text, and I show the results below. The PDF filesize is 1.291.379, which is not soooo big.
tmp# time pstotext senhora.pdf > senhora.txt
real 0m35.893s
user 0m33.070s
sys 0m0.920s
# time pdftotext senhora.pdf senhora.txt2
real 0m1.731s
user 0m1.450s
sys 0m0.040s
So as one can see, although pdftotext is really a lot faster than pstotext, this latter takes "only" 35.9 seconds to do its job. Pdftotext takes much less, 1.7s. So I ask, is this part of the uploading job really what's causing the system to lock? I don't think so.
Another clue is: when I go to the class and turn off the "Searchable" flag for the file attribute, everything goes very smooth... but then the file is not searchable!!!
And finally another hint is: Inspecting the database I see a very suspicious table size for table "ezsearch_object_word_link": 4,571,489 records / 817.5 MB. I guess this is the table that stores the indexed material, but at that size would it slow down indexing of new material?
I'd appreciate any help, as my system is heavily based on PDFs.
Roberto
Modified by Roberto Kirschbaum
on 17/11/2005 at 8:16:56 am
Still having trouble
Roberto Kirschbaum
Thursday 17 November 2005 12:04:28 am
I followed the instructions above for a Linux system, and still am having trouble uploading PDFs larger than about 1Mb.
Both Apache and PHP are set up to accept uploads up to 25 Mb.
I changed the use of pstotext to pdftotext but still, after I "send for publishing" the article containing the PDF dile, either the browser says "done" on the submitting screen itself, i.e., I don't get to the next screen, and the file is not uploaded, or I get a "A database transaction in Exponential failed...." message.
In order to see if the problem was related to the use of either pstotext or pdftotext, I timed the conversion of one of such problematic PDF files into text, and I show the results below. The PDF filesize is 1.291.379, which is not soooo big.
tmp# time pstotext senhora.pdf > senhora.txt
real 0m35.893s
user 0m33.070s
sys 0m0.920s
# time pdftotext senhora.pdf senhora.txt2
real 0m1.731s
user 0m1.450s
sys 0m0.040s
So as one can see, although pdftotext is really a lot faster than pstotext, this latter takes "only" 35.9 seconds to do its job. Pdftotext takes much less, 1.7s. So I ask, is this part of the uploading job really what's causing the system to lock? I don't think so.
Another clue is: when I go to the class and turn off the "Searchable" flag for the file attribute, everything goes very smooth... but then the file is not searchable!!!
And finally another hint is: Inspecting the database I see a very suspicious table size for table "ezsearch_object_word_link": 4,571,489 records / 817.5 MB. I guess this is the table that stores the indexed material, but at that size would it slow down indexing of new material?
I'd appreciate any help, as my system is heavily based on PDFs.
Roberto
Modified by Roberto Kirschbaum on 17/11/2005 at 8:16:56 am