Jump to content

gcahlik

Members
  • Posts

    7
  • Joined

  • Last visited

Everything posted by gcahlik

  1. Hi Jon, I'm starting to think this is some sort of IO issue, as you had asserted earlier -- but it's still strange none the less. I tried a few experiments -- these were conducted on our Mac Pro. First I selected 10,000 items to be exported, all of the same file format (png images). They were going to be exported into one folder, no directories. Here are the time stamps per 1,000 items processed. 1,000: 0:06 (6 seconds) 2,000: 0:17 (11 seconds) 3,000: 0:34 (17 seconds) 4,000: 1:01 (27 seconds) 5,000: 1:34 (32 seconds) 6,000: 2:18 (46 seconds) 7,000: 3:12 (54 seconds) 8,000: 4:05 (53 seconds) 9,000: 4:57 (42 seconds) 10,000: 5:50 (53 seconds) So the next experiment I had was to start exporting them in 5,000 item chunks. I had them originally exporting into individual directories. So chunk 1 went into Directory 1 and chunk 2 into Directory 2 and so forth. All exports were under 2 minutes. I then had them export into the same directory, here were the results, total time for each export: 1st 5k chunk: 1:47 2nd 5k chunk: 5:47 3rd 5k chunk: 8:24 4th 5k chunk: 12:10 So, for whatever reason, when I export them into the same directory, each progressive one takes longer and longer. Obviously this is arelatively small sample size, but it's still hard to wrap my head around it.
  2. Hi Jon, Yes, I am using version 2.0.1.1. I've been working at doing this all day today, exporting them in portions; It took about 4 hours to export exactly 50,000 JPEGs. The subsequent exports were cut down to 25,000 JPEGS and it has taken about 1 hr 15 minutes. I do not have any other programs running in the background and for all exports, it starts off relatively fast, processing around 1500 items a minute, but then it gets progressively slower and slower. George
  3. I killed off the WMF export; there were still about 12,000 items remaining after 10 hours of exporting and at this point, it was only exporting one file every 10-15 seconds. I deleted the exported files and went back and started over -- this time, I further broke it down and exported only 10,000 of the 70,000 wmf files and it completed the export in 10 minutes. It makes me feel like this is some sort of memory leak.
  4. Hi Jon, I've run it on an Mac Pro, Xeon E5-2697 w/ 64 GB RAM and I've run it concurrently with a i7 2860 with 16 GB of ram. All of the sources are on stored on the internal SATA drive, no external drives. I'm having problems with both platforms but at the moment I have it on my i7; it's having problems with just one source -- it's about 80 GB with around 1.6M items. The cases were created by another office, I have just copied them to the internal drives of my review computers. What I ended up trying is exporting results by file type facet -- exporting them separately (ie. all the communications in a batch, all of the documents) -- the hold up are the images. I had no problem processing the ~400,000 documents and communications -- it completed this task in under 6 hours, but with the images it is choking. I further began breaking it down by image type. I exported ~70,000 gifs with in about 4 hours, but I went to do 60,000 wmf files and it has been cranking for 10 hours now and it's just 75% done and it states that it has 4 hours left, but I know that's in accurate because that's what it said 3 hours ago. I haven't even touched the JPGs and PNGs, which account for about 500,000 items total. I have the Mac Pro at another location exporting the dataset as a whole and last check (Tuesday 8/1), after 10 days of countinuous operation, it's still less than 67% complete -- the progress meter says 7 more days, but like I said, that's inaccurate. George
  5. Thanks Jon -- Do you have any suggestions to speed up the process? I have several cases, ~80 GB that I would need to have exported and it's taking an impossibly long time. Initial estimates were at 3 hours, now after 2 days of straight running, it appears to be less than 25% completed with 8 days remaining. I thought it might be a problem with images and media; having to deal with compression and encoding, so I've tried to limit it to just documents and communications, but even that seems to be taking an incredible amount of time -- the original estimate was <1 hour for 225,000 items, now, after running for 3 hours, it is at about 55% completed with 3 more hours estimated in the export. It did not take nearly this long to index these, why is it taking so long to export? George
  6. Hi All, Hoping to get some advice -- I have a case with one source (~80GB, >150,000 items) loaded into Intella and I have tagged a large number of items to be reviewed by another person at a different site. Is there a way to export this material (ONLY the tagged material) as a separate Intella case? I've tried exporting the files in their original format but it is taking a significant amount of time and I would like to avoid having to re-index all the material anyway. The material that is not tagged is privileged information that must be redacted, so it is important that it is not included with the source material being reviewed at the other site. What is the best way to handle this?
  7. I've tried to install the mbox splitter from the zip file -- I have unzipped the files and editted the batch file, however when I run the batch file, I get a long error string: "Exception in thread "main" java.lang.UnsupportClassVersionError: GmailMboxSplitter : Unsupported major.mino version 51.0" I have the latest version of Java currently installed on this machine (Version 8, Update 66) -- has been restarted several times. Please advise.
×
×
  • Create New...