Problem downloading HiRez images and dynamic links..

mwaynewalter
Posts: 2
Joined: 12 Aug 2013, 18:48

Problem downloading HiRez images and dynamic links..

Post by mwaynewalter » 12 Aug 2013, 19:32

Hi, EPF is awesome. It was useful with 2 catalog sites that my wife and I look to search as random slideshow.

Anyway, this site has a problem. We want the HiRez images, of course, rather than the thumbnail or low res.
If we put http://www.yandy.com as the starting address..

..then it finds the following page for example:
http://www.yandy.com/Plus-Size-Flirty-F ... ostume.php

but it never attempts to download the HiRez picture at:
http://images.yandy.com/HiRez/DG_8691X_ ... _G2013.jpg

However, if we create a new project and put the starting address to
directly at this sub page and download only from this page:
http://www.yandy.com/Flirty-Finish-Line ... ostume.php

Then ETF download the HiRez picture:
http://images.yandy.com/HiRez/DG_8691X_ ... _G2013.jpg

So why does it it never download the picture from the Entire Site setting
using the main site URL at http://www.yandy.com but does download
it it when setup for the specific page?

Maksym
Site Admin
Posts: 2083
Joined: 02 Mar 2009, 17:02

Re: Problem downloading HiRez images and dynamic links..

Post by Maksym » 13 Aug 2013, 11:17

Hi, EPF is awesome.
Thank you very much!

As for the website - everything works just fine. Even video is downloaded without any problems. Probable you turned the "Download files from external sites" box off in your project. Anyway, here are the settings:

Starting address: http://www.yandy.com
Target files: *.jp*, *.mp4 (remove *.mp4 if you don't want video)
Exploration: Regular site -> Entire site. [Download files from external sites] box checked.
Excluded URLs:
[Tthumb]
/css/

That's it. So please make sure you have the latest version of the program:

http://www.exisoftware.com/picture_find ... rSetup.exe

and then try the above settings. If it won't work for you - let me know and I'll create a ready-to-use EPF project file for you.

mwaynewalter
Posts: 2
Joined: 12 Aug 2013, 18:48

Re: Problem downloading HiRez images and dynamic links..

Post by mwaynewalter » 13 Aug 2013, 16:33

Thanks. Actually after posting this...I later discovered that it was mostly working. And my settings
now match yours.

A related question. On the same site and same example page there is only 1 HiRez image visible
but several more thumbnails for additional HiRez image views of the model in the garment.

EPF downloads the one visible but none of the others from the thumbnails. As a web developer myself,
it appears this is because the thumbnails don't have ordinary links to the HiRez but instead activate
a javascript that changes the visible images on the same page.

Is there any way to make EPF find images referenced in this way?

For now, I discovered that the thumbnail and HiRez images on that site have the same file name
but located in different directories "Thumbs" vs. "HiRez". So I switched EPF to only download the
thumbs and when it finishes, I'll delete any thumbs that we don't want and then make a script to
create a URL list of all the thumbs. Then modify the URLs to look in the HiRez directory and
feed the entire list into a new EPF project to download the HiRez directly.

So it's a lot of work..just wondering if there's an easier way to grab all the HiRez without so much
effort in the future.

Maksym
Site Admin
Posts: 2083
Joined: 02 Mar 2009, 17:02

Re: Problem downloading HiRez images and dynamic links..

Post by Maksym » 13 Aug 2013, 17:34

Currently there is no other way around this,sorry. But we do plan to add Regular Expression parser which will allow to automate all the steps you just described by changing

http://www.something.com/Thumbs/image_file_name.jpg

to

http://www.something.com/HiRez/image_file_name.jpg

during the page parsing.

utomo
Posts: 145
Joined: 05 Dec 2013, 10:14

Re: Problem downloading HiRez images and dynamic links..

Post by utomo » 05 Dec 2013, 15:54

Maxim wrote:Currently there is no other way around this,sorry. But we do plan to add Regular Expression parser which will allow to automate all the steps you just described by changing

http://www.something.com/Thumbs/image_file_name.jpg

to

http://www.something.com/HiRez/image_file_name.jpg

during the page parsing.
What is the status of this issue?
I hope elf can solve it

Maksym
Site Admin
Posts: 2083
Joined: 02 Mar 2009, 17:02

Re: Problem downloading HiRez images and dynamic links..

Post by Maksym » 05 Dec 2013, 16:03

Status: in progress... Requires a lot of time.

utomo
Posts: 145
Joined: 05 Dec 2013, 10:14

Re: Problem downloading HiRez images and dynamic links..

Post by utomo » 09 Dec 2013, 02:27

Thanks.
I hope it will be global solution, where we can change the thumb name to the image folder name whatever the name is

Maksym
Site Admin
Posts: 2083
Joined: 02 Mar 2009, 17:02

Re: Problem downloading HiRez images and dynamic links..

Post by Maksym » 12 Feb 2014, 15:13

OK, the feature is ready. Are you still interested?

nufon
Posts: 6
Joined: 10 Dec 2013, 10:50

Re: Problem downloading HiRez images and dynamic links..

Post by nufon » 19 Mar 2014, 09:40

I am,

As it seems the parser does not like to be used when there is a target file already available in the string to be used in a regX.

My little dirty test can hash it out for you as well. Below is a topical nested image source used to display a wonderful picture in thumbnail size. The actual picture is located in /resized/JBGj2xh9k3yfr.jpg. Since this is a valid picture the software seems to ignore any other directives after. I have attempted to "exclude" thumbs, but that stopped it from parsing it as well.

This is the most basic find and re-format url string for grabbing the picture, this is what I assumed the parser was for. I still continue to use this very useful product and thank you for all the work.

<div class="toprandthumb" style="background-image: url(thumbs/JBGj2xh9k3yfr.jpg); background-position: center; background-repeat: no-repeat;"><a href="index.php?id=105684"><img src="images/spacer.gif" alt="" border="0" height="75" width="75"></a></div>

Maksym
Site Admin
Posts: 2083
Joined: 02 Mar 2009, 17:02

Re: Problem downloading HiRez images and dynamic links..

Post by Maksym » 19 Mar 2014, 14:04

Not a problem. First of all remove 'thumb' from Excluded URLs. Then create a custom parser with the following Reg Ex:

url\(thumbs/([^)]+)

and the following address line:

/resized/[#1]

Post Reply