Hi, EPF is awesome. It was useful with 2 catalog sites that my wife and I look to search as random slideshow.
Anyway, this site has a problem. We want the HiRez images, of course, rather than the thumbnail or low res.
If we put http://www.yandy.com as the starting address..
..then it finds the following page for example:
http://www.yandy.com/Plus-Size-Flirty-F ... ostume.php
but it never attempts to download the HiRez picture at:
http://images.yandy.com/HiRez/DG_8691X_ ... _G2013.jpg
However, if we create a new project and put the starting address to
directly at this sub page and download only from this page:
http://www.yandy.com/Flirty-Finish-Line ... ostume.php
Then ETF download the HiRez picture:
http://images.yandy.com/HiRez/DG_8691X_ ... _G2013.jpg
So why does it it never download the picture from the Entire Site setting
using the main site URL at http://www.yandy.com but does download
it it when setup for the specific page?
Problem downloading HiRez images and dynamic links..
-
- Posts: 2
- Joined: 12 Aug 2013, 18:48
-
- Site Admin
- Posts: 2432
- Joined: 02 Mar 2009, 17:02
Re: Problem downloading HiRez images and dynamic links..
Thank you very much!Hi, EPF is awesome.
As for the website - everything works just fine. Even video is downloaded without any problems. Probable you turned the "Download files from external sites" box off in your project. Anyway, here are the settings:
Starting address: http://www.yandy.com
Target files: *.jp*, *.mp4 (remove *.mp4 if you don't want video)
Exploration: Regular site -> Entire site. [Download files from external sites] box checked.
Excluded URLs:
[Tthumb]
/css/
That's it. So please make sure you have the latest version of the program:
http://www.exisoftware.com/picture_find ... rSetup.exe
and then try the above settings. If it won't work for you - let me know and I'll create a ready-to-use EPF project file for you.
-
- Posts: 2
- Joined: 12 Aug 2013, 18:48
Re: Problem downloading HiRez images and dynamic links..
Thanks. Actually after posting this...I later discovered that it was mostly working. And my settings
now match yours.
A related question. On the same site and same example page there is only 1 HiRez image visible
but several more thumbnails for additional HiRez image views of the model in the garment.
EPF downloads the one visible but none of the others from the thumbnails. As a web developer myself,
it appears this is because the thumbnails don't have ordinary links to the HiRez but instead activate
a javascript that changes the visible images on the same page.
Is there any way to make EPF find images referenced in this way?
For now, I discovered that the thumbnail and HiRez images on that site have the same file name
but located in different directories "Thumbs" vs. "HiRez". So I switched EPF to only download the
thumbs and when it finishes, I'll delete any thumbs that we don't want and then make a script to
create a URL list of all the thumbs. Then modify the URLs to look in the HiRez directory and
feed the entire list into a new EPF project to download the HiRez directly.
So it's a lot of work..just wondering if there's an easier way to grab all the HiRez without so much
effort in the future.
now match yours.
A related question. On the same site and same example page there is only 1 HiRez image visible
but several more thumbnails for additional HiRez image views of the model in the garment.
EPF downloads the one visible but none of the others from the thumbnails. As a web developer myself,
it appears this is because the thumbnails don't have ordinary links to the HiRez but instead activate
a javascript that changes the visible images on the same page.
Is there any way to make EPF find images referenced in this way?
For now, I discovered that the thumbnail and HiRez images on that site have the same file name
but located in different directories "Thumbs" vs. "HiRez". So I switched EPF to only download the
thumbs and when it finishes, I'll delete any thumbs that we don't want and then make a script to
create a URL list of all the thumbs. Then modify the URLs to look in the HiRez directory and
feed the entire list into a new EPF project to download the HiRez directly.
So it's a lot of work..just wondering if there's an easier way to grab all the HiRez without so much
effort in the future.
-
- Site Admin
- Posts: 2432
- Joined: 02 Mar 2009, 17:02
Re: Problem downloading HiRez images and dynamic links..
Currently there is no other way around this,sorry. But we do plan to add Regular Expression parser which will allow to automate all the steps you just described by changing
http://www.something.com/Thumbs/image_file_name.jpg
to
http://www.something.com/HiRez/image_file_name.jpg
during the page parsing.
http://www.something.com/Thumbs/image_file_name.jpg
to
http://www.something.com/HiRez/image_file_name.jpg
during the page parsing.
-
- Posts: 145
- Joined: 05 Dec 2013, 10:14
Re: Problem downloading HiRez images and dynamic links..
What is the status of this issue?Maxim wrote:Currently there is no other way around this,sorry. But we do plan to add Regular Expression parser which will allow to automate all the steps you just described by changing
http://www.something.com/Thumbs/image_file_name.jpg
to
http://www.something.com/HiRez/image_file_name.jpg
during the page parsing.
I hope elf can solve it
-
- Site Admin
- Posts: 2432
- Joined: 02 Mar 2009, 17:02
Re: Problem downloading HiRez images and dynamic links..
Status: in progress... Requires a lot of time.
-
- Posts: 145
- Joined: 05 Dec 2013, 10:14
Re: Problem downloading HiRez images and dynamic links..
Thanks.
I hope it will be global solution, where we can change the thumb name to the image folder name whatever the name is
I hope it will be global solution, where we can change the thumb name to the image folder name whatever the name is
-
- Site Admin
- Posts: 2432
- Joined: 02 Mar 2009, 17:02
Re: Problem downloading HiRez images and dynamic links..
OK, the feature is ready. Are you still interested?
-
- Posts: 6
- Joined: 10 Dec 2013, 10:50
Re: Problem downloading HiRez images and dynamic links..
I am,
As it seems the parser does not like to be used when there is a target file already available in the string to be used in a regX.
My little dirty test can hash it out for you as well. Below is a topical nested image source used to display a wonderful picture in thumbnail size. The actual picture is located in /resized/JBGj2xh9k3yfr.jpg. Since this is a valid picture the software seems to ignore any other directives after. I have attempted to "exclude" thumbs, but that stopped it from parsing it as well.
This is the most basic find and re-format url string for grabbing the picture, this is what I assumed the parser was for. I still continue to use this very useful product and thank you for all the work.
<div class="toprandthumb" style="background-image: url(thumbs/JBGj2xh9k3yfr.jpg); background-position: center; background-repeat: no-repeat;"><a href="index.php?id=105684"><img src="images/spacer.gif" alt="" border="0" height="75" width="75"></a></div>
As it seems the parser does not like to be used when there is a target file already available in the string to be used in a regX.
My little dirty test can hash it out for you as well. Below is a topical nested image source used to display a wonderful picture in thumbnail size. The actual picture is located in /resized/JBGj2xh9k3yfr.jpg. Since this is a valid picture the software seems to ignore any other directives after. I have attempted to "exclude" thumbs, but that stopped it from parsing it as well.
This is the most basic find and re-format url string for grabbing the picture, this is what I assumed the parser was for. I still continue to use this very useful product and thank you for all the work.
<div class="toprandthumb" style="background-image: url(thumbs/JBGj2xh9k3yfr.jpg); background-position: center; background-repeat: no-repeat;"><a href="index.php?id=105684"><img src="images/spacer.gif" alt="" border="0" height="75" width="75"></a></div>
-
- Site Admin
- Posts: 2432
- Joined: 02 Mar 2009, 17:02
Re: Problem downloading HiRez images and dynamic links..
Not a problem. First of all remove 'thumb' from Excluded URLs. Then create a custom parser with the following Reg Ex:
url\(thumbs/([^)]+)
and the following address line:
/resized/[#1]
url\(thumbs/([^)]+)
and the following address line:
/resized/[#1]