Configuration problem! I don't know how to download.

nnina084
Posts: 15
Joined: 25 Feb 2017, 04:52

Post by nnina084 » 07 Feb 2020, 19:28

Configuration problem! I don't know how to download. Please help.

I have a list of links.
I want to download each link as a single HTML page (like Ctrl+S in a browser).
I don't want to download the entire site!

E.g. 1 link = 1 HTML file

https://www.example.com/example/example&s=0
https://www.example.com/example/example&s=50
https://www.example.com/example/example&s=100
https://www.example.com/example/example&s=150
https://www.example.com/example/example&s=200
https://www.example.com/example/example&s=250



How do I import a list of URLs, configure the project, and start downloading? I am a regular user and need a tutorial.

Maxim
Site Admin
Posts: 1246
Joined: 02 Mar 2009, 17:02

Post by Maxim » 08 Feb 2020, 17:05

Do you want to save the HTML pages or images from those pages? Ctrl + S in browser saves everything, including the HTML page, all related JavaScript and CSS files, all images and media files. What do you need to save exactly?

Post by nnina084 » 09 Feb 2020, 22:11

I added the links via Create New Project -> Starting address, one URL per line, but it downloads the entire site. That's wrong!

Oh, I just want to download a single page.


Plain HTML only, without CSS, JavaScript, images, etc.


For example, in Firefox: Ctrl+S, then select "Web Page, HTML only" :)
So how do I download pages that way, and how do I add or import a list of links?


With "Current page only" selected I get this message:
Your project finished but zero target files were saved. Please make sure you specified the correct target files and starting addresses for your project by clicking [ View / modify project settings ] button. Also it's a good idea to see the download log (click the [ View project download log ] button) to find out what addresses were crawled by Extreme Picture Finder and with what result.

Post by Maxim » 10 Feb 2020, 18:16

OK, I see.

You can import addresses from a file or paste them into the "Starting address(es)" field when you click the [ New project... ] button. There is an [ Import... ] button under this field which lets you select a text file to import URLs from.
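If the addresses follow a regular pattern, the text file for [ Import... ] can be generated rather than typed. The following is only a minimal sketch: it assumes the file should hold one URL per line, and that the pages advance in steps of 50 as in the example list at the top of the thread. The names `paged_urls`, `write_url_list` and `urls.txt` are made up for illustration.

```python
from pathlib import Path


def paged_urls(base, last_offset, step=50):
    """Build base&s=0, base&s=50, ... up to and including last_offset."""
    return [f"{base}&s={offset}" for offset in range(0, last_offset + 1, step)]


def write_url_list(urls, path):
    """Write the URLs to a text file, one per line (assumed import format)."""
    Path(path).write_text("\n".join(urls) + "\n", encoding="utf-8")


if __name__ == "__main__":
    # Reproduces the six example links from the first post.
    urls = paged_urls("https://www.example.com/example/example", 250)
    write_url_list(urls, "urls.txt")
```

The resulting `urls.txt` can then be selected with the [ Import... ] button.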

Then click the [ Next > ] button, leave the "Configure project settings manually" option selected, and click [ Next > ] again.

On the [ Target files ] step, clear the field that contains the file extensions and enter a Windows file mask for the files you want to save. If you want pages you may enter this:

*.html, *.htm, *.

The last one is for files without an extension, which seems to be your case. If you want me to give you an exact file mask, give me the exact URL. Then click the [ Next > ] button.
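To see why the extensionless mask is needed for addresses like the ones in this thread, here is a small Python sketch that looks at the last path segment of a URL and reports its extension. It only illustrates the idea; it is not how Extreme Picture Finder matches masks internally, and the helper names `url_extension` and `wanted` are hypothetical.

```python
from posixpath import splitext
from urllib.parse import urlsplit


def url_extension(url):
    """Return the extension of the last path segment, or "" if it has none."""
    last_segment = urlsplit(url).path.rsplit("/", 1)[-1]
    return splitext(last_segment)[1].lower()


def wanted(url, extensions=(".html", ".htm", "")):
    """True if the URL looks like a page: .html, .htm, or no extension."""
    return url_extension(url) in extensions


if __name__ == "__main__":
    for url in (
        "https://www.example.com/example/example&s=50",  # no extension
        "https://proza.ru/avtor/anastas2",               # no extension
        "https://www.example.com/pics/photo.jpg",        # .jpg, not a page
    ):
        print(url, repr(url_extension(url)), wanted(url))
```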

On the "Website exploration type" step, select the [ Current page only ] option and clear the [ Download files from external sites ] box.

That's it. So basically, all you had to do was click the [ Next > ] button in the [ Create New Project ] window several times to explore the possible options. Even more options are available in the [ Project properties ] window.
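For comparison, the "1 link = 1 HTML file" goal can also be sketched outside the program in a few lines of Python. This is only an illustration under assumptions, not a replacement for the project settings above: it saves the raw HTML of each address and nothing else, and the file naming rule (path turned into `avtor_anastas2`-style names) is a guess. The functions `filename_for`, `fetch` and `save_pages` are hypothetical names.

```python
import re
from pathlib import Path
from urllib.parse import urlsplit
from urllib.request import urlopen


def filename_for(url):
    """Turn a URL into a flat file name, e.g. /avtor/anastas2 -> avtor_anastas2.html."""
    parts = urlsplit(url)
    raw = parts.path.strip("/")
    if parts.query:
        raw += "_" + parts.query
    # Replace every run of characters that is unsafe in file names with "_".
    name = re.sub(r"[^A-Za-z0-9]+", "_", raw).strip("_")
    return (name or "index") + ".html"


def fetch(url):
    """Download the raw bytes of one page (HTML only, no CSS, scripts or images)."""
    with urlopen(url, timeout=30) as response:
        return response.read()


def save_pages(urls, out_dir, fetch=fetch):
    """Save each URL as exactly one HTML file inside out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    for url in urls:
        target = out / filename_for(url)
        target.write_bytes(fetch(url))
        saved.append(target)
    return saved
```

Calling `save_pages(urls, "pages")` with a list of addresses would write one `.html` file per address into a `pages` folder.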

Post by nnina084 » 10 Feb 2020, 19:13

For example, I would like to download this page and all of its subpages matching &s=(\d+):

https://proza.ru/avtor/anastas2


Filename Pattern (URL name): avtor_anastas2

Another similar link:
https://www.proza.ru/avtor/lidiasin
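The subpage pattern above can be checked with a short Python sketch. It assumes that subpages are exactly the author address followed by `&s=` and a number, and it treats the bare author address as offset 0; both are assumptions drawn from the example links, and `subpage_matcher` and `offset_of` are hypothetical names.

```python
import re


def subpage_matcher(author_url):
    """Compile a pattern for the author page and its &s=<number> subpages."""
    return re.compile(re.escape(author_url) + r"(?:&s=(\d+))?$")


def offset_of(url, matcher):
    """Return the numeric offset of a matching URL, or None if it does not match."""
    match = matcher.match(url)
    if match is None:
        return None
    # The first page carries no &s= part; treat it as offset 0.
    return int(match.group(1) or 0)


if __name__ == "__main__":
    matcher = subpage_matcher("https://proza.ru/avtor/anastas2")
    for url in (
        "https://proza.ru/avtor/anastas2",
        "https://proza.ru/avtor/anastas2&s=50",
        "https://proza.ru/avtor/lidiasin",
    ):
        print(url, offset_of(url, matcher))
```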

Post by Maxim » 11 Feb 2020, 18:17

Here is a project:

proza.ru-avtor-anastas2

Right-click the link and use the [ Save link as... ] menu item to download the project, then open it with Extreme Picture Finder and start it there.

Post by nnina084 » 13 Feb 2020, 16:09

It does not work properly for this link: it does not recognize all of the subpages (multi-page listing).

Starting address:
https://proza.ru/avtor/pravonv

The subpages go up to 1350:
https://proza.ru/avtor/pravonv&s=1350

Post by Maxim » 14 Feb 2020, 11:16

Simply change the exploration limit to "Entire site" and it'll work.

Post by nnina084 » 14 Feb 2020, 15:43

How do I update the project? I want to add new links to the same project and also re-check the existing links to see which ones are missing.
I tried "Update", but it does not work. Do I need to create a separate project?

Post by Maxim » 14 Feb 2020, 15:53

Why would you use "Update"? "Update" is used when you want to check an existing project for new links without changing the Starting URL list. If you changed the Starting URLs, the only option that makes sense is "Restart".
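Before restarting with a changed Starting URL list, it may help to merge the old and new addresses and see which ones are still missing. The sketch below does that with plain Python lists. It assumes you already have a list of addresses that were downloaded (how to obtain it, for example from the download log, is not covered here), and the names `merge_url_lists` and `missing_urls` are hypothetical.

```python
def merge_url_lists(existing, new):
    """Combine two URL lists, keeping the original order and dropping duplicates."""
    cleaned = [url.strip() for url in [*existing, *new] if url.strip()]
    return list(dict.fromkeys(cleaned))


def missing_urls(wanted, downloaded):
    """Return the wanted URLs that are not in the downloaded list, in order."""
    done = {url.strip() for url in downloaded}
    return [url for url in wanted if url not in done]


if __name__ == "__main__":
    old = ["https://proza.ru/avtor/anastas2", "https://proza.ru/avtor/lidiasin"]
    new = ["https://proza.ru/avtor/lidiasin", "https://proza.ru/avtor/pravonv"]
    merged = merge_url_lists(old, new)
    print(merged)
    print(missing_urls(merged, downloaded=old))
```

The merged list is what would go into the "Starting address(es)" field before using "Restart".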
