Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

I want to continue a mirrored project, but HTTrack is rescanning all pages. Whats going on?

0
Posted

I want to continue a mirrored project, but HTTrack is rescanning all pages. Whats going on?

0

HTTrack has to (quickly) rescan all pages from the cache, without retransfering them, to rebuild the internal file structure. However, this process can take some time with huge sites with numerous links. Q: HTTrack window sometimes “disappears” at then end of a mirrored project. What’s going on? A: This is a known bug in the interface. It does NOT affect the quality of the mirror, however. We are still hunting it down, but this is a smart bug.. Questions concerning a mirror: Q: I want to mirror a Web site, but there are some files outside the domain, too. How to retrieve them? A: If you just want to retrieve files that can be reached through links, just activate the ‘get file near links’ option. But if you want to retrieve html pages too, you can both use wildcards or explicit addresses ; e.g. add www.someweb.com/* to accept all files and pages from www.someweb.com. Q: I have forgotten some URLs of files during a long mirror.. Should I redo all? A: No, if you have kept the ‘cache’ file

Related Questions

What is your question?

*Sadly, we had to bring back ads too. Hopefully more targeted.