Use the full url of your site for the “Starting URL” option. The crawler will explore only the URLs within the starting directory, i.e. when starting URL is “http://www.example.com/path/index.html”, the “http://www.example.com/path/sub/page.html” will be indexed, but “http://www.example.com/other/index.html” will NOT.
“Save sitemap to” – is the filename in the “public_html/” folder of your website. This file should be writable by the script. To make sure it is, create this file and set its permissions to 0666.
It is recommended to use “Server’s repsonse” for “Last modification” field. In this case the entries for static pages will be filled with their real last modification time, while for dynamic pages the current time is used.
“Do not parse” input field contains file types, separated by space. These files will be added to the sitemap, but not fetched to save bandwidth, because they are not html files and have no embedded links. Please make sure these files are indexed by Google since there is no sense in adding them to sitemap otherwise!
“Do not parse URLs” works together with the option above to increase the speed of sitemap generation. If you are sure that some pages at your site do not contain the unique links to other pages, you can tell generator not to fetch them.
For instance, if your site has “view article” pages with urls like “viewarticle.php?..”, you may want to add them here, because most likely all links inside these pages are already listed at “higher level” (like the list of articles) documents as well:





Reply With Quote

Bookmarks