字體:  

如何在 linux下使用 wget 砍站(下載整個網站)

virage 發表於: 2013-7-17 15:11 來源: ADJ網路控股集團


使用wget砍站供離線瀏覽

範例:只抓 http://www.adj.com.tw 之下的資料

$ wget --recursive --no-clobber --page-requisites --html-extension --convert-links --restrict-file-names=windows http://www.adj.com.tw

常用參數說明:
    --recursive: download the entire Web site.
    --domains example.org: don’t follow links outside website.org.
    --no-parent: don’t follow links outside the directory manual/install/.
    --page-requisites: get all the elements that compose the page (images, CSS and so on).
    --html-extension: save files with the .html extension.
    --convert-links: convert links so that they work locally, off-line.
    --restrict-file-names=windows: modify filenames so that they will work in Windows as well.
    --no-clobber: don’t overwrite any existing files (used in case the download is interrupted and resumed).
    -e robots=off: ignore robots.txt