使用 python 从 Internet 下载编号文件夹中的编号图像文件

Question

问题

http://www.fdci.org/imagelibrary/EventCollection/1980/Big/IMG_2524.jpg

我有一个这种类型的 link，其中 1980 是正在更改的初始文件夹，其次是 IMG_2524[=32 格式的图像文件名=].jpg 正在改变。

我想做的是从这些 url 下载所有图像，方法是在文件夹的情况下在 1900-2000 的范围内迭代和更改这些数字，IMG_2000.jpg 到 IMG_4000.jpg在文件名的情况下。下载的文件必须保存在其所在的文件夹编号内。

我认为 for 循环应该是一个选项，但作为新手我有点迷茫。请帮忙谢谢。

更新

text_file = open('Output.txt', 'w') for i in xrange(1900,2001): for j in xrange(2000, 4001): year = str(i) image = str(j) new_link = 'http://www.fdci.org/imagelibrary/EventCollection/'+year+'/Big/IMG_'+image+'.jpg' text_file.write(new_link) text_file.close()

感谢 anmol

Answer 1

实际上你需要两个 for 循环，嵌套 for 循环所以现在我们有 2000 - 4000 给定年份范围 1900 - 2001

的所有图像

for i in xrange(1900,2001):
    for j in xrange(2000, 4001):
        year = str(i)
        image = str(j)
        new_link = 'http://www.fdci.org/imagelibrary/EventCollection/'+year+'/Big/IMG_'+image+'.jpg'
        print new_link
        #Now you will get the possible links within the given ranges,
        #then you can use urllib2 to fetch the response from the link 
        # and do whatever you wanna do

示例输出：

http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_3994.jpg
http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_3995.jpg
http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_3996.jpg
http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_3997.jpg
http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_3998.jpg
http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_3999.jpg
http://www.fdci.org/imagelibrary/EventCollection/2000/Big/IMG_4000.jpg

使用 python 从 Internet 下载编号文件夹中的编号图像文件

Download numbered image files in numbered folders from internet using python

python

loops

image

http