python

关注公众号 jb51net

关闭
首页 > 脚本专栏 > python > 豆瓣小站mp3下载

python使用urllib模块开发的多线程豆瓣小站mp3下载器

作者:

对豆瓣音乐小站页面html分析出所有mp3(正则匹配)文件url,然后用urllib.urlretrieve中方法直接将文件下载到本地,通过多线程技术下载

复制代码 代码如下:

#! /usr/bin/python2.7
# -- coding:utf-8 --

import os, urllib,urllib2, thread,threading
import re

#匹配音乐url
reg=re.compile('{"name":"(.+?)".+?"rawUrl":"(.+?)",.+?}', re.I)


class downloader(threading.Thread):
        def __init__(self, url, name):
                threading.Thread.__init__(self)
                self.url=url
                self.name=name

        def run(self):
                print 'downloading from %s' % self.url
                urllib.urlretrieve(self.url, self.name)

threads=[]

#多线程下载文件
def main(url):
        response=urllib.urlopen(url)
        text=response.read()
        groups=re.finditer(reg, text)
        for g in groups:
                name=g.group(1).strip() + ".mp3"
                path=g.group(2).replace('\\', '')
                t=downloader(path, name)
                threads.append(t)
                t.start()

                                                                                                                                                     
if __name__ == '__main__':
        main("http://site.douban.com/huazhou/")
        for t in threads:
                t.join()

您可能感兴趣的文章:
阅读全文