程式扎記

Source From Here
Question
I need to find "yesterday's" date in the format MMDDYY in Python. For instance, today's date would be represented like this: 111009. I can easily do this for today, but I have trouble doing it automatically for "yesterday".

How-To
Check sample code below:

>>> from datetime import date, timedelta
>>> yesterday = date.today() - timedelta(1)
>>> print(yesterday.strftime('%m%d%y'))
110909

>>> import time
>>> print("Time stamp={}".format(int(time.mktime(yesterday.timetuple()))))
Time stamp=1532966400
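
As a small sketch, the same idea can be wrapped in a reusable helper. The function name `yesterday_mmddyy` and its optional `today` parameter are illustrative assumptions, not part of the original answer; passing the reference date explicitly makes the result reproducible:

```python
from datetime import date, timedelta


def yesterday_mmddyy(today=None):
    """Return the day before `today` formatted as MMDDYY."""
    if today is None:
        today = date.today()
    return (today - timedelta(days=1)).strftime('%m%d%y')


# Using the date from the question (November 10, 2009):
print(yesterday_mmddyy(date(2009, 11, 10)))  # 110909
# Month and year boundaries are handled by timedelta:
print(yesterday_mmddyy(date(2018, 1, 1)))    # 123117
```

Calling `yesterday_mmddyy()` with no argument falls back to `date.today()`, which uses the local system date.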

Supplement
* Python time, date, and timestamp handling

Source From Here
Question
Requests is a really nice library. I'd like to use it to download big files (>1GB). The problem is that it's not possible to keep the whole file in memory, so I need to read it in chunks. And this is a problem with the following code:

import requests

def DownloadFile(url):
    local_filename = url.split('/')[-1]
    r = requests.get(url)
    f = open(local_filename, 'wb')
    for chunk in r.iter_content(chunk_size=512 * 1024):
        if chunk: # filter out keep-alive new chunks
            f.write(chunk)
    f.close()
    return

For some reason it doesn't work this way: it still loads the whole response into memory before saving it to a file.

How-To
It's much easier if you use Response.raw and shutil.copyfileobj():

import requests  
import shutil  
  
def download_file(url):  
    local_filename = url.split('/')[-1]  
    r = requests.get(url, stream=True)  
    with open(local_filename, 'wb') as f:  
        shutil.copyfileobj(r.raw, f)  
  
    return local_filename  

This streams the file to disk without using excessive memory, and the code is simple. Alternatively, for a large file you can write the content piece by piece with iter_content() to avoid running out of memory:

def download_file(url):  
    local_filename = url.split('/')[-1]  
    # NOTE the stream=True parameter  
    r = requests.get(url, stream=True)  
    with open(local_filename, 'wb') as f:  
        for chunk in r.iter_content(chunk_size=1024):   
            if chunk: # filter out keep-alive new chunks  
                f.write(chunk)  
                #f.flush() commented by recommendation from J.F.Sebastian  
    return local_filename  

See http://docs.python-requests.org/en/latest/user/advanced/#body-content-workflow for further reference.
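
The chunked loop above can also be factored into a small helper, sketched below. The name `save_stream`, the 64 KiB default chunk size, and the byte counter are assumptions for illustration. The helper only relies on the `iter_content()` method that a `requests` response provides when requested with `stream=True`:

```python
def save_stream(response, path, chunk_size=64 * 1024):
    """Write a streamed response body to `path` chunk by chunk.

    `response` is expected to behave like a requests.Response obtained
    with stream=True, i.e. to provide iter_content(chunk_size=...).
    Returns the number of bytes written.
    """
    written = 0
    with open(path, 'wb') as f:
        for chunk in response.iter_content(chunk_size=chunk_size):
            if chunk:  # skip empty keep-alive chunks
                f.write(chunk)
                written += len(chunk)
    return written


# Intended usage with requests (not executed here; the URL is a placeholder):
#
#     import requests
#     with requests.get('http://example.com/big.iso', stream=True) as r:
#         r.raise_for_status()  # fail early on 4xx/5xx responses
#         save_stream(r, 'big.iso')
```

Using the response as a context manager releases the connection when the block exits, and `raise_for_status()` avoids saving an error page as if it were the file.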

Source From Here
Question
I am trying to post a request to log in to a website using the Requests module in Python, but it's not really working.

How-To
If the information you want is on the page you are directed to immediately after login, below is the simple code:

import requests

payload = {'inUserName': 'USERNAME/EMAIL', 'inUserPass': 'PASSWORD'}
url = 'http://www.locationary.com/home/index2.jsp'
r = requests.post(url, data=payload)
print(r.text)  # the page returned after the login request

If you need to make further requests after logging in, use a Session instance. Assuming your login attempt was successful, the cookie that identifies you is stored in the session and sent with each later request to authorize it:

import requests  
  
# Fill in your details here to be posted to the login form.  
payload = {  
    'inUserName': 'username',  
    'inUserPass': 'password'  
}  
  
# Use 'with' to ensure the session is closed after use.
with requests.Session() as s:
    p = s.post('LOGIN_URL', data=payload)
    # Print the HTML returned, or do something more intelligent, to check whether the login succeeded.
    print(p.text)

    # An authorised request.
    r = s.get('A protected web page url')
    print(r.text)
    # etc...

Source From Here
Question
To filter a dataframe (df) by a single column, if we consider data with males and females, we might write:

males = df[df['Gender'] == 'Male']

Question 1 - But what if the data spanned multiple years and I wanted to see only males for 2014? In other languages I might do something like:

if A = "Male" and if B = "2014" then   

(except I want to do this and get a subset of the original dataframe in a new dataframe object)

Question 2 - How do I do this in a loop and create a dataframe object for each unique combination of year and gender (i.e. a df for each of 2013-Male, 2013-Female, 2014-Male, and 2014-Female)?

How-To
Use the & operator, and don't forget to wrap each sub-condition in parentheses, because & binds more tightly than ==:

males = df[(df['Gender'] == 'Male') & (df['Year'] == 2014)]

To store your dataframes in a dict using a for loop:

dic = {}
for g in ['Male', 'Female']:
    dic[g] = {}
    for y in [2013, 2014]:
        # Store the DataFrames in a dict of dicts, keyed by gender and then year.
        dic[g][y] = df[(df['Gender'] == g) & (df['Year'] == y)]
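
As an alternative sketch, swapping the explicit loop for `groupby`, pandas can split the frame on both columns at once. The sample data and the `Score` column below are made up for illustration; only the `Gender` and `Year` column names come from the question:

```python
import pandas as pd

# Hypothetical sample data with the column names used above.
df = pd.DataFrame({
    'Gender': ['Male', 'Female', 'Male', 'Female', 'Male'],
    'Year':   [2013, 2013, 2014, 2014, 2014],
    'Score':  [1, 2, 3, 4, 5],
})

# One sub-DataFrame per (year, gender) combination present in the data.
frames = {key: group for key, group in df.groupby(['Year', 'Gender'])}

print(frames[(2014, 'Male')]['Score'].tolist())  # [3, 5]
```

Unlike the hard-coded loop, this only creates entries for combinations that actually occur in the data, and the keys are (year, gender) tuples rather than nested dict keys.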

Source From Here
Question
How can I output colored text to the terminal, in Python? What is the best Unicode symbol to represent a solid block?

How-To
One easy option is to use the cprint function from the termcolor package:

>>> from termcolor import cprint
>>> cprint('john', 'red')
john
>>> cprint('ken', 'green')
ken
>>> cprint('%s is %d years-old' % ('john', 37), 'blue')
john is 37 years-old

Or you can define a string that starts a color and a string that ends the color, then print your text with the start string at the front and the end string at the end:

CRED = '\033[91m'  
CEND = '\033[0m'  
print(CRED + "Error, does not compute!" + CEND)  

This produces the following in bash, in urxvt with a Zenburn-style color scheme:

Through experimentation, we can get more colors:

This way we can create a full color collection:

CEND      = '\33[0m'  
CBOLD     = '\33[1m'  
CITALIC   = '\33[3m'  
CURL      = '\33[4m'  
CBLINK    = '\33[5m'  
CBLINK2   = '\33[6m'  
CSELECTED = '\33[7m'  
  
CBLACK  = '\33[30m'  
CRED    = '\33[31m'  
CGREEN  = '\33[32m'  
CYELLOW = '\33[33m'  
CBLUE   = '\33[34m'  
CVIOLET = '\33[35m'  
CBEIGE  = '\33[36m'  
CWHITE  = '\33[37m'  
  
CBLACKBG  = '\33[40m'  
CREDBG    = '\33[41m'  
CGREENBG  = '\33[42m'  
CYELLOWBG = '\33[43m'  
CBLUEBG   = '\33[44m'  
CVIOLETBG = '\33[45m'  
CBEIGEBG  = '\33[46m'  
CWHITEBG  = '\33[47m'  
  
CGREY    = '\33[90m'  
CRED2    = '\33[91m'  
CGREEN2  = '\33[92m'  
CYELLOW2 = '\33[93m'  
CBLUE2   = '\33[94m'  
CVIOLET2 = '\33[95m'  
CBEIGE2  = '\33[96m'  
CWHITE2  = '\33[97m'  
  
CGREYBG    = '\33[100m'  
CREDBG2    = '\33[101m'  
CGREENBG2  = '\33[102m'  
CYELLOWBG2 = '\33[103m'  
CBLUEBG2   = '\33[104m'  
CVIOLETBG2 = '\33[105m'  
CBEIGEBG2  = '\33[106m'  
CWHITEBG2  = '\33[107m'  

Here is the code to generate the test:

x = 0  
for i in range(24):  
  colors = ""  
  for j in range(5):  
    code = str(x+j)  
    colors = colors + "\33[" + code + "m\\33[" + code + "m\033[0m "  
  print(colors)  
  x=x+5  
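
To avoid concatenating the start and end strings by hand each time, the pattern can be wrapped in a small helper. This is a minimal sketch: the function name `colorize` is an assumption, and how each code is rendered depends on the terminal and its color scheme:

```python
CEND = '\033[0m'


def colorize(text, code):
    """Wrap `text` in the ANSI escape sequence for `code`, followed by a reset."""
    return '\033[{}m{}{}'.format(code, text, CEND)


print(colorize('Error, does not compute!', 91))  # same code as CRED2 above
print(colorize('ok', 32))                        # same code as CGREEN above
```

Always appending the reset sequence keeps the color from leaking into whatever is printed next.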

Supplement
* Custom color schemes for URxvt and Bash

Tuesday, July 31, 2018

[ Python FAQ ] Formatting “yesterday's” date in python

Friday, July 27, 2018

[ Python FAQ ] requests - How to download large file in python?

[ Python FAQ ] requests - How to “log in” to a website?

Wednesday, July 25, 2018

[ Python FAQ ] Pandas - how do you filter pandas dataframes by multiple columns

Tuesday, July 24, 2018

[ Python FAQ ] Print in terminal with colors?

[ Git FAQ ] error: The following untracked working tree files would be overwritten by merge