Python的文件操作

期盼死亡的小丑 发表于 2018-8-6 07:58:32

1、读写文件
　　

#!/usr/bin/env python　　
# -*- coding:utf-8 -*-
　　
# @Time : 2018/1/25 20:49
　　
# @Author: zhouyuyao
　　
# @File : demonWrite.py
　　
# PyCharm 2017.3.2 (Community Edition)
　　
# Build #PC-173.4127.16, built on December 19, 2017
　　
# JRE: 1.8.0_152-release-1024-b8 amd64
　　
# JVM: OpenJDK 64-Bit Server VM by JetBrains s.r.o
　　
# Windows 10 10.0
　　
# Python 3.6.1 (v3.6.1:69c0db5, Mar 21 2017, 18:41:36)
　　
# on win32
　　

　　
if __name__== "__main__":
　　filename = input("Please input the name of file:")
　　f = open(filename,"w") # 以写的形式打开一个文件
　　while 1:       # 1 的效率是最高的
　　context = input("Please input context('EOF' will close file): ")
　　if context == "EOF":
　　f.close()
　　break
　　else:
　　f.write(context)
　　f.write("\n")
　　

　　fRead = open(filename)
　　readContext = fRead.read()
　　print("------------start-------------")
　　print(readContext)
　　print("-------------end--------------")
　　fRead.close()
　　

　　

　　运行结果：
　　

Please input the name of file:z.log　　
Please input context('EOF' will close file): hello
　　
Please input context('EOF' will close file): the weather is cool
　　
Please input context('EOF' will close file): you have wear more clothes
　　
Please input context('EOF' will close file): EOF
　　
------------start-------------
　　
hello
　　
the weather is cool
　　
you have wear more clothes
　　

　　
-------------end--------------
　　

2、读取文件方法
　　

import codecs　　

　　
ENCODING = "utf-8"    # 字符集
　　
f = open("z.log",encoding=ENCODING)
　　
print(f.name)          # 文件名
　　
print(f.readline())    # 读取成列表的形式
　　
print(f.readlines()) # 读取成列表的形式
　　

　　
with codecs.open("z.log","r",encoding=ENCODING) as f:
　　print(f.read())
　　

　　

3、编码问题
　　编码：
　　支持中文的编码：utf-8，gbk，gb2312
　　decode解码
　　encode编码
　　在Python2中不定义代码的编码排头，在内容中出现中文时会报错。
　　Python默认将代码文件内容当做ASCII编码处理，但是ASCII编码不存在中文，因为则会抛出异常。
　　解决问题之道就是要让Python之道文件中使用的是什么编码形式，对于中文，可以用的常见编码有utf-8，gbk和gb2312等，只需在代码文件的最前端添加如下内容即可：
　　# -*- coding:utf-8 -*-
　　Python转码的过程：
　　原有编码 ——> Unicode编码 ——> 目的编码
　　python会自动将带中文的字符串解码成Unicode，然后再编码成gbk，因为解码是字典进行的，如果没有指明解码方式，就会使用sys,defaultencoding指明的方式来解码。
　　方法一：
　　s.decode("utf-8").encoding("gbk")

4、对文件进行排序
　　

#!/usr/bin/env python　　
# -*- coding:utf-8 -*-
　　
# @Time : 2018/1/25 23:06
　　
# @Author: zhouyuyao
　　
# @File : sortUIDPasswd.py
　　
# PyCharm 2017.3.2 (Community Edition)
　　
# Build #PC-173.4127.16, built on December 19, 2017
　　
# JRE: 1.8.0_152-release-1024-b8 amd64
　　
# JVM: OpenJDK 64-Bit Server VM by JetBrains s.r.o
　　
# Windows 10 10.0
　　
# Python 3.6.1 (v3.6.1:69c0db5, Mar 21 2017, 18:41:36)
　　
# on win32
　　
import codecs
　　

　　
file = "passwd"
　　
sortfile = "sortpasswd.txt"
　　
filecontext = []
　　
sortuid = []
　　

　　
with codecs.open(sortfile,"wb") as fsort:
　　with codecs.open(file,encoding="utf-8") as f:
　　filecontext += f.readlines()
　　for line in filecontext:
　　sortuid.append(int(line.split(":")))
　　sortuid.sort()
　　for uid in sortuid:
　　for line in filecontext:
　　if str(uid) == line.split(":"):
　　print(line)
　　fsort.write(line.encode("utf-8"))
　　

　　python3的新特性对文本和二进制数据作了更为清晰的区分，
　　文本总是Unicode，由str类型表示，
　　二进制则是由bytes类型表示
　　字符串可以encode编码成字节包，而字节包可以decode解码成字符串

页: [1]

运维网's Archiver

Python的文件操作