PYTHON字符串

(这个操作现在基本不用)

(使用format方法,也不怎么用)

    '''使用format方法进行格式化

    >>> print(“The number {0:,} in hex is: {0:#x}, the number {1} in oct is{1:#o}”.format(5555,55)) #使用位置进行格式化

    The number 5,555 in hex is: 0x15b3, the number 55 in oct is 0o67

    >>> print("The number {1:,} in hex is: {1:#x}, the number {0} in oct is {0:o}".format(5555,55))

    The number 55 in hex is: 0x37, the number 5555 in oct is 12663

    >>> print(“my name is {name}, my age is {age}, and my QQ is {qq}”.format(name=“Dong Fuguo”,age=40,qq=“30646****”)) #使用参数名字进行格式化

    my name is Dong Fuguo, my age is 40, and my QQ is 30646****

    >>> position = (5, 8, 13)
    >>> print(“X:{0[0]};Y:{0[1]};Z:{0[2]}”.format(position)) #使用序列解包

    X:5;Y:8;Z:13

    >>> '{0:<8d},{0:^8d},{0:>8d}'.format(65) #设置对齐方式
    '65        ,    65    ,       65

    >>> '{0:+<8d},{0:-^8d},{0:=>8d}'.format(65)

    '65++++++,---65---,======65'

从Python 3.6.x开始支持一种新的字符串格式化方式，官方叫做Formatted String Literals，简称f-字符串，在字符串前加字母f，含义与字符串对象 format()方法类似。在进行格式化时，使用前面定义的同名变量的值对格式化字符串中的占位符进行替换。

    width = 8
    >>> height = 6
    >>> print(f'Rectangle of {width}*{height}\nArea:{width*height}')
    
    Rectangle of 8*6
    Area:48

find()、rfind()、index()、rindex()、count()

find()和rfind方法分别用来查找一个字符串在另一个字符串指定范围（默认是整个字符串）中首次和最后一次出现的位置，如果不存在则返回-1；

index()和rindex()方法用来返回一个字符串在另一个字符串指定范围中首次和最后一次出现的位置，如果不存在则抛出异常；

count()方法用来返回一个字符串在当前字符串中出现的次数

    >>> s="apple,peach,banana,peach,pear"
    >>> s.find("peach")
    6

    >>> s.find("peach",7)
    19

    >>> s.find("peach",7,20)
    -1

    >>> s.rfind('p')
    25

    >>> s.index('p')
    1

    >>> s.index('pe')
    6
    
    >>> s.index('pear')
    25

    >>> s.index('ppp')
    Traceback (most recent calllast):  
        File "<pyshell#11>", line 1,in <module>
                s.index('ppp')
    ValueError: substring not found

    >>> s.count('p')
    5

    >>> s.count('pp')
    1

    >>> s.count('ppp')
    0

split()、rsplit()、partition()、rpartition()

split()和rsplit()方法分别用来以指定字符为分隔符，把当前字符串从左往右或从右往左分隔成多个字符串，并返回包含分隔结果的列表；(对于split()和rsplit()方法，如果不指定分隔符，则字符串中的任何空白符号（空格、换行符、制表符等）都将被认为是分隔符，并删除切分结果中的空字符串。)然而，明确传递参数指定split()使用的分隔符时，情况是不一样的，会保留切分

得到的空字符串。

partition()和rpartition()用来以指定字符串为分隔符将原字符串分隔为3部分，即分隔符前的字符串、分隔符字符串、分隔符后的字符串，如果指定的分隔符不在原字符串中，则返回原字符串和两个空字符串

    >>> s = "apple,peach,banana,pear"
    >>> s.split(",")
    ["apple", "peach", "banana", "pear"]

    >>> s.partition(',')
    ('apple', ',', 'peach,banana,pear')

    >>> s.rpartition(',')
    ('apple,peach,banana', ',', 'pear')

    >>> s.rpartition('banana')
    ('apple,peach,', 'banana', ',pear')

    >>> s = "2017-10-31"
    >>> t = s.split("-")
    >>> print(t)

    ['2017', '10', '31']

    >>> print(list(map(int, t)))
    [2017, 10, 31]

对于split()和rsplit()方法，如果不指定分隔符，则字符串中的任何空白符

    >>> s = 'hello world \n\n My name is Dong '
    >>> s.split()

    ['hello', 'world', 'My', 'name', 'is', 'Dong']

    >>> s = '\n\nhello world \n\n\n My name is Dong '
    >>> s.split()

    ['hello', 'world', 'My', 'name', 'is', 'Dong']

    >>> s = '\n\nhello\t\t world \n\n\n My name\t is Dong '
    >>> s.split()

    ['hello', 'world', 'My', 'name', 'is', 'Dong']

然而，明确传递参数指定split()使用的分隔符时，情况是不一样的，会保留切分得到的空字符串。

    >>> 'a,,,bb,,ccc'.split(',') #每个逗号都被作为独立的分隔符
    ['a', '', '', 'bb', '', 'ccc']

    >>> 'a\t\t\tbb\t\tccc'.split('\t') #每个制表符都被作为独立的分隔符
    ['a', '', '', 'bb', '', 'ccc']

    >>> 'a\t\t\tbb\t\tccc'.split() #连续多个制表符被作为一个分隔符
    ['a', 'bb', 'ccc']

split()和rsplit()方法还允许指定最大分割次数。

    >>> s = '\n\nhello\t\t world \n\n\n My name is Dong '
    >>> s.split(None, 1) #不指定分隔符，使用空白字符作为分隔符
    ['hello', 'world \n\n\n My name is Dong ']

    >>> s.rsplit(None, 1)
    ['\n\nhello\t\t world \n\n\n My name is', 'Dong']

    >>> s.split(None, 2)
    ['hello', 'world', 'My name is Dong ']

    >>> s.rsplit(None, 2)
    ['\n\nhello\t\t world \n\n\n My name', 'is', 'Dong']

    >>> s.split(maxsplit=6)    
    ['hello', 'world', 'My', 'name', 'is', 'Dong']

    >>> s.split(maxsplit=100) #最大分隔次数大于可分隔次数时无效
    ['hello', 'world', 'My', 'name', 'is', 'Dong']

字符串连接join()

用来将可迭代对象中多个字符串进行连接，并在相邻两个字符之间插入指定字符串。 (不推荐使用+运算符连接字符串，优先使用join()方法。时间大约相差十倍)

    >>> 'apple,peach,banana,pear'
    
    >>> '.'.join(li)
    'apple.peach.banana.pear'

    >>> '::'.join(li)
    'apple::peach::banana::pear'
    
    
    if __name__ == '__main__':

测一下效率差

    #重复运行次数
    times = 1000
    jointimer = timeit.Timer('use_join()', 'from __main__ import use_join')
    print('time for join:', jointimer.timeit(number=times))
    plustimer = timeit.Timer('use_plus()', 'from __main__ import use_plus')
    print('time for plus:', plustimer.timeit(number=times))

lower()、upper()、capitalize()、title()、swapcase()

    >>> s = "What is Your Name?"
    >>> s.lower() #返回小写字符串
    'what is your name?'

    >>> s.upper() #返回大写字符串
    'WHAT IS YOUR NAME?'

    >>> s.capitalize() #字符串首字符大写
    'What is your name?'

    >>> s.title() #每个单词的首字母大写
    'What Is Your Name?'

    >>> s.swapcase() #大小写互换
    'wHAT IS yOUR nAME?'

查找替换replace()，该方法用来替换指定字符或字符串的所有重复出现，每次只能替换一个字符或字符串。

    >>> words = ('测试', '非法', '暴力', '话')
    >>> text = '这句话里含有非法内容'
    >>> for word in words:

    if word in text:
    text = text.replace(word, '***')

    >>> text

    '这句***里含有***内容'

字符串对象的maketrans()方法用来生成字符映射表，而translate()方法用来根据映射表中定义的对应关系转换字符串并替换其中的字符，使用这两个方法的组合可以同时处理多个字符。

      #创建映射表，将字符"abcdef123"一一对应地转换为"uvwxyz@#$"

    >>> table = ''.maketrans('abcdef123', 'uvwxyz@#$')
    >>> s = "Python is a great programming language. I like it!"
    >>> s.translate(table) #按映射表进行替换

    'Python is u gryut progrumming lunguugy. I liky it!'
    
    
    >>> table = ''.maketrans('0123456789', '零一二三四伍陆柒捌玖')
    >>> ‘2022年3月22日'.translate(table)
    '二零二二年三月二二日'

strip()、rstrip()、lstrip()

用来剔除字符串两端、右侧或左侧的空白字符或指定字符。

    >>> s = " abc "
    >>> s.strip() #删除空白字符
    'abc'

    >>> '\n\nhello world \n\n'.strip() #删除空白字符
    'hello world'

    >>> "aaaassddf".strip("a") #删除指定字符
    'ssddf'

    >>> "aaaassddf".strip("af")
    'ssdd'

    >>> "aaaassddfaaa".rstrip("a") #删除字符串右端指定字符
    'aaaassddf'

    >>> "aaaassddfaaa".lstrip("a") #删除字符串左端指定字符
    'ssddfaaa'

这三个方法的参数指定的字符串并不作为一个整体对待，而是在原字符串的两侧、右侧、左侧删除参数字符串中包含的所有字符，一层一层地从外往里扒。

    >>> 'aabbccddeeeffg'.strip('af') #字母f不在字符串两侧，所以不删除
    'bbccddeeeffg'

    >>> 'aabbccddeeeffg'.strip('gaf')
    'bbccddeee'

    >>> 'aabbccddeeeffg'.strip('gaef')
    'bbccdd'

    >>> 'aabbccddeeeffg'.strip('gbaef')
    'ccdd'

    >>> 'aabbccddeeeffg'.strip('gbaefcd')
    ''

内置函数eval()

尝试把任意字符转化为Python表达式并求值。

    >>> eval("3+4") #计算表达式的值

    7

    >>> a = 3

    >>> b = 5

    >>> eval('a+b') #要求变量a和b已存在

    8

    >>> import math

    >>> eval('math.sqrt(3)')

    1.7320508075688772

    >>> eval('aa') #当前上下文中不存在对象aa

    NameError: name 'aa' is not defined

    >>> eval('*'.join(map(str, range(1, 6)))) #5的阶乘

    120

$\color{Red}{eval()函数是非常危险的}$

    >>> a = input("Please input:")
    Please input:__import__('os').startfile(r'C:\Windows\notepad.exe')

    >>> eval(a)
    >>>

    >>> eval(“__import__(‘os’).system(‘md testtest’)”) #创建文件夹testtest
    0

    >>> eval(“__import__(‘os’).system(‘rd testtest’)”) #删除文件夹testtest
    0

成员判断，关键字in

    >>> "a" in "abcde" #测试一个字符中是否存在于另一个字符串中

    True

    >>> 'ab' in 'abcde'

    True

    >>> 'ac' in 'abcde' #关键字in左边的字符串作为一个整体对待

    False

    >>> "j" in "abcde"

    False