UP主自己爬取评论

128 阅读1分钟

Up主个人登录bilibili,评论很好寻找,寻找classname中的ci-content就好了。 我的评论有24页,也就是循环的24个,只需要修后最末尾的数字就可以跳转

注意哦,登录页面是需要chrome跳出登陆页面由UP等timesleep的时候手动扫码登陆的哦,大家是登陆不上去的。

# import os
from selenium import webdriver
import time
from selenium.webdriver.common.by import By
# import datetime
# from dateutil.relativedelta import relativedelta
# import zipfile
            

options = webdriver.ChromeOptions()

fensi = webdriver.Chrome(options=options) 

fensi.get("https://member.bilibili.com/platform/comment/article?keyword=&order=ctime&filter=-1&is_hidden=0&type=1&comment_display=0&pn=1")
time.sleep(10)

suoyou = []
# fi = open('suoyoupinglun.txt','w')
for num in range(1,24):

    fensi.get("https://member.bilibili.com/platform/comment/article?keyword=&order=ctime&filter=-1&is_hidden=0&type=1&comment_display=0&pn="+str(num))
    
    time.sleep(10)
    # suoyou = fensi.text
    pinglun = fensi.find_elements(By.CLASS_NAME,"ci-content")
    
    for nn in pinglun:
        suoyou.append(nn.text)

for ii in suoyou:
    try:
        fm.write(str(ii)+'\n')
    except:
        continue
        
fm.close()