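1. Environment setup:
To use Chrome, install the Chrome browser and chromedriver, and make sure the directory containing chromedriver is on the PATH environment variable.
To use PhantomJS, install PhantomJS and make sure the directory containing the phantomjs executable is on the PATH environment variable.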
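Before launching a browser, it can help to confirm that the driver executables are actually visible on PATH. The following is a minimal sketch using only the standard library; the helper name missing_drivers is hypothetical and is not part of Selenium.

```python
import shutil


def missing_drivers(names=("chromedriver", "phantomjs")):
    """Return the executables from `names` that cannot be found on PATH."""
    # shutil.which returns None when no matching executable is on PATH
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    for name in missing_drivers():
        print(f"{name} was not found on PATH")
```

If the script prints nothing, both executables were found; otherwise add the reported driver's directory to PATH before running the example code.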
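2. Example code:
# coding=utf-8
from selenium import webdriver
import time
# Instantiate a browser
driver = webdriver.Chrome()
# driver = webdriver.PhantomJS()
# Set the window size
# driver.set_window_size(1920, 1080)
# Maximize the window
driver.maximize_window()
driver.get("https://www.baidu.com")
# Locate page elements by id
# (the find_element_by_* helpers are the Selenium 3 API; they were removed in Selenium 4.3)
driver.find_element_by_id("kw").send_keys("python")
driver.find_element_by_id("su").click()
# Get the page HTML as a string
# print(driver.page_source)
print(driver.current_url)
# Get the cookies from the driver (a list of dicts)
cookies = driver.get_cookies()
print(cookies)
print("*" * 100)
# Convert the list of cookie dicts into a single {name: value} dict
cookies = {i["name"]: i["value"] for i in cookies}
print(cookies)
# Wait for the results page to load, then take a screenshot
time.sleep(3)
# driver.save_screenshot("./baidu.png")
driver.quit()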
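The cookie conversion in the example is often the step that gets reused elsewhere, for instance to replay a browser session from a plain HTTP client. Below is a rough sketch of that conversion on its own. The sample cookie names and values are made up for illustration, and the helper names cookies_to_dict and cookie_header are hypothetical; only the "name" and "value" keys are taken from the example above.

```python
def cookies_to_dict(cookie_list):
    """Collapse a list of cookie dicts (as returned by driver.get_cookies()) to {name: value}."""
    return {cookie["name"]: cookie["value"] for cookie in cookie_list}


def cookie_header(cookie_list):
    """Build the value of an HTTP Cookie header from the same list."""
    pairs = cookies_to_dict(cookie_list)
    return "; ".join(f"{name}={value}" for name, value in pairs.items())


# Made-up sample data shaped like the output of driver.get_cookies()
sample = [
    {"name": "session_id", "value": "abc123", "domain": ".example.com", "path": "/"},
    {"name": "theme", "value": "dark", "domain": ".example.com", "path": "/"},
]

if __name__ == "__main__":
    print(cookies_to_dict(sample))
    print(cookie_header(sample))
```

The resulting dict can be handed to an HTTP client that accepts a name-to-value cookie mapping, while the header string suits clients that only accept raw headers.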
3. Common methods for getting elements:
Get a single element (raises NoSuchElementException if no element is found):
* driver.find_element_by_id()
* driver.find_element_by_name()
* driver.find_element_by_xpath()
* driver.find_element_by_link_text()
* driver.find_element_by_partial_link_text()
* driver.find_element_by_tag_name()
* driver.find_element_by_class_name()
* driver.find_element_by_css_selector()
Get multiple elements (returns an empty list if no element is found):
* driver.find_elements_by_name()
* driver.find_elements_by_xpath()
* driver.find_elements_by_link_text()
* driver.find_elements_by_partial_link_text()
* driver.find_elements_by_tag_name()
* driver.find_elements_by_class_name()
* driver.find_elements_by_css_selector()
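Because the plural lookups return an empty list instead of raising, they can be used to test for an element without a try/except. The sketch below relies on the generic driver.find_elements(strategy, value) call, which exists in both Selenium 3 and Selenium 4 and is the replacement for the find_element_by_* helpers in releases that no longer ship them. The strategy strings are assumed to mirror the constants in selenium.webdriver.common.by.By, and the names LEGACY_TO_STRATEGY and find_or_none are hypothetical.

```python
# Maps each legacy helper to its locator strategy string.
# Assumption: these strings mirror the constants on selenium.webdriver.common.by.By.
LEGACY_TO_STRATEGY = {
    "find_element_by_id": "id",
    "find_element_by_name": "name",
    "find_element_by_xpath": "xpath",
    "find_element_by_link_text": "link text",
    "find_element_by_partial_link_text": "partial link text",
    "find_element_by_tag_name": "tag name",
    "find_element_by_class_name": "class name",
    "find_element_by_css_selector": "css selector",
}


def find_or_none(driver, strategy, value):
    """Return the first matching element, or None when nothing matches."""
    # find_elements returns an empty list rather than raising when nothing is found
    matches = driver.find_elements(strategy, value)
    return matches[0] if matches else None
```

With a live driver, find_or_none(driver, LEGACY_TO_STRATEGY["find_element_by_id"], "kw") would return the Baidu search box from the example, or None if the page has no such element.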