Common Functions When Using Selenium Chrome in Python
A guide that introduces the most common functions used when working with Selenium and Chrome in Python, enabling tasks like searching, interacting with web elements, and browser navigation.
Selenium is a powerful tool for automating browser tasks. When using Selenium with Chrome in Python, you can perform several key functions like opening a web page, clicking buttons, entering text, taking screenshots, and more. This article lists the most commonly used functions.
Common Functions with Selenium Chrome in Python:
-
Initializing WebDriver
from selenium import webdriver from selenium.webdriver.chrome.service import Service # Initialize Chrome WebDriver service = Service('/path/to/chromedriver') driver = webdriver.Chrome(service=service)
-
Opening a website
driver.get('https://www.example.com')
-
Finding elements on the webpage
element = driver.find_element('name', 'q') # Find by name
-
Inputting text in a search box
element.send_keys('Python Selenium')
-
Clicking on an element
button = driver.find_element('id', 'submit') button.click()
-
Waiting for a page or element to load
from selenium.webdriver.common.by import By from selenium.webdriver.support.ui import WebDriverWait from selenium.webdriver.support import expected_conditions as EC # Wait until the element is present element = WebDriverWait(driver, 10).until( EC.presence_of_element_located((By.ID, 'myElement')) )
-
Taking a screenshot of the page
driver.save_screenshot('screenshot.png')
-
Navigating the webpage
driver.back() # Go back to the previous page driver.forward() # Go forward to the next page driver.refresh() # Refresh the current page
-
Getting the page title
title = driver.title
-
Closing the browser
driver.quit() # Close all browser windows and terminate WebDriver
- Getting the current URL
current_url = driver.current_url
- Getting the page source
page_source = driver.page_source
- Handling new windows (tabs)
driver.switch_to.window(driver.window_handles[1])
- Scrolling to the bottom of the page
driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
Detailed explanation:
- Initializing WebDriver: Start the process of controlling Chrome through Selenium WebDriver.
- Opening a website: Navigate to the desired URL.
-
Finding elements on the webpage: Retrieve HTML elements using methods like
name
,id
,class name
. - Inputting text in a search box: Send strings into input fields.
- Clicking on an element: Simulate click actions on buttons or links.
- Waiting for an element to load: Explicitly wait until an element appears, useful for handling slow-loading pages.
- Taking a screenshot: Save a screenshot of the current webpage.
- Navigating: Move back, forward, or reload the page.
- Getting the page title: Retrieve the title of the current webpage.
- Closing the browser: Close the browser windows and end the WebDriver session.
- Getting the current URL: Retrieve the URL of the current page.
- Getting the page source: Access the entire HTML source of the page.
- Handling new windows: Switch between newly opened tabs or windows.
- Scrolling to the bottom: Scroll the page to the bottom to load any hidden elements.
System requirements:
- Python 3.x
- Selenium library (
pip install selenium
) - ChromeDriver that matches your Chrome version.
How to install the required libraries:
To install Selenium and ChromeDriver:
- Install Selenium via pip:
pip install selenium
- Download and install ChromeDriver from ChromeDriver Download.
Tips:
- Ensure that the ChromeDriver version matches the Chrome browser you are using.
- When working with slow-loading pages, use
WebDriverWait
to avoid errors due to elements not being loaded in time.