Logging in to LinkedIn
Here we will write the code to log in to LinkedIn. First, we initiate the web driver using Selenium and send a GET request to the login URL. Then we inspect the HTML document to find the input tags that accept the username/email and password, and the button tag for signing in.
Code:
from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By
from bs4 import BeautifulSoup
import time

# Creating a webdriver instance.
# In Selenium 4, the driver path is passed through a Service object.
driver = webdriver.Chrome(service=Service("Enter-Location-Of-Your-Web-Driver"))
# This instance will be used to log into LinkedIn

# Opening LinkedIn's login page
driver.get("https://linkedin.com/uas/login")

# Waiting for the page to load
time.sleep(5)

# Entering username
username = driver.find_element(By.ID, "username")
# In case of an error, try changing the element
# tag used here.

# Enter your email address
username.send_keys("User_email")

# Entering password
pword = driver.find_element(By.ID, "password")
# In case of an error, try changing the element
# tag used here.

# Enter your password
pword.send_keys("User_pass")

# Clicking on the log in button
# Format (syntax) of writing XPath -->
# //tagname[@attribute='value']
driver.find_element(By.XPATH, "//button[@type='submit']").click()
# In case of an error, try changing the
# XPath used here.
After executing the above code, you will be logged in to your LinkedIn profile. Here is what it would look like.
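The login code types the email and password as string literals, which makes them easy to leak if the script is shared. As a minimal sketch, the credentials could be read from environment variables instead. The variable names LINKEDIN_EMAIL and LINKEDIN_PASSWORD and the helper get_credentials are assumptions made for illustration, not part of Selenium or LinkedIn.

```python
import os


def get_credentials(env=None):
    """Return (email, password) read from environment variables.

    The variable names below are hypothetical; pick any names you like.
    `env` defaults to os.environ and exists mainly to make testing easy.
    """
    env = os.environ if env is None else env
    email = env.get("LINKEDIN_EMAIL")
    password = env.get("LINKEDIN_PASSWORD")

    # Fail early with a clear message instead of typing an empty
    # string into the login form.
    missing = [
        name
        for name, value in (
            ("LINKEDIN_EMAIL", email),
            ("LINKEDIN_PASSWORD", password),
        )
        if not value
    ]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return email, password
```

With this in place, the two send_keys calls would become username.send_keys(email) and pword.send_keys(password), where email, password = get_credentials().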
Scrape LinkedIn Using Selenium And Beautiful Soup in Python
In this article, we are going to scrape LinkedIn using Selenium and Beautiful Soup libraries in Python.
First of all, we need to install some libraries. Execute the following commands in the terminal.
pip install selenium
pip install beautifulsoup4
To use Selenium, we also need a web driver. You can download the web driver for Internet Explorer, Firefox, or Chrome. In this article, we will be using the Chrome web driver.
Note: If you get an error while following along with this article, there are two likely reasons.
- The webpage took too long to load (probably because of a slow internet connection). In this case, use the time.sleep() function to give the webpage extra time to load. Specify the number of seconds to sleep as needed.
- The HTML of the webpage has changed since this article was written. If so, you will have to select the required webpage elements manually instead of copying the element names written below. How to find the element names is explained below. Additionally, don't reduce the window height and width below the defaults, since that also changes the HTML of the webpage.
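For the slow-loading case, a fixed time.sleep() either waits too long or not long enough. One alternative is to retry the lookup until it succeeds or a timeout expires. The helper below is a minimal sketch of that idea in plain Python; the name wait_for and its parameters are assumptions for illustration. Selenium also ships its own explicit-wait class, WebDriverWait, which serves the same purpose.

```python
import time


def wait_for(find, timeout=10.0, poll=0.5):
    """Call find() repeatedly until it returns a truthy value.

    find    -- zero-argument callable, e.g. a lambda wrapping
               driver.find_element(...)
    timeout -- maximum number of seconds to keep trying
    poll    -- seconds to sleep between attempts
    Raises TimeoutError if nothing truthy is returned in time.
    """
    deadline = time.monotonic() + timeout
    last_error = None
    while True:
        try:
            result = find()
            if result:
                return result
        except Exception as exc:
            # Remember the failure (for example, element not found)
            # and try again until the deadline passes.
            last_error = exc
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"lookup did not succeed within {timeout} s"
            ) from last_error
        time.sleep(poll)
```

For example, username = wait_for(lambda: driver.find_element(By.ID, "username")) would return as soon as the field appears instead of always pausing for the full five seconds.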