In Python, the ModuleNotFoundError: No module named ‘bs4’ error occurs when we try to import the bs4 module without first installing the beautifulsoup4 package that provides it, or when the package is installed in a different environment from the one running the code.
In this tutorial, let’s look at installing the beautifulsoup4 package correctly on different operating systems and solving the ModuleNotFoundError: No module named ‘bs4’ error.
What is ModuleNotFoundError: No module named ‘bs4’?
There are various reasons why we get the ModuleNotFoundError: No module named ‘bs4’ error:
- Trying to use the module without installing the beautifulsoup4 package.
- The IDE is set to an incorrect version of Python or the wrong Python interpreter.
- You are using a virtual environment and the beautifulsoup4 package is not installed inside that virtual environment.
- The beautifulsoup4 package is installed for a different version of Python than the one currently in use.
- A variable or file is declared with the same name as the module (bs4).
If you are getting an error while running pip, check out pip: command not found to resolve the issue.
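To narrow down which of the causes above applies, it can help to ask the interpreter where it would load bs4 from. The following is a minimal sketch using only the standard library; the helper name locate_module is hypothetical and chosen here for illustration.

```python
import importlib.util


def locate_module(name):
    """Return the file the current interpreter would import for `name`, or None."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        # The interpreter cannot find the module on its search path
        return None
    return spec.origin


# Prints a path inside site-packages when beautifulsoup4 is installed
# for this interpreter, and None when it is not
print("bs4 found at:", locate_module("bs4"))
```

If this prints None, the package is missing from the environment that runs the script. If it prints a path outside site-packages, a local file is probably shadowing the real module.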
How to fix ModuleNotFoundError: No module named ‘bs4’?
beautifulsoup4 is not a built-in module (it doesn’t come with the default Python installation); you need to install it explicitly using the pip installer before you can use it.
Beautiful Soup is a library that makes it easy to scrape information from web pages. It sits atop an HTML or XML parser, providing Pythonic idioms for iterating, searching, and modifying the parse tree.
We can fix the error by installing the beautifulsoup4 package by running the pip install beautifulsoup4 command in your terminal/shell.
We can verify if the package is installed correctly by running the following command in the terminal/shell.
This will provide the details of the installed package, including the version number, license, and the path where it is installed. If the package is not installed, you will get a warning message in the terminal stating WARNING: Package(s) not found: beautifulsoup4.
pip show beautifulsoup4
Output
Name: beautifulsoup4
Version: 4.11.1
Summary: Screen-scraping library
Home-page: https://www.crummy.com/software/BeautifulSoup/bs4/
Author: Leonard Richardson
Author-email: leonardr@segfault.org
License: MIT
Location: c:\personal\ijs\python_samples\venv\lib\site-packages
Requires: soupsieve
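The same check can be made from inside Python, which confirms the result for the exact interpreter running your code rather than for whichever pip is first on PATH. This is a minimal sketch assuming Python 3.8 or later (where importlib.metadata is available); the helper name installed_version is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


# Prints the version string if the package is installed for this interpreter,
# otherwise prints None
print("beautifulsoup4:", installed_version("beautifulsoup4"))
```

Note that the lookup uses the distribution name (beautifulsoup4), not the import name (bs4).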
Solution 1 – Installing and using the beautifulsoup4 module in a proper way
Based on the Python version and the operating system you are running, run the relevant command to install the beautifulsoup4 module.
# If you are using Python 2 (Windows)
pip install beautifulsoup4
# if you are using Python 3 (Windows)
pip3 install beautifulsoup4
# If pip is not on the PATH environment variable
python -m pip install beautifulsoup4
# If you are using Python 2 (Linux)
sudo pip install beautifulsoup4
# if you are using Python 3 (Linux)
sudo pip3 install beautifulsoup4
# If you have to use easy_install (deprecated; prefer pip where possible)
sudo easy_install -U beautifulsoup4
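# On CentOS (the package name may vary by release and may require the EPEL repository)
sudo yum install python3-beautifulsoup4
# On Ubuntu (the system package is named python3-bs4)
sudo apt-get install python3-bs4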
# If you are installing it in Anaconda
conda install -c conda-forge beautifulsoup4
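When several Python versions are installed, a common source of this error is that pip belongs to a different interpreter than the one running the script. One way to avoid the mismatch, sketched below, is to invoke pip through sys.executable so the package lands in the current interpreter's environment. The helper names pip_install_command and install are hypothetical.

```python
import subprocess
import sys


def pip_install_command(package):
    """Build a pip command bound to the interpreter that runs this script."""
    return [sys.executable, "-m", "pip", "install", package]


def install(package):
    """Run pip for the current interpreter; raises CalledProcessError on failure."""
    subprocess.check_call(pip_install_command(package))


# Show the command that would be run; call install("beautifulsoup4") to run it
print(" ".join(pip_install_command("beautifulsoup4")))
```

This is equivalent to the python -m pip install beautifulsoup4 form shown above, with the interpreter path made explicit.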
Once you have installed the beautifulsoup4 package, we can import it in our code and use it as shown below.
The example below first parses a small HTML string and then fetches all the links (href) on the Metrics Converter home page using the BeautifulSoup and urllib libraries.
from bs4 import BeautifulSoup
import urllib.request

# Parse a small HTML string and pretty-print the resulting tree
soup = BeautifulSoup(
    "<html><body><h1>Welcome to Python Tutorial</h1></body></html>", "html.parser"
)
print(soup.prettify())

# Fetch a live page and print the href of every link that has one
parser = "html.parser"  # or 'lxml' (preferred) or 'html5lib', if installed
resp = urllib.request.urlopen("https://www.metricsconverter.com")
soup = BeautifulSoup(resp, parser, from_encoding=resp.info().get_param("charset"))
for link in soup.find_all("a", href=True):
    print(link["href"])
Output
<html>
 <body>
  <h1>
   Welcome to Python Tutorial
  </h1>
 </body>
</html>
/
/static/privacy-policy/
/static/cookie-policy/
/static/contact-us/
/length/
/volume/
/area/
/energy/
/force/
/speed/
/weight-and-mass/
/time/
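The links printed above are site-relative paths. If full URLs are needed, they can be resolved against the page address with urljoin from the standard library. The sketch below uses a few of the paths from the output as sample data; the variable names are illustrative.

```python
from urllib.parse import urljoin

BASE_URL = "https://www.metricsconverter.com"  # same site as the example above

# A few relative paths copied from the sample output
relative_links = ["/", "/static/privacy-policy/", "/length/"]

# Resolve each relative path against the base URL
absolute_links = [urljoin(BASE_URL, href) for href in relative_links]
for url in absolute_links:
    print(url)
```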
Solution 2 – Verify if the IDE is set to use the correct Python version
If you are still getting the same error even after installing the package, you can verify if the IDE you are using is configured with the correct version of the Python interpreter.
For example, in Visual Studio Code, we can set the Python version by pressing Ctrl + Shift + P (or ⌘ + Shift + P on Mac) to open the command palette.
Once the command palette opens, run the Python: Select Interpreter command and choose the correct version of Python, or the virtual environment if one is configured.
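To confirm which interpreter the IDE actually uses, you can run a short script from inside the IDE and compare the result with the interpreter you installed the package for. This is a minimal sketch; the helper name interpreter_summary is hypothetical.

```python
import sys


def interpreter_summary():
    """Return the path and version of the interpreter running this script."""
    return {
        "executable": sys.executable,
        "version": "{}.{}.{}".format(*sys.version_info[:3]),
    }


summary = interpreter_summary()
print("Executable:", summary["executable"])
print("Version:", summary["version"])
```

If the executable path differs from the one reported in your terminal, the IDE and the terminal are using different Python installations.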
Solution 3 – Installing beautifulsoup4 inside the virtual environment
Many tools and IDEs, such as Jupyter Notebook, Spyder, Anaconda, or PyCharm, often set up their own Python environment to keep things clean and separated from your global Python.
If you are using VS Code, then you can also create a virtual environment, as shown below.
In the case of virtual environments, you need to ensure that the beautifulsoup4 package is installed inside the virtual environment and not globally.
Step 1: Create a Virtual Environment. If you have already created a virtual environment, then proceed to step 2.
Step 2: Activate the Virtual Environment
Step 3: Install the required module using the pip install command
# Create a virtual environment (Windows launcher; on Linux/macOS use: python3 -m venv venv)
py -3 -m venv venv
# Activate the virtual environment (windows command)
venv\Scripts\activate.bat
# Activate the virtual environment (windows powershell)
venv\Scripts\Activate.ps1
# Activate the virtual environment (Linux)
source venv/bin/activate
# Install beautifulsoup4 inside the virtual environment
pip install beautifulsoup4
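After activating the environment, you can check from Python that the script really runs inside it. The sketch below relies on the fact that, in environments created by the venv module, sys.prefix points at the environment while sys.base_prefix points at the base installation. The helper name in_virtualenv is hypothetical.

```python
import sys


def in_virtualenv():
    """Return True when running inside an environment created by venv."""
    return sys.prefix != sys.base_prefix


print("Inside a virtual environment:", in_virtualenv())
print("Environment root:", sys.prefix)
```

If this reports False while you expect the environment to be active, the package was most likely installed into one environment and the script is being run from another.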
Solution 4 – Ensure that the module name is not declared as a variable name
Last but not least, you may need to cross-check and ensure that you haven’t declared a variable with the same name as the module name.
You should also check that you haven’t named any of your own files bs4.py, as such a file may shadow the original bs4 module.
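A quick way to look for such a file is to scan the project directory for anything that Python would import as bs4 ahead of the installed package. This is a minimal sketch that checks only a single directory; the helper name find_shadowing_paths is hypothetical.

```python
from pathlib import Path


def find_shadowing_paths(module_name, directory="."):
    """Return local files in `directory` that would be imported as `module_name`."""
    base = Path(directory)
    candidates = [
        base / f"{module_name}.py",            # a single-file module, e.g. bs4.py
        base / module_name / "__init__.py",    # a local package, e.g. bs4/__init__.py
    ]
    return [path for path in candidates if path.exists()]


# Report anything in the current directory that could shadow the real bs4
for path in find_shadowing_paths("bs4"):
    print("Possible shadowing file:", path.resolve())
```

If the script prints a path, renaming that file or folder usually resolves the conflict.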
If the issue is still not solved, you can try removing the package and installing it once again, restarting the IDE, and checking the paths to ensure that packages are installed in the correct environment path and Python version.
Conclusion
The ModuleNotFoundError: No module named ‘bs4’ error occurs when we try to import the bs4 module without installing the beautifulsoup4 package, or when the package is not installed in the correct environment.
We can resolve the issue by installing the beautifulsoup4 package with the pip install beautifulsoup4 command. Also, ensure that the package is installed in the proper environment in case you use any virtual environments, and that the Python version is set appropriately in the IDE in which you are running the code.