Exploratory Data Analysis On Iris Dataset

Hi Everyone, We are Back with a New Project. In this Project, We Use Iris Dataset for Exploratory Data Analysis.
Necessary Library for this project,
- Pandas
- Numpy
- Seaborn
- Matplotlib
These Libraries are Enough for EDA. Now Let's Start
import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
#Load Datasets
iris = pd.read_csv('iris.csv')
#check Shape of this project
print(iris.shape)
#Check Columns in this project
print(iris.columns)
#Check Species of Iris Datasets
print("iris['species'].value_counts()")
#Check Scatter Plot Between Sepal_length Vs Sepal_width
iris.plot(kind='scatter', x='sepal_length', y='sepal_width')
plt.show()
Create a Whitegrid Scatter Plot Between sepal_leangth Vs Sepal_width , Hue= species
sns.set_style('whitegrid')
sns.FacetGrid(iris, hue='species',size=4) \
.map(plt.scatter, 'sepal_length', 'sepal_width') \
.add_legend()
plt.show()
Now We Show a Pairplot
plt.close()
sns.set_style('whitegrid')
sns.pairplot(iris,hue='species',size=3)
plt.show()
![]()
sns.FacetGrid(iris, hue='species', size=5) \ .map(sns.distplot, 'sepal_width') \ .add_legend() plt.show() Output:- ![]()
PDF and CDF
counts, bin_edges = np.histogram(iris_setosa['petal_length'], bins=10, density=True)
pdf = counts/(sum(counts))
print(pdf)
print(bin_edges)
cdf = np.cumsum(pdf)
plt.plot(bin_edges[1:],pdf)
plt.plot(bin_edges[1:], cdf)
counts, bin_edges = np.histogram(iris_setosa['petal_length'], bins=20, density=True)
pdf = counts/(sum(counts))
plt.plot(bin_edges[1:], pdf)
plt.show()
Box Plot
sns.boxplot(x='species',y='petal_length',data=iris)
plt.showow()
sns.violinplot(x='species',y='petal_length',data=iris,size=8)
plt.show()
Jointplot
sns.jointplot(x='petal_length',y='petal_width',data=iris_setosa,kind='kde')
plt.show()
Download Source Code
Thanks for Reading Share this Post
Random Blogs
- Mastering Python in 2025: A Complete Roadmap for Beginners
- Robotics & AI – How AI is Powering Modern Robotics
- Python Challenging Programming Exercises Part 2
- Types of Numbers in Python
- Understanding HTAP Databases: Bridging Transactions and Analytics
- Best Platform to Learn Digital Marketing in Free
- How AI is Making Humans Weaker – The Hidden Impact of Artificial Intelligence
- What is YII? and How to Install it?
- Store Data Into CSV File Using Python Tkinter GUI Library
- Google’s Core Update in May 2020: What You Need to Know
Prepare for Interview
- SQL Interview Questions for 2–5 Years Experience
- SQL Interview Questions for 1–2 Years Experience
- SQL Interview Questions for 0–1 Year Experience
- SQL Interview Questions for Freshers
- Design Patterns in Python
- Dynamic Programming and Recursion in Python
- Trees and Graphs in Python
- Linked Lists, Stacks, and Queues in Python
- Sorting and Searching in Python
- Debugging in Python
- Unit Testing in Python
Datasets for Machine Learning
- Amazon Product Reviews Dataset
- Ozone Level Detection Dataset
- Bank Transaction Fraud Detection
- YouTube Trending Video Dataset (updated daily)
- Covid-19 Case Surveillance Public Use Dataset
- US Election 2020
- Forest Fires Dataset
- Mobile Robots Dataset
- Safety Helmet Detection
- All Space Missions from 1957
- OSIC Pulmonary Fibrosis Progression Dataset
- Wine Quality Dataset
- Google Audio Dataset
- Iris flower dataset
- Artificial Characters Dataset
- Bitcoin Heist Ransomware Address Dataset