【R语言学习笔记】6. 运用ggplot2包进行数据可视化----基于鸢尾花卉(iris)数据集

本文转载自查看原文 2019-12-06 04:33 722 ggplot2/ R语言/ iris数据集/ 数据可视化/ 数据清洗

1. 摘要：基于鸢尾花卉(iris)数据集来练习运用ggplot2进行数据可视化。

2. 数据来源：R语言内置数据集

3. 练习

3.1 基于原数据集以及整合数据集

# Aggregate the first four column by Species and calculate the mean
iris.summary <- aggregate(iris[1:4], list(iris$Species), mean)

# Change the name of the first column to 'Species'
names(iris.summary)[1] <- 'Species'

library(ggplot2) # load the ggplot2 package
ggplot(iris, aes(x = Sepal.Length, y = Sepal.Width, col = Species)) + geom_point() + geom_point(data = iris.summary, shape = 15, size = 5)

ggplot(iris, aes(x = Sepal.Length, y = Sepal.Width, col = Species)) + geom_point() +  geom_vline(
  data = iris.summary, linetype = 2, aes(xintercept = Sepal.Length, col = Species)) + geom_hline(
    data = iris.summary, linetype = 2, aes(yintercept = Sepal.Width, col = Species))

3.2 基于清洗的数据集

library(tidyr) # load tidyr package 
#iris.tidy----data frome format
iris_tidy <- iris %>%
  gather(key, value, -Species) %>%
  separate(key, c('Part', 'Measure'), '\\.')
head(iris_tidy)
str(iris_tidy)

ggplot(iris_tidy, aes(x = Species, y = value, col = Part)) + geom_jitter() + facet_grid(. ~ Measure)

#iris.wide
iris_wide <- iris %>%
  gather(key, value, -Species, -id) %>%
  separate(key, c('Part', 'Measure'), '\\.') %>%
  spread(Measure, value)
head(iris_wide)
str(iris_wide)

ggplot(iris_wide, aes(x = Length, y = Width, color = Part)) + geom_jitter() + facet_grid(. ~ Species)

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 鸢尾花数据集可视化 iris数据集（鸢尾花）机器学习——logistic回归，鸢尾花数据集预测，数据可视化 R_Studio(决策树算法)鸢尾花卉数据集Iris是一类多重变量分析的数据集【精】用PCA对鸢尾花数据集降维并可视化 15 鸢尾花(iris)数据集分析使用ggplot2进行数据可视化--案例分析鸢尾花数据集鸢尾花数据集鸢尾花数据集分析