1
2
3
4
5
#

Name:

Description:

This is a simulated data set for photo classifications based on a machine learning algorithm versus what the true classification is for those photos. While the data are not real, they resemble performance that would be reasonable to expect in a well-built classifier.

Variables:

  • mach_learn - The prediction by the machine learning system as to whether the photo is about fashion or not.

  • truth - The actual classification of the photo by a team of humans.

Details

The hypothetical ML algorithm has a precision of 90\ meaning of those photos it claims are fashion, about 90\ of them are actually about fashion. The recall of the ML algorithm is about 64\ of the photos that are about fashion, it correctly predicts that they are about fashion about 64\

Link To Google Sheets:

Rows:

Columns:

License Type:

References/Notes/Attributions:

Source

The data are simulated / hypothetical.

R Dataset Upload:

Use the following R code to directly access this dataset in R.

d <- read.csv("https://www.key2stats.com/Photo_classifications__fashion_or_not_1476_85.csv")

R Coding Interface:


Datasets Tag Questions & Instructional Blocks

NumberContentType
No results found.