1
2
3
4
5
#
View Data Set

Name:

Description:

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

  • Update Frequency: This dataset is updated daily.

Variables:

Comic Characters

This folder contains data behind the story Comic Books Are Still Made By Men, For Men And About Men.

The data comes from Marvel Wikia and DC Wikia. Characters were scraped on August 24. Appearance counts were scraped on September 2. The month and year of the first issue each character appeared in was pulled on October 6.

The data is split into two files, for DC and Marvel, respectively: dc-wikia-data.csv and marvel-wikia-data.csv. Each file has the following variables:

Variable

Definition

page_id

The unique identifier for that characters page within the wikia

name

The name of the character

urlslug

The unique url within the wikia that takes you to the character

ID

The identity status of the character (Secret Identity, Public identity, [on marvel only: No Dual Identity])

ALIGN

If the character is Good, Bad or Neutral

EYE

Eye color of the character

HAIR

Hair color of the character

SEX

Sex of the character (e.g. Male, Female, etc.)

GSM

If the character is a gender or sexual minority (e.g. Homosexual characters, bisexual characters)

ALIVE

If the character is alive or deceased

APPEARANCES

The number of appareances of the character in comic books (as of Sep. 2, 2014. Number will become increasingly out of date as time goes on.)

FIRST APPEARANCE

The month and year of the character's first appearance in a comic book, if available

YEAR

The year of the character's first appearance in a comic book, if available

Link To Google Sheets:

Rows:

Columns:

License Type:

References/Notes/Attributions:

Acknowledgments

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

Cover photo by Zbysiu Rodak on Unsplash
Unsplash Images are distributed under a unique Unsplash License.

R Dataset Upload:

Use the following R code to directly access this dataset in R.

d <- read.csv("https://www.key2stats.com/FiveThirtyEight_Comic_Characters_Dataset_1621_1.csv")
d <- read.csv("https://www.key2stats.com/FiveThirtyEight_Comic_Characters_Dataset_1621_2.csv")

R Coding Interface:


Datasets Tag Questions & Instructional Blocks

NumberContentType
No results found.