1
2
3
4
5
#
View Data Set

Name:

Description:

This is a list of every UFC fight in the history of the organisation. Every row contains information about both fighters, fight details and the winner. The data was scraped from ufcstats website. After fightmetric ceased to exist, this came into picture. I saw that there was a lot of information on the website about every fight and every event and there were no existing ways of capturing all this. I used beautifulsoup to scrape the data and pandas to process it. It was a long and arduous process, please forgive any mistakes. I have provided the raw files incase anybody wants to process it differently. This is my first time creating a dataset, any suggestions and corrections are welcome! Incase anyone wants to check out the work, I have all uploaded all the code files, including the scraping module here

Have fun!

Variables:

Each row is a compilation of both fighter stats. Fighters are represented by 'red' and 'blue' (for red and blue corner). So for instance, red fighter has the complied average stats of all the fights except the current one. The stats include damage done by the red fighter on the opponent and the damage done by the opponent on the fighter (represented by 'opp' in the columns) in all the fights this particular red fighter has had, except this one as it has not occured yet (in the data). Same information exists for blue fighter. The target variable is 'Winner' which is the only column that tells you what happened.
Here are some column definitions:

Column definitions:

  • R_ and B_ prefix signifies red and blue corner fighter stats respectively

  • _opp_ containing columns is the average of damage done by the opponent on the fighter

  • KD is number of knockdowns

  • SIG_STR is no. of significant strikes 'landed of attempted'

  • SIG_STR_pct is significant strikes percentage

  • TOTAL_STR is total strikes 'landed of attempted'

  • TD is no. of takedowns

  • TD_pct is takedown percentages

  • SUB_ATT is no. of submission attempts

  • PASS is no. times the guard was passed?

  • REV is the no. of Reversals landed

  • HEAD is no. of significant strinks to the head 'landed of attempted'

  • BODY is no. of significant strikes to the body 'landed of attempted'

  • CLINCH is no. of significant strikes in the clinch 'landed of attempted'

  • GROUND is no. of significant strikes on the ground 'landed of attempted'

  • win_by is method of win

  • last_round is last round of the fight (ex. if it was a KO in 1st, then this will be 1)

  • last_round_time is when the fight ended in the last round

  • Format is the format of the fight (3 rounds, 5 rounds etc.)

  • Referee is the name of the Ref

  • date is the date of the fight

  • location is the location in which the event took place

  • Fight_type is which weight class and whether it's a title bout or not

  • Winner is the winner of the fight

  • Stance is the stance of the fighter (orthodox, southpaw, etc.)

  • Height_cms is the height in centimeter

  • Reach_cms is the reach of the fighter (arm span) in centimeter

  • Weight_lbs is the weight of the fighter in pounds (lbs)

  • age is the age of the fighter

  • title_bout Boolean value of whether it is title fight or not

  • weight_class is which weight class the fight is in (Bantamweight, heavyweight, Women's flyweight, etc.)

  • no_of_rounds is the number of rounds the fight was scheduled for

  • current_lose_streak is the count of current concurrent losses of the fighter

  • current_win_streak is the count of current concurrent wins of the fighter

  • draw is the number of draws in the fighter's ufc career

  • wins is the number of wins in the fighter's ufc career

  • losses is the number of losses in the fighter's ufc career

  • total_rounds_fought is the average of total rounds fought by the fighter

  • total_time_fought(seconds) is the count of total time spent fighting in seconds

  • total_title_bouts is the total number of title bouts taken part in by the fighter

  • win_by_Decision_Majority is the number of wins by majority judges decision in the fighter's ufc career

  • win_by_Decision_Split is the number of wins by split judges decision in the fighter's ufc career

  • win_by_Decision_Unanimous is the number of wins by unanimous judges decision in the fighter's ufc career

  • win_by_KO/TKO is the number of wins by knockout in the fighter's ufc career

  • win_by_Submission is the number of wins by submission in the fighter's ufc career

  • win_by_TKO_Doctor_Stoppage is the number of wins by doctor stoppage in the fighter's ufc career

Link To Google Sheets:

Rows:

Columns:

License Type:

References/Notes/Attributions:

Acknowledgments

R Dataset Upload:

Use the following R code to directly access this dataset in R.

d <- read.csv("https://www.key2stats.com/UFC-Fight_historical_data_from_1993_to_2019_1551_40.csv")

R Coding Interface:


Datasets Tag Questions & Instructional Blocks

NumberContentType
#PROBLEM-49321

In the following questions, we will use the "UFC-Fight historical data from 1993 to 2019"....

Question
Showing 1-1 of 1 item.