Tip: For the best layout, collapse the left course sidebar while you work.
Data Source: The Titanic Dataset

The sinking of the Titanic is one of the most infamous shipwrecks in history.
On April 15, 1912, during her maiden voyage, the widely considered "unsinkable" RMS Titanic sank after colliding with an iceberg. Unfortunately, there weren't enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224 passengers and crew.
While there was some element of luck involved in surviving, it seems some groups of people were more likely to survive than others. This dataset, which has 891 rows, is a portion of the passenger list.
| # | Column Name | Note |
|---|---|---|
| 1 | PassengerId | Primary Key |
| 2 | Survived | Survival 0 = No, 1 = Yes; Values that indicate only [A or B] like survival status [death or alive] could create very unique queries |
| 3 | Pclass | Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd |
| 4 | Name | Name of Passengers |
| 5 | Sex | Gender male/female |
| 6 | Age | Age in years |
| 7 | SibSp | # of siblings / spouses aboard the Titanic |
| 8 | Parch | # of parents / children aboard the Titanic. Some children travelled only with a nanny, therefore parch=0 for them. |
| 9 | Ticket | Ticket number |
| 10 | Fare | Passenger fare |
| 11 | Cabin | Cabin number |
| 12 | Embarked | Port of Embarkation C = Cherbourg, Q = Queenstown, S = Southampton |
(only after you've learned the Basic SQL)
Survived is 0 or 1, so the average times 100 is the percentage.
Age Range: Teenage: 0-18, Young Adult: 18-25, Adult: 25-65, Elder: older than 65. Passengers with missing Age are grouped as Unknown.
Buckets use the same boundaries as the lesson text, with 18 and 25 starting the next band.