Activity - Data Analysis#

import pandas as pd

Let’s work with a Heart Disease dataset, all the details are in the following link.

In this activity you should:

  1. Read the dataset correctly.

  2. Perform descriptive statistics of the dataset.

  3. Explore the dataset.

You can find some ideas here and in the documetantion of pandas.

Share your finds with the rest of the group.

  1. What are the variables you think would be useful for predicting a heart disease?

  2. Is there any chart/plot that could help you to understand this dataset better?

  3. Did you find anything interesting?

Read dataset#

data_filepath = "https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/processed.cleveland.data"
heart_df = pd.read_csv(
    data_filepath,
    names=[#FIXME#]
)
heart_df.head()
  Cell In[3], line 4
    )
    ^
SyntaxError: closing parenthesis ')' does not match opening parenthesis '[' on line 3

Descripitive Statistics#

# Play here

Data Exploration#

# Play here