Predict air pollution index

0

0 votes
Approved, Machine Learning, Time Series Analysis, Easy., Approved
Problem

Problem statement

The increase in the number of vehicles has caused an increase in the pollution level. This increase is affecting nature in many ways. The government wishes to monitor the air pollution index in your city such that they can take action accordingly.

Your task is to predict the air pollution index based on the historical data provided to help the government to take appropriate actions.

Data

Data files

  • train.csv: Contains the training data; 33751 x 14 (includes headers)
  • test.csv: Contains the test data; 14455 x 13 (includes headers)

Data description

Columns

Description

date_time

Date, time, and hour of the data that is collected in the local time

is_holiday

Whether the day is a holiday or not

humidity

Numeric humidity in Celcius

wind_speed

Numeric wind speed in miles per hour

wind_direction

Cardinal wind direction (0-360 degree)

visibility_in_miles

Visibility of distance in miles

dew_point

Numeric dew point in Celcius

temperature

Numeric average temperature in Kelvin

rain_p_h

Numeric amount (in mm) of rain that occurred in the hour

snow_p_h

Numeric amount (in mm) of snow that occurred in the hour

clouds_all

Numeric percentage of cloud cover

weather_type

Categorical short textual description of the current weather

traffic_volume Hourly traffic volume bound in a specific direction
air_pollution_index Air quality index (10-300)

Submission format

You are required to write your predictions in a .csv file and upload it by clicking on the Upload file.

Sample submission 

date_time

air_pollution_index

1969-05-17 21:00:00

73

1969-05-17 21:00:00

251

1969-05-17 21:00:00

83

1969-05-17 22:00:00

98

Evaluation criteria

score=max(0,(100root_mean_absolute_error))

Time Limit: 5
Memory Limit: 256
Source Limit:
Contributers:
Editor Image

?