import babypandas as bpd
import numpy as np


oct_1 = bpd.read_csv('data/get-it-done-oct-1.csv')
oct_1


# This DataFrame has 753 rows and 7 columns
oct_1


oct_1.set_index('service_request_id')


oct_1


oct_1 = oct_1.set_index('service_request_id')
oct_1


# There were 7 columns before, but one of them became the index, and the index is not a column!
oct_1.shape

(753, 6)


# Number of rows
oct_1.shape[0]

753


# Number of columns
oct_1.shape[1]

6


requests = bpd.read_csv('data/get-it-done-requests.csv')
requests


requests


requests.get('closed')

0         46
1          2
2       1484
3         25
4        977
        ... 
1582       1
1583       0
1584       9
1585       3
1586       1
Name: closed, Length: 1587, dtype: int64


requests.get('closed')

0         46
1          2
2       1484
3         25
4        977
        ... 
1582       1
1583       0
1584       9
1585       3
1586       1
Name: closed, Length: 1587, dtype: int64


requests.get('open')

0         0
1         0
2       219
3         1
4         0
       ... 
1582      0
1583      1
1584      1
1585      0
1586      0
Name: open, Length: 1587, dtype: int64


requests.get('closed') + requests.get('open')

0         46
1          2
2       1703
3         26
4        977
        ... 
1582       1
1583       1
1584      10
1585       3
1586       1
Length: 1587, dtype: int64


requests.assign(
    total=requests.get('closed') + requests.get('open')
)


requests


requests = requests.assign(
    total=requests.get('closed') + requests.get('open')
)
requests


requests.get('total').max()

11342


requests.get('total').mean()

171.33333333333334


requests.get('total').median()

41.0


requests.get('open').mean()

25.220541902961564


requests.get('open').median()

4.0


requests.sort_values(by='total')


ordered_requests = requests.sort_values(by='total', ascending=False)
ordered_requests


ordered_requests


ordered_requests.get('neighborhood')

232                   Downtown
452      Mid-City:City Heights
1360    Southeastern San Diego
1363    Southeastern San Diego
840                 North Park
                 ...          
1428      Tijuana River Valley
1426      Tijuana River Valley
1423      Tijuana River Valley
1422      Tijuana River Valley
1586           Via De La Valle
Name: neighborhood, Length: 1587, dtype: object


ordered_requests.get('neighborhood').iloc[0]

'Downtown'


ordered_requests.get('service').iloc[0]

'Encampment'


oct_1


oct_1.get('status')

service_request_id
3940112      Open
3940113      Open
3940114      Open
3940115      Open
3940116      Open
            ...  
3940819    Closed
3940876    Closed
3940900    Closed
3940909    Closed
3940924    Closed
Name: status, Length: 753, dtype: object


oct_1.get('status').loc[3940652]

'Open'

...

Ellipsis


bpd.read_csv('data/get-it-done-oct-1.csv')


bpd.read_csv('data/get-it-done-oct-1.csv').get('public_description').loc[31]

'Limes blocking the sidewalk'


bpd.read_csv('data/get-it-done-oct-1.csv').get('public_description').iloc[31]

'Limes blocking the sidewalk'


requests


requests[requests.get('service') == 'Weed Cleanup']


'Weed Cleanup' == 'Weed Clean-Up'

False


'Weed Cleanup' == 'Weed Cleanup'

True


requests.get('service') == 'Weed Cleanup'

0       False
1       False
2       False
3       False
4       False
        ...  
1582    False
1583    False
1584    False
1585    False
1586    False
Name: service, Length: 1587, dtype: bool


requests


requests.get('open') > 1

0       False
1       False
2        True
3       False
4       False
        ...  
1582    False
1583    False
1584    False
1585    False
1586    False
Name: open, Length: 1587, dtype: bool


requests[requests.get('open') > 1]


weed_cleanup_only = requests[requests.get('service') == 'Weed Cleanup']
weed_cleanup_sorted = weed_cleanup_only.sort_values(by='total', ascending=False)
weed_cleanup_sorted


weed_cleanup_sorted.get('neighborhood').iloc[0]

'Southeastern San Diego'


requests[requests.get('service') == 'Lime Cleanup']

...

Ellipsis

...

Ellipsis

	service_request_id	date_requested	neighborhood	service	status	street_address	public_description
0	3940112	2022-10-01T00:11:00	La Jolla	Pothole	Open	7556 VIA CAPRI, San Diego, CA 92037, USA	Pothole
1	3940113	2022-10-01T00:12:00	La Jolla	Pothole	Open	7566 VIA CAPRI, San Diego, CA 92037, USA	Potholes / fix the damn road
2	3940114	2022-10-01T00:13:00	Pacific Beach	Street Light Maintenance	Open	1698-1500 Monmouth Dr, San Diego, CA 92109, USA	Street light out on the corner of Monmouth Dr ...
3	3940115	2022-10-01T00:13:00	La Jolla	Pothole	Open	7456 VIA CAPRI, San Diego, CA 92037, USA	Pothole
4	3940116	2022-10-01T00:16:00	Scripps Miramar Ranch	Traffic Signal Timing	Open	10895 Hibert St, San Diego, CA 92131, USA	Is it possible to time this light sequentially...
...	...	...	...	...	...	...	...
748	3940819	2022-10-01T17:39:00	North Park	Other	Closed	3935 32nd St	Bike Theft Chop Shop Behind Starbucks on 32nd ...
749	3940876	2022-10-01T19:47:00	Skyline-Paradise Hills	Parking	Closed	7701-7899 Bloomfield Rd, San Diego, CA 92114, USA	Car has been parked there for many months. Exp...
750	3940900	2022-10-01T21:06:00	Downtown	Sidewalk Repair Issue	Closed	Petco Park	Safety hazard illegal vending
751	3940909	2022-10-01T21:44:00	Clairemont Mesa	Other	Closed	2810 Denver Street	Underage drinking party
752	3940924	2022-10-01T22:54:00	Clairemont Mesa	Other	Closed	5378 Jamestown Rd	People doing drugs in their car AGAIN. Please ...

	service_request_id	date_requested	neighborhood	service	status	street_address	public_description
0	3940112	2022-10-01T00:11:00	La Jolla	Pothole	Open	7556 VIA CAPRI, San Diego, CA 92037, USA	Pothole
1	3940113	2022-10-01T00:12:00	La Jolla	Pothole	Open	7566 VIA CAPRI, San Diego, CA 92037, USA	Potholes / fix the damn road
2	3940114	2022-10-01T00:13:00	Pacific Beach	Street Light Maintenance	Open	1698-1500 Monmouth Dr, San Diego, CA 92109, USA	Street light out on the corner of Monmouth Dr ...
3	3940115	2022-10-01T00:13:00	La Jolla	Pothole	Open	7456 VIA CAPRI, San Diego, CA 92037, USA	Pothole
4	3940116	2022-10-01T00:16:00	Scripps Miramar Ranch	Traffic Signal Timing	Open	10895 Hibert St, San Diego, CA 92131, USA	Is it possible to time this light sequentially...
...	...	...	...	...	...	...	...
748	3940819	2022-10-01T17:39:00	North Park	Other	Closed	3935 32nd St	Bike Theft Chop Shop Behind Starbucks on 32nd ...
749	3940876	2022-10-01T19:47:00	Skyline-Paradise Hills	Parking	Closed	7701-7899 Bloomfield Rd, San Diego, CA 92114, USA	Car has been parked there for many months. Exp...
750	3940900	2022-10-01T21:06:00	Downtown	Sidewalk Repair Issue	Closed	Petco Park	Safety hazard illegal vending
751	3940909	2022-10-01T21:44:00	Clairemont Mesa	Other	Closed	2810 Denver Street	Underage drinking party
752	3940924	2022-10-01T22:54:00	Clairemont Mesa	Other	Closed	5378 Jamestown Rd	People doing drugs in their car AGAIN. Please ...

	date_requested	neighborhood	service	status	street_address	public_description
service_request_id
3940112	2022-10-01T00:11:00	La Jolla	Pothole	Open	7556 VIA CAPRI, San Diego, CA 92037, USA	Pothole
3940113	2022-10-01T00:12:00	La Jolla	Pothole	Open	7566 VIA CAPRI, San Diego, CA 92037, USA	Potholes / fix the damn road
3940114	2022-10-01T00:13:00	Pacific Beach	Street Light Maintenance	Open	1698-1500 Monmouth Dr, San Diego, CA 92109, USA	Street light out on the corner of Monmouth Dr ...
3940115	2022-10-01T00:13:00	La Jolla	Pothole	Open	7456 VIA CAPRI, San Diego, CA 92037, USA	Pothole
3940116	2022-10-01T00:16:00	Scripps Miramar Ranch	Traffic Signal Timing	Open	10895 Hibert St, San Diego, CA 92131, USA	Is it possible to time this light sequentially...
...	...	...	...	...	...	...
3940819	2022-10-01T17:39:00	North Park	Other	Closed	3935 32nd St	Bike Theft Chop Shop Behind Starbucks on 32nd ...
3940876	2022-10-01T19:47:00	Skyline-Paradise Hills	Parking	Closed	7701-7899 Bloomfield Rd, San Diego, CA 92114, USA	Car has been parked there for many months. Exp...
3940900	2022-10-01T21:06:00	Downtown	Sidewalk Repair Issue	Closed	Petco Park	Safety hazard illegal vending
3940909	2022-10-01T21:44:00	Clairemont Mesa	Other	Closed	2810 Denver Street	Underage drinking party
3940924	2022-10-01T22:54:00	Clairemont Mesa	Other	Closed	5378 Jamestown Rd	People doing drugs in their car AGAIN. Please ...

	service_request_id	date_requested	neighborhood	service	status	street_address	public_description
0	3940112	2022-10-01T00:11:00	La Jolla	Pothole	Open	7556 VIA CAPRI, San Diego, CA 92037, USA	Pothole
1	3940113	2022-10-01T00:12:00	La Jolla	Pothole	Open	7566 VIA CAPRI, San Diego, CA 92037, USA	Potholes / fix the damn road
2	3940114	2022-10-01T00:13:00	Pacific Beach	Street Light Maintenance	Open	1698-1500 Monmouth Dr, San Diego, CA 92109, USA	Street light out on the corner of Monmouth Dr ...
3	3940115	2022-10-01T00:13:00	La Jolla	Pothole	Open	7456 VIA CAPRI, San Diego, CA 92037, USA	Pothole
4	3940116	2022-10-01T00:16:00	Scripps Miramar Ranch	Traffic Signal Timing	Open	10895 Hibert St, San Diego, CA 92131, USA	Is it possible to time this light sequentially...
...	...	...	...	...	...	...	...
748	3940819	2022-10-01T17:39:00	North Park	Other	Closed	3935 32nd St	Bike Theft Chop Shop Behind Starbucks on 32nd ...
749	3940876	2022-10-01T19:47:00	Skyline-Paradise Hills	Parking	Closed	7701-7899 Bloomfield Rd, San Diego, CA 92114, USA	Car has been parked there for many months. Exp...
750	3940900	2022-10-01T21:06:00	Downtown	Sidewalk Repair Issue	Closed	Petco Park	Safety hazard illegal vending
751	3940909	2022-10-01T21:44:00	Clairemont Mesa	Other	Closed	2810 Denver Street	Underage drinking party
752	3940924	2022-10-01T22:54:00	Clairemont Mesa	Other	Closed	5378 Jamestown Rd	People doing drugs in their car AGAIN. Please ...

	date_requested	neighborhood	service	status	street_address	public_description
service_request_id
3940112	2022-10-01T00:11:00	La Jolla	Pothole	Open	7556 VIA CAPRI, San Diego, CA 92037, USA	Pothole
3940113	2022-10-01T00:12:00	La Jolla	Pothole	Open	7566 VIA CAPRI, San Diego, CA 92037, USA	Potholes / fix the damn road
3940114	2022-10-01T00:13:00	Pacific Beach	Street Light Maintenance	Open	1698-1500 Monmouth Dr, San Diego, CA 92109, USA	Street light out on the corner of Monmouth Dr ...
3940115	2022-10-01T00:13:00	La Jolla	Pothole	Open	7456 VIA CAPRI, San Diego, CA 92037, USA	Pothole
3940116	2022-10-01T00:16:00	Scripps Miramar Ranch	Traffic Signal Timing	Open	10895 Hibert St, San Diego, CA 92131, USA	Is it possible to time this light sequentially...
...	...	...	...	...	...	...
3940819	2022-10-01T17:39:00	North Park	Other	Closed	3935 32nd St	Bike Theft Chop Shop Behind Starbucks on 32nd ...
3940876	2022-10-01T19:47:00	Skyline-Paradise Hills	Parking	Closed	7701-7899 Bloomfield Rd, San Diego, CA 92114, USA	Car has been parked there for many months. Exp...
3940900	2022-10-01T21:06:00	Downtown	Sidewalk Repair Issue	Closed	Petco Park	Safety hazard illegal vending
3940909	2022-10-01T21:44:00	Clairemont Mesa	Other	Closed	2810 Denver Street	Underage drinking party
3940924	2022-10-01T22:54:00	Clairemont Mesa	Other	Closed	5378 Jamestown Rd	People doing drugs in their car AGAIN. Please ...

	neighborhood	service	closed	open
0	Balboa Park	Dead Animal	46	0
1	Balboa Park	Development Services - Code Enforcement	2	0
2	Balboa Park	Encampment	1484	219
3	Balboa Park	Environmental Services Code Compliance	25	1
4	Balboa Park	Graffiti	977	0
...	...	...	...	...
1582	Via De La Valle	Parking	1	0
1583	Via De La Valle	Pavement Maintenance	0	1
1584	Via De La Valle	Pothole	9	1
1585	Via De La Valle	Stormwater Code Enforcement	3	0
1586	Via De La Valle	Street Light Maintenance	1	0

	neighborhood	service	closed	open	total
232	Downtown	Encampment	9262	2080	11342
452	Mid-City:City Heights	Illegal Dumping	9021	222	9243
1360	Southeastern San Diego	Illegal Dumping	5350	228	5578
1363	Southeastern San Diego	Parking	2780	614	3394
840	North Park	Parking	3059	224	3283
...	...	...	...	...	...
1428	Tijuana River Valley	Street Light Maintenance	0	1	1
1426	Tijuana River Valley	Sidewalk Repair Issue	1	0	1
1423	Tijuana River Valley	Pavement Maintenance	1	0	1
1422	Tijuana River Valley	Parks Issue	1	0	1
1586	Via De La Valle	Street Light Maintenance	1	0	1

	neighborhood	service	closed	open	total
30	Balboa Park	Weed Cleanup	23	0	23
61	Barrio Logan	Weed Cleanup	10	1	11
87	Black Mountain Ranch	Weed Cleanup	0	1	1
116	Carmel Mountain Ranch	Weed Cleanup	2	0	2
146	Carmel Valley	Weed Cleanup	6	1	7
...	...	...	...	...	...
1433	Tijuana River Valley	Weed Cleanup	2	0	2
1489	Torrey Hills	Weed Cleanup	1	0	1
1518	Torrey Pines	Weed Cleanup	10	7	17
1549	University	Weed Cleanup	53	10	63
1580	Uptown	Weed Cleanup	36	8	44

	neighborhood	service	closed	open	total
1383	Southeastern San Diego	Weed Cleanup	72	7	79
807	Navajo	Weed Cleanup	66	1	67
177	Clairemont Mesa	Weed Cleanup	55	11	66
1549	University	Weed Cleanup	53	10	63
1352	Skyline-Paradise Hills	Weed Cleanup	52	8	60
...	...	...	...	...	...
268	East Elliott	Weed Cleanup	1	0	1
309	Fairbanks Ranch Country Club	Weed Cleanup	1	0	1
1489	Torrey Hills	Weed Cleanup	1	0	1
87	Black Mountain Ranch	Weed Cleanup	0	1	1
746	Mission Beach	Weed Cleanup	1	0	1

Lecture 5 – DataFrames: Accessing, Sorting, and Querying¶

DSC 10, Fall 2022¶

Announcements¶

Agenda¶

Note:¶

DataFrames¶

pandas¶

But pandas is not so cute...¶

Enter babypandas!¶

DataFrames in babypandas 🐼¶

About the Data: Get It Done 👷¶

Reading data from a file 📖¶

Structure of a DataFrame¶

Setting a new index¶

Shape of a DataFrame¶

Annual summary of Get It Done requests¶

Example 1: Total requests¶

Finding total requests¶

Step 1 – Getting a column¶

Digression: Series¶

Step 2 – Getting another column¶

Step 3 – Calculating the total¶

Step 4 – Adding the totals to the DataFrame as a new column¶

Example 2: Analyzing requests¶

Questions¶

Example 3: What and where is the most frequently requested service?¶

Step 1 – Sorting the DataFrame¶

Step 1 – Sorting the DataFrame in descending order¶

Step 2 – Extracting the neighborhood and service¶

Example 4: Status of a request¶

Status of a request¶

Accessing using the row label¶

Activity 🚚¶

Summary of accessing a Series¶

Note¶

Reflection¶

Questions we can answer right now...¶

Questions we can't yet answer...¶

Example 6: Which neighborhood has the most 'Weed Cleanup' requests?¶

Selecting rows¶

The solution¶

Boolean indexing¶

Another example of element-wise comparison¶

Original Question: Which neighborhood has the most 'Weed Cleanup' requests?¶

What if the condition isn't satisfied?¶

Concept Check ✅ – Answer at cc.dsc10.com¶

Activity 🚘¶

Summary¶

Summary¶

Next time¶

`pandas`¶

But `pandas` is not so cute...¶

Enter `babypandas`!¶

DataFrames in `babypandas` 🐼¶

Example 6: Which neighborhood has the most `'Weed Cleanup'` requests?¶

Original Question: Which neighborhood has the most `'Weed Cleanup'` requests?¶

Concept Check ✅ – Answer at cc.dsc10.com ¶