Dict
In addition to lists, another more comprehensive method for storing complex data are dicts, or dictionaries. In the example below, we associate a key
(e.g. 'taq') to a value
(e.g. 'karim').
dict1 = {
'taq': 'karim',
'apple': 35,
False: 87.96,
35: 'dog',
'tree': True,
47: 92,
# etc.
}
print(dict1) # {'taq': 'karim', 'apple': 35, False: 87.96, 35: 'dog', 'tree': True, 47: 92}
The values
in a dict can be any valid Python data type, but there are some restrictions on what you can use as keys
. Keys CAN be strings, integers, floats, booleans, and tuples. Keys CANNOT be lists or dicts. Do you see the pattern here? The data in a dict key must be immutable. Since lists and dicts are mutable, they cannot be used as keys in a dict.
NOTE! The keys in a dict must be unique as well. Be careful not to add a key to a dict a second time. If you do, the second item will override the first item. For instance, if you upload data from a .csv file into a dict, it would be better to create a new dict first, then compare the two to check for identical keys and make any adjustments necessary.
One last thing before we move past the nitty gritty -- the keys and values of a single dict don't have to be homogenous. In other words, you can mix and match different key, value, and key value pair data types within one dict as seen above.
Creating Dicts
There are several ways you can create your dict
, but we'll go through the most basic ones here.
1. The simplest is to create an empty list with the dict()
method.
students = dict() # this creates a new, empty dict
2. You can create a dict by passing in key value pairs directly using this syntax:
food_groups = {
'pomegranate': 'fruit',
'asparagus': 'vegetable',
'goat cheese': 'dairy',
'walnut': 'legume'
}
3. You can also convert a list of tuples into a dict using dict()
...
# list of tuples
listofTuples = [("Hello" , 7), ("hi" , 10), ("there" , 45),("at" , 23),("this" , 77)]
wordFrequency = dict(listofTuples)
print(wordFrequency) # {'this': 77, 'there': 45, 'hi': 10, 'at': 23, 'Hello': 7}
4. ...and even combine two lists to create a dict by using the zip()
method.
The zip()
method takes the name of each list as parameters - the first list will become the dict's keys, and the second list will become the dict's values. NOTE! This only works if you're sure the key value pairs have the same index position in their original lists (so they will match in the dict).
names = ['Taq', 'Zola', 'Valerie', 'Valerie']
scores = [[98, 89, 92, 94], [86, 45, 98, 100], [100, 100, 100, 100], [76, 79, 80, 82]]
grades = dict(zip(names,scores))
print(grades) # {'Taq': [98, 89, 92, 94], 'Zola': [86, 45, 98, 100], 'Valerie': [76, 79, 80, 82]}
Accessing Dict Data
Once you've stored data in your dict, you'll need to be able to get back in and access it! Take a look at this dict holding state capitals.
state_capitals = {
'NY': 'Albany',
'NJ': 'Trenton',
'CT': 'Hartford',
'MA': 'Boston'
}
We can access each value in the list by referencing its key like so:
MAcap = state_capitals['MA']
print('The capital of MA is {}.'.format(MAcap)) # 'The capital of MA is Boston.'
Attempting to find a key that does not exist leads to error. You also can't access dict items with index numbers like you do with lists! If you try, you will get a KeyError
- because an index number does not function like a dict key.
print(state_capitals['PA']) # KeyError from missing key
print(state_capitals[2]) # KeyError from index reference
Instead, it's better to look up a key in a dict using .get(key, [])
. The .get()
method takes the key argument just as above EXCEPT it allows you to enter some default value it should return if the key you enter does not exist. Usually, we use []
as that value.
print(state_capitals.get('PA', []))
# PA is not in our dict, so .get() returns []
Now, this dict has 4 keys, but what if it had hundreds? We can retrieve data from large dicts using .keys()
, .values()
, or .items()
.
pets_owned = {
'Taq': ['teacup pig','cat','cat'],
'Francesca': ['llama','horse','dog'],
'Walter': ['ferret','iguana'],
'Caleb': ['dog','rabbit','parakeet']
}
pets.keys() # ['Taq', 'Francesca', 'Walter', 'Caleb']
pets.values() # [['teacup pig','cat','cat'], ['dog','rabbit','parakeet'], etc ]
pets.items() # [('Taq', ['teacup pig','cat','cat']), ('Francesca', [['llama','horse','dog']), etc]
Built-in Operators for Manipulating Dicts
Just like lists, you can edit, analyze, and format your dicts. Some work the same for dicts and lists such as len()
. However, adding, deleting, and updating data requires a little more detail for dicts than for lists.
Add or Edit Dict Items
We can add a single item to a dict...
state_capitals = {
'NY': 'Albany',
'NJ': 'Trenton',
'CT': 'Hartford',
'MA': 'Boston'
}
state_capitals['CA'] = 'Sacramento'
print(state_capitals) # {'NY': 'Albany', 'NJ': 'Trenton', 'CT': 'Hartford', 'MA': 'Boston', 'CA': 'Sacramento'}
...but more likely you'll want to make bulk updates to save yourself time. To do so, you can use the .update()
method to add one or more items to the dict. NOTE!: It's easy to accidentally override items when you're merging datasets. Don't worry though - we'll learn an easy way to check for duplicate keys in the next section.
state_capitals = {
'NY': 'Albany',
'NJ': 'Trenton',
'CT': 'Hartford',
'MA': 'Boston',
'CA': 'Sacramento'
}
more_states = {
'WA': 'Olympia',
'OR': 'Salem',
'TX': 'Austin',
'NJ': 'Hoboken',
'AZ': 'Phoenix',
'GA': 'Atlanta'
}
state_capitals.update(more_states)
state_capitals = {
'NY': 'Albany',
'NJ': 'Hoboken',
'CT': 'Hartford',
'MA': 'Boston',
'CA': 'Sacramento',
'WA': 'Olympia',
'OR': 'Salem',
'TX': 'Austin',
'AZ': 'Phoenix',
'GA': 'Atlanta'
}
Notice something? It's easy to accidentally override items when you're merging datasets. Oops, we just changed the capital of NJ to Hoboken! Don't worry though - we'll learn an easy way to check for duplicate keys in the next section.
Remove Items from a Dict
.clear()
simply empties the dict of all items.
.pop()
:
This removes an item, which you must specify by key. There are two things to note here -
First, you cannot delete a dict item by specifying a value. Since values do not have to be unique the way keys are, trying to delete items by referencing values could cause issues.
Second, just like we saw earlier with .get(key, value)
, .pop(key, value)
will raise a KeyError
if you try to remove a key that does not exist in the dict. We avoid this in the same way, by setting a default value - typically []
- for the program to return in case of a missing key.
Unfortunately, you can't use the same method as we did for .update()
to delete larger portions of data. We'll learn a way to do that in the next section.
state_capitals.pop('AZ', [])
# removes 'AZ': 'Phoenix' from our dict
popitem()
:
This one just removes an arbitrary key value pair from dict and returns it as a tuple.
seceded1 = state_capitals.popitem()
# ^ removes a random item and returns it as a tuple
print(seceded1) # ('GA': 'Atlanta') for example