-
Notifications
You must be signed in to change notification settings - Fork 0
/
get_pet_labels.py
77 lines (66 loc) · 3.32 KB
/
get_pet_labels.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
# */AIPND-revision/intropyproject-classify-pet-images/get_pet_labels.py
#
# PROGRAMMER: Paschal Ugwu
# DATE CREATED: 24/07/2024
# REVISED DATE: 24/07/2024
# PURPOSE: Create the function get_pet_labels that creates the pet labels from
# the image's filename. This function inputs:
# - The Image Folder as image_dir within get_pet_labels function and
# as in_arg.dir for the function call within the main function.
# This function creates and returns the results dictionary as results_dic
# within get_pet_labels function and as results within main.
# The results_dic dictionary has a 'key' that's the image filename and
# a 'value' that's a list. This list will contain the following item
# at index 0 : pet image label (string).
#
# Imports python modules
from os import listdir
def get_pet_labels(image_dir):
"""
Creates a dictionary of pet labels (results_dic) based upon the filenames
of the image files. These pet image labels are used to check the accuracy
of the labels that are returned by the classifier function, since the
filenames of the images contain the true identity of the pet in the image.
Be sure to format the pet labels so that they are in all lower case letters
and with leading and trailing whitespace characters stripped from them.
(ex. filename = 'Boston_terrier_02259.jpg' Pet label = 'boston terrier')
Parameters:
image_dir - The (full) path to the folder of images that are to be
classified by the classifier function (string)
Returns:
results_dic - Dictionary with 'key' as image filename and 'value' as a
List. The list contains the following item:
index 0 = pet image label (string)
"""
# Creates list of files in directory
in_files = listdir(image_dir)
# Creates empty dictionary for the results (pet labels, etc.)
results_dic = dict()
# Processes through each file in the directory, extracting only the words
# of the file that contain the pet image label
for idx in range(0, len(in_files), 1):
# Skips file if it starts with . (like .DS_Store of Mac OSX) because it
# isn't a pet image file
if in_files[idx][0] != ".":
# Creates temporary label variable to hold pet label name extracted
pet_label = ""
# Extracts the pet image label from the filename
# Filename: 'Boston_terrier_02259.jpg'
# Split by '_', extract words, join by space, convert to lowercase
words = in_files[idx].split('_')
for word in words:
if word.isalpha():
pet_label += word.lower() + " "
pet_label = pet_label.strip()
# If filename doesn't already exist in dictionary add it and its
# pet label - otherwise print an error message because indicates
# duplicate files (filenames)
if in_files[idx] not in results_dic:
results_dic[in_files[idx]] = [pet_label]
else:
print("** Warning: Duplicate files exist in directory:",
in_files[idx])
# Returns the results_dic dictionary that you created
return results_dic