#!./env/bin/python
import sys
################################
########### WEEK 1 #############
################################
# Problem statement: Bring the corpus into the required format (preprocessing):
#   word1_tag
#   word2_tag
# Skip corpus preprocessing when --no-train is passed on the command line.
if "--no-train" not in sys.argv:
    import tag_words
################################
############ WEEK 2 ############
################################
# Problem statement: Create a dictionary with an entry for each unique word+tag combination and its
# frequency count in the corpus.
import word_tag_frequency
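# Illustrative sketch only (not the code in word_tag_frequency): assuming the
# preprocessed corpus is an iterable of "word_tag" strings, the frequency
# dictionary could be built as below. The name count_word_tags is hypothetical.
from collections import Counter

def count_word_tags(tokens):
    """Count each (word, tag) pair; the tag is the text after the last '_'."""
    counts = Counter()
    for token in tokens:
        word, sep, tag = token.strip().rpartition("_")
        if sep:  # skip entries that have no '_' separator
            counts[(word, tag)] += 1
    return counts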
################################
############ WEEK 3 ############
################################
# Problem statement: Report the 10 most frequent words and the 10 most frequent tags. Provide your analysis of
# the word and tag distribution in the corpus.
# Word frequency Preprocessing
# Unique word map
import word_tag_statistics
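# Illustrative sketch only (not the code in word_tag_statistics): assuming a
# mapping of (word, tag) -> count, the most frequent words and tags could be
# derived as below. The name top_words_and_tags is hypothetical.
from collections import Counter

def top_words_and_tags(pair_counts, n=10):
    """Return (top_words, top_tags), each a list of (item, count) pairs."""
    word_counts, tag_counts = Counter(), Counter()
    for (word, tag), count in pair_counts.items():
        word_counts[word] += count
        tag_counts[tag] += count
    return word_counts.most_common(n), tag_counts.most_common(n)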
################################
############ WEEK 4 ############
################################
# Problem statement: For each word, compute the probabilities of its association with each tag. The program
# should be able to display the probability of each word given the tag for the training corpus.
import training
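# Illustrative sketch only (not the code in training): assuming a mapping of
# (word, tag) -> count, P(word | tag) can be estimated by relative frequency,
# count(word, tag) / count(tag), with no smoothing. The name
# emission_probabilities is hypothetical.
from collections import Counter

def emission_probabilities(pair_counts):
    """Return a dict mapping (word, tag) to the estimate of P(word | tag)."""
    tag_totals = Counter()
    for (_, tag), count in pair_counts.items():
        tag_totals[tag] += count
    return {(word, tag): count / tag_totals[tag]
            for (word, tag), count in pair_counts.items()}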
################################
############ WEEK 5 ############
################################
# Problem statement: Predict the new tags for the words in the test corpus.
import prediction
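# Illustrative sketch only; the method used by prediction is not shown in this
# file. One simple baseline, assuming (word, tag) -> count from training: give
# each test word the tag it was seen with most often, and fall back to a default
# tag for unseen words. The name predict_tags and the default "NN" are hypothetical.
def predict_tags(words, pair_counts, default_tag="NN"):
    """Return one predicted tag per word, in the same order as words."""
    best = {}  # word -> (tag, count) with the highest count seen so far
    for (word, tag), count in pair_counts.items():
        if word not in best or count > best[word][1]:
            best[word] = (tag, count)
    return [best[w][0] if w in best else default_tag for w in words]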
################################
############ WEEK 6 ############
################################
# Problem statement: Generate a confusion matrix for the word-tag pairs.
import confusion
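# Illustrative sketch only (not the code in confusion): assuming parallel lists
# of gold and predicted tags, a confusion matrix can be kept as nested counts
# indexed as matrix[gold][predicted]. The name confusion_matrix is hypothetical.
from collections import defaultdict

def confusion_matrix(gold_tags, predicted_tags):
    """Count how often each gold tag was predicted as each tag."""
    matrix = defaultdict(lambda: defaultdict(int))
    for gold, predicted in zip(gold_tags, predicted_tags):
        matrix[gold][predicted] += 1
    return matrix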