1, File structure. data: store different period position data lib: some help function writen by myself resource: origin profile excel file, skills, displine, responsibility etc.
2, Important file analysis.py: this file contain the whole process from extract phrase to clustering according to tfidf value. main.py: extract company information and rank company according to fortune 500 company list human.py: generate university rank and summary information.