My homework for NTU CommE 5052 Deep Learning for Computer Vision (taught by Prof. Frank Wang). See the README.md or report.pdf inside each folder for more details.
- demo_digit_recognition
- MNIST handwritten digit recognition
- for PyTorch and CUDA testing (see the sanity-check sketch below)
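
A minimal sketch of the kind of PyTorch/CUDA sanity check this demo covers; the actual MNIST recognizer (model and training loop) lives in the folder itself.

```python
# Minimal sketch of a PyTorch + CUDA sanity check (not the actual MNIST recognizer).
import torch

print("torch", torch.__version__, "| CUDA available:", torch.cuda.is_available())
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
x = torch.randn(4, 1, 28, 28, device=device)  # dummy MNIST-shaped batch
print("dummy batch lives on", x.device)
```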
- hw1_face_recognition
- face images of 40 subjects, 10 grayscale images per subject, each of size (56, 46) pixels
- cv2 & matplotlib (grayscale), MSE, PCA, reconstruction, k-NN (see the sketch below)
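
A minimal sketch of this PCA-reconstruction + k-NN pipeline using scikit-learn; the (56, 46) image size follows the item above, while the random arrays, train/test split, and number of components are illustrative assumptions.

```python
# Minimal sketch: PCA reconstruction + k-NN on (56, 46) grayscale face images.
# The random arrays, split sizes, and n_components are illustrative assumptions.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.metrics import mean_squared_error
from sklearn.neighbors import KNeighborsClassifier

# X_*: flattened faces (n_samples, 56 * 46); y_*: subject ids (dummy stand-ins)
X_train, y_train = np.random.rand(280, 56 * 46), np.repeat(np.arange(40), 7)
X_test, y_test = np.random.rand(120, 56 * 46), np.repeat(np.arange(40), 3)

pca = PCA(n_components=100).fit(X_train)      # principal components from training faces
Z_test = pca.transform(X_test)                # project onto the low-dimensional basis
X_rec = pca.inverse_transform(Z_test)         # reconstruct images from the projection
print("reconstruction MSE:", mean_squared_error(X_test, X_rec))

knn = KNeighborsClassifier(n_neighbors=1).fit(pca.transform(X_train), y_train)
print("k-NN accuracy:", knn.score(Z_test, y_test))
```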
- hw1_image_classification
- 4 categories (classes), 500 RGB images per category, each of size (64, 64, 3) pixels
- cv2 & matplotlib (RGB), Bag of Words (BoW), patches, k-means, PCA, scatterPlot3D, soft-max (max pooling), k-NN (see the sketch below)
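
A minimal sketch of the BoW idea above: a k-means visual vocabulary over image patches, a max-pooled soft-max assignment per image, and k-NN on the pooled features. The patch size, vocabulary size, and dummy data are assumptions rather than the homework's exact settings.

```python
# Minimal sketch: Bag of Words over image patches with k-means visual words,
# soft-max assignment, max pooling, and k-NN. All sizes here are assumptions.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.neighbors import KNeighborsClassifier

def extract_patches(img, size=16):
    """Split a (64, 64, 3) image into non-overlapping flattened patches."""
    return np.array([img[i:i + size, j:j + size].reshape(-1)
                     for i in range(0, 64, size) for j in range(0, 64, size)])

def bow_feature(img, kmeans, alpha=10.0):
    """Soft-assign each patch to the visual words, then max-pool over patches."""
    logits = -alpha * kmeans.transform(extract_patches(img))   # -alpha * distances
    soft = np.exp(logits - logits.max(axis=1, keepdims=True))
    soft /= soft.sum(axis=1, keepdims=True)                    # soft-max over words
    return soft.max(axis=0)                                    # max pooling over patches

imgs = np.random.rand(200, 64, 64, 3)            # small dummy stand-in for 4 x 500 images
labels = np.repeat(np.arange(4), 50)
vocab = KMeans(n_clusters=15, n_init=10).fit(    # visual vocabulary from a patch subset
    np.vstack([extract_patches(im) for im in imgs[::10]]))
feats = np.array([bow_feature(im, vocab) for im in imgs])
knn = KNeighborsClassifier(n_neighbors=5).fit(feats, labels)
```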
- hw2_YOLOv1_object_detection
- https://github.com/dlcv-spring-2019/hw2-shannon112
- YOLOv1: Redmon, Joseph, et al. "You only look once: Unified, real-time object detection." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.
- trained on DOTA-v1.5 aerial images
- YOLO loss, vgg16_bn + linear model (see the sketch below)
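
A rough sketch of a vgg16_bn backbone with a linear head in the spirit of YOLOv1's S x S x (B*5 + C) output grid; the grid size, box count, class count, and 224-pixel input are assumptions, not the homework's exact settings (it also assumes a recent torchvision with the `weights=` argument).

```python
# Rough sketch: vgg16_bn features + linear layers predicting a YOLOv1-style grid.
# S, B, C, and the 224x224 input are illustrative assumptions.
import torch
import torch.nn as nn
from torchvision import models

class YoloV1VGG(nn.Module):
    def __init__(self, S=7, B=2, C=16):
        super().__init__()
        self.S, self.B, self.C = S, B, C
        self.backbone = models.vgg16_bn(weights=None).features  # conv feature extractor
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(512 * 7 * 7, 4096), nn.LeakyReLU(0.1),
            nn.Linear(4096, S * S * (B * 5 + C)),                # per-cell boxes + classes
        )

    def forward(self, x):                    # x: (N, 3, 224, 224)
        out = self.head(self.backbone(x))
        return out.view(-1, self.S, self.S, self.B * 5 + self.C)

pred = YoloV1VGG()(torch.randn(1, 3, 224, 224))   # -> (1, 7, 7, 26)
```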
- hw3_dcgan_acgan_dann
- GAN [1], DCGAN [2], ACGAN [3], DANN [4], GTA [5], tSNE plot (see the DANN gradient-reversal sketch after the references)
- [1] Goodfellow, Ian, et al. "Generative adversarial nets." Advances in Neural Information Processing Systems. 2014.
- [2] Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." arXiv preprint arXiv:1511.06434 (2015).
- [3] Odena, Augustus, Christopher Olah, and Jonathon Shlens. "Conditional image synthesis with auxiliary classifier GANs." Proceedings of the 34th International Conference on Machine Learning, Volume 70. JMLR.org, 2017.
- [4] Ganin, Yaroslav, et al. "Domain-adversarial training of neural networks." The Journal of Machine Learning Research 17.1 (2016): 2096-2030.
- [5] Sankaranarayanan, Swami, et al. "Generate to adapt: Aligning domains using generative adversarial networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.
- trained on the USPS (28, 28), MNIST-M (28, 28, 3), and SVHN (28, 28, 3) datasets
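
A minimal sketch of the gradient reversal layer at the heart of DANN [4]: identity on the forward pass, negated and scaled gradient on the backward pass. The feature tensor, domain classifier, and lambda value below are placeholders.

```python
# Minimal sketch of DANN's gradient reversal layer (GRL): identity forward,
# gradient multiplied by -lambda backward. Shapes below are placeholders.
import torch

class GradReverse(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambd * grad_output, None    # reverse (and scale) the gradient

def grad_reverse(x, lambd=1.0):
    return GradReverse.apply(x, lambd)

# Features go to the label classifier directly and to the domain classifier
# through grad_reverse, so the feature extractor learns domain-invariant features.
feat = torch.randn(8, 128, requires_grad=True)
domain_logits = torch.nn.Linear(128, 2)(grad_reverse(feat, lambd=0.5))
```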
- hw4_rnn_action_recognition
- https://github.com/dlcv-spring-2019/hw4-shannon112
- pre-trained ResNet-50 + linear, LSTM, pack_padding, seq2seq action recognition, tSNE plot (see the sketch below)
- trained on 37 full-length videos (each 5-20 mins at 24 fps) and 4151 trimmed videos (each 5-20 secs at 24 fps), 11 action classes
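
A minimal sketch of feeding variable-length per-frame features to an LSTM with pack_padded_sequence for the 11-class trimmed-video task; the 2048-d feature size matches ResNet-50's pooled output, while the hidden size and dummy batch are assumptions.

```python
# Minimal sketch: variable-length sequences of per-frame ResNet-50 features (2048-d)
# packed with pack_padded_sequence and classified into 11 actions by an LSTM.
# Hidden size and the dummy batch are illustrative assumptions.
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_sequence

class LSTMActionClassifier(nn.Module):
    def __init__(self, feat_dim=2048, hidden=512, n_classes=11):
        super().__init__()
        self.lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.fc = nn.Linear(hidden, n_classes)

    def forward(self, feats, lengths):
        packed = pack_padded_sequence(feats, lengths, batch_first=True,
                                      enforce_sorted=False)
        _, (h_n, _) = self.lstm(packed)      # last hidden state of each sequence
        return self.fc(h_n[-1])              # (batch, 11) action logits

# Dummy batch: two trimmed clips with 120 and 80 frames of 2048-d features.
seqs = [torch.randn(120, 2048), torch.randn(80, 2048)]
logits = LSTMActionClassifier()(pad_sequence(seqs, batch_first=True),
                                torch.tensor([120, 80]))
```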
- DLCV_Final
- https://github.com/dlcv-spring-2019/final-ShiaGiBaLuanTrain
- Cast Search by Portrait Challenge
- face recognition (dlib), person re-id (ResNet-50) (see the sketch below)
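
A minimal sketch of the person re-id half: ResNet-50 embeddings (fc layer removed) ranked by cosine similarity against a gallery of candidate crops. The dlib face-recognition stage is omitted and all tensors are dummy placeholders.

```python
# Minimal sketch: ResNet-50 with the fc layer removed as a re-id embedder, ranking
# gallery crops for a query cast portrait by cosine similarity. Dummy inputs only.
import torch
import torch.nn.functional as F
from torchvision import models

resnet = models.resnet50(weights=None)
resnet.fc = torch.nn.Identity()              # keep the 2048-d pooled feature
resnet.eval()

@torch.no_grad()
def embed(imgs):                             # imgs: (N, 3, 224, 224), normalized
    return F.normalize(resnet(imgs), dim=1)  # unit-length 2048-d embeddings

query = embed(torch.randn(1, 3, 224, 224))       # one cast portrait
gallery = embed(torch.randn(10, 3, 224, 224))    # candidate person crops
ranking = (query @ gallery.T).argsort(descending=True)   # most similar first
```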