Final Project

This is official manual for your final project.
Please follow the instructions and specs below.
Keep your eyes on updates as there may be some changes in specification / scoring criteria in future.

Updates

(5/14) The due date has been extended to June 19.
(5/24) The official file and videos for the final project has been uploaded on ETL.
(5/24) Updates to the scoring criteria
(5/25) Materials for quantization has been uploaded.
(6/4) Materials for zero-skipping has been uploaded.
(6/9) Updates to specs : Latency of your floating point MAC must be set as 16 cycles.

Due date

With your report and project files: ~6/19
For now we do not consider accepting any delayed submission.

Optimizing your work

We suggest three different ways to optimize your work: Quantization, Zero-Skipping, and DMA(Direct Memory Access).


V0 (Baseline)	Korean	English	Files
Quantization	Korean	English	Files
Zero-skpping	Korean	English	Files
DMA	Korean	English	Files

1. Prepare your bitstream file

You need a bitstream file that you have generated with the block design that includes your IP.
You just have to replace the custom IP of the block design in lab10 with your MM(Matrix-Matrix) PE controller.
How the PE controller should be designed is explained here.

2. Boot your device with the bitstream file

Once you are prepared with the bistream file, rename it to "zynq.bit", and move it to the sdcard.
Insert the sdcard to the device and boot it.
How you can boot your device via minicom is explained here.

3. (Optional) Download the repository

※ This is optional since the source files are totally same as in lab09, except benchmark.sh.
You can therefore skip 3~6 and extend your work on lab09.

You need to download this repository to start your final project.

$ git clone https://github.com/tahsd/hsd21_project

Note that this command can be run on the terminal on your device if connected to the network.

4. (Optional) Check dependencies

Check if all the dependencies for running the codes have been installed.

$ sudo apt-get update -y
$ sudo apt-get install -y libprotobuf-dev protobuf-compiler python python-numpy

These would have already been installed on your device if you have successfully done your lab09.

5. (Optional) Download the dataset

Run the command below to download the dataset.

$ bash download.sh

6. (Optional) Modify the codes

You will see three functions (LargeMV & LargeMM & ConvLowering) that have not been implemented in the fpga_api.cpp & fpga_api_cpu.cpp.
You can fill in the codes for the functions, though, since you have already done it in the lab09.
Modify fpga_api.cpp & fpga_api_cpu.cpp based on your previous works.

7. Run it

Run the validation code as below.

sh benchmark.sh

Hopefully you will get 100% accuracy on the classfication task!

Specifications

Accuracy on the classification task with CNN should be 100%.
(Small degradation by quantization or zero-skipping will be allowed for particular cases.)
The PE controller should consist of (at most) 8x8 (=64) PEs.
The FSM should consist of 5 states: IDLE - LOAD - CALC - HARV - DONE
During HARV(harvest) state, the PE controller should write back the computed data to BRAM.
You are not bound to this approach for optimizing V0. That means, you can also utilize pipelining.
Latency of your floating point MAC must be set as 16 cycles.

Scoring Criteria

Well explained in the videos.

Implementation
Inference time

Total computation time spent by HW for V0, quantization, zero-skipping
Total data transfer time for DMA
Time spent by SW is not evaluted.

Accuracy
Report

Please use the Q&A board on ETL if you have questions or want more information about the project.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
build		build
data		data
include		include
pretrained_weights		pretrained_weights
proto		proto
src		src
Makefile		Makefile
README.md		README.md
benchmark.sh		benchmark.sh
download.sh		download.sh
eval.py		eval.py
models.py		models.py
models.pyc		models.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Final Project

Updates

Due date

Optimizing your work

1. Prepare your bitstream file

2. Boot your device with the bitstream file

3. (Optional) Download the repository

4. (Optional) Check dependencies

5. (Optional) Download the dataset

6. (Optional) Modify the codes

7. Run it

Specifications

Scoring Criteria

About

Releases

Packages

Languages

tahsd/hsd21_project

Folders and files

Latest commit

History

Repository files navigation

Final Project

Updates

Due date

Optimizing your work

1. Prepare your bitstream file

2. Boot your device with the bitstream file

3. (Optional) Download the repository

4. (Optional) Check dependencies

5. (Optional) Download the dataset

6. (Optional) Modify the codes

7. Run it

Specifications

Scoring Criteria

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages