-
Notifications
You must be signed in to change notification settings - Fork 5.6k
2017 07 12
Pthon/C++ interface
- https://github.com/PaddlePaddle/Paddle/issues/2750
- https://github.com/PaddlePaddle/Paddle/pull/2759#discussion_r125996741
Operator
- https://github.com/PaddlePaddle/Paddle/pull/2725#discussion_r126044899
- https://github.com/PaddlePaddle/Paddle/pull/2775#pullrequestreview-48655409
- Add Go testing into cmake (CI will now run Go unit tests): https://github.com/PaddlePaddle/Paddle/pull/2811
- Improve Go cmake: https://github.com/PaddlePaddle/Paddle/pull/2832
- Review:
- Paddle Cloud:
- cmake:
- Fix TeamCity CI.
- Buddy Memory Allocator Pull Request
- Buddy Allocator Documentation
- CMAKE Generic Documentation
Others:
- go_binary: remove hardcoded library link path, add pserver client test #2832
- FIX: add -lrt for link #2823
- FIX: explicitly specify glog install path #2763
-
Paddle Cloud:
- ISSUE: https://github.com/PaddlePaddle/Paddle/issues/2752
- Add Go pre-commit support:https://github.com/PaddlePaddle/cloud/pull/204
- Add task fail interface: https://github.com/PaddlePaddle/Paddle/pull/2719
-
PaddlePaddle:
-
refine machine translation models and fix the problem that training process goes to NaN or explosion.
-
modifications to recurrent layer group to output attention weights for each generated sequence in each time step during beam search.
- this feature is required by OCR team.
- the codes are finished and under test with colleagues from OCR. I will create a PR later.
- I am also writing an example to show how to use this feature to Paddle models, but some bugs of V2 APIs are found and I haven't fixed them yet, that this feature cannot be used in V2 API currently.
- recurrent_group cannot return more than one layer as its outputs: https://github.com/PaddlePaddle/Paddle/issues/2834
- beam search in the recurrent group gives wrong results by using multi-threads: https://github.com/PaddlePaddle/Paddle/issues/2836
- RecurrentOp: implement SegmentInput and ConcatOutput function.
- fix bug:
- pybind compiler question in CentOS: https://github.com/PaddlePaddle/Paddle/pull/2819
- mistaken: Folk -> Fork: https://github.com/PaddlePaddle/Paddle/pull/2768
- MKL-DNN: find two bugs, create issues, all have been resolved.
- make fails in master branch: https://github.com/01org/mkl-dnn/issues/87
- test_reorder fails in v0.9: https://github.com/01org/mkl-dnn/issues/88
- code review:
- wechat PaddlePaddle: 279 fans -> 324 fans
-
Expose paddle.framework C++ --> Python
- Give Cython+C-API and PyBind11 as demo, we dicided use PyBind11.
- #2793
-
Refine OpRegistry.
- #2782
- Defined static variable in .cc
- Refined C++ syntax
- Fix static variable init order problems
-
Fix slow parsing a recursive depends topolgy in
trainer_config_helper
-
Refine CUDA Related Libraries.
- Fix compile error in cuda.h
- #2806
-
Define the interface about
OpWithKernel
-
[WIP] add a sample op,
add_op
-
[WIP] Default scope function in Python
- 2808
- Need Review
-
[TODO] Generate Python OpCreation Code
-
[TODO] Python Model concept.
- finish op_registry development
- add interfaces for tensor and renew its implementation
- discuss Op design
Operator --> OpKernel --> Tensor/DeviceContext --> Eigen
Operator
- review Operator/OpKernel design
- https://github.com/PaddlePaddle/Paddle/pull/2725#discussion_r126335399
DeviceContext
- Refine DeviceContext
- https://github.com/PaddlePaddle/Paddle/pull/2814
Tensor and Eigen
- Add method converting Tensor to Eigen TensorMap
- https://github.com/PaddlePaddle/Paddle/pull/2805
Some fix
- Image
- Add pixel softmax layer for FCN model
- Add voc2012 dataset for image segment
- Add pixel cross entropy cost layer.#2840(WIP)
- Add FCN model. (WIP)
- Paddle cloud:
- new pserver fixes: https://github.com/PaddlePaddle/Paddle/pull/2774
- Notebook update https://github.com/PaddlePaddle/cloud/pull/210
- Fault tolerant on paddle cloud https://github.com/PaddlePaddle/cloud/pull/212
- Job termination: https://github.com/PaddlePaddle/cloud/pull/206
- Survey on https://github.com/PaddlePaddle/Paddle/wiki/Comparison-of-Distributed-Training-Implementation
- Meet mellanox, talk about RDMA in deep learning
- Review https://github.com/PaddlePaddle/Paddle/pull/2674#discussion_r127118366
- MPI:
- reduce job package size by auto removing cuda package when non-GPU training
- DS2
- Prepare VoxForge dataset
- Compress LM model
- Track reason of float overflow
- SSD
- Performance validation
https://github.com/PaddlePaddle/models/issues/136
- Performance validation
- Other
- Refine ConvShiftLayer
https://github.com/PaddlePaddle/Paddle/pull/2133 - Code Review
https://github.com/PaddlePaddle/models/pull/154
- Refine ConvShiftLayer
- Code Review:
- VOC dataset https://github.com/PaddlePaddle/Paddle/pull/2785
- PaddlePaddle Image Enhancement
- Faster R-CNN: add Region Proposal Network https://github.com/PaddlePaddle/Paddle/pull/2828
- Add OperatorBase
- OpKernel Design
- OpKernel test
- Add Init for operator
- Add a sample op,
add_op
- OperatorWithKernel
- refine device_context
- Fix slow parsing a recursive depends topology
- Rnn Opeartor
- DS2
- Prepare CHiME-test dataset for evaluating noisy speech (https://github.com/PaddlePaddle/models/pull/157)
- Experimenting with data augmentation.
- Models
- Polish model "machine translation with external memory (neural turning machine)" (https://github.com/PaddlePaddle/models/pull/36)
- RL
- Play with OpenAI-Gym and implement Tabular Q-learning agents.
-
paddle cloud develop
- continue revise pserver saving checkpoint https://github.com/PaddlePaddle/Paddle/pull/2716
-
paddle cloud fix
- pserver Serivce error https://github.com/PaddlePaddle/Paddle/pull/2839
- [WIP] pserver return error https://github.com/PaddlePaddle/Paddle/pull/2702
-
paddle cloud reviews
- pserver recover from checkpoint https://github.com/PaddlePaddle/Paddle/pull/2741
- add go testing to cmake https://github.com/PaddlePaddle/Paddle/pull/2811
- add pserver client test to go_binary https://github.com/PaddlePaddle/Paddle/pull/2832
-
paddle refactoring develop
- network implement https://github.com/PaddlePaddle/Paddle/pull/2730
- [WIP] look into the autograd, caffe2 IR module. add GradientOperator to Network Implement.
-
paddle refactoring reviews
- rnn op implement https://github.com/PaddlePaddle/Paddle/pull/2775
DS2:
-
Refine CTC beam search decoder in C++: performance evaluation, adding the support of loading language model in trie tree.
-
Distribute the language model
-
Improve parameters tuning
Fixed two issues of convolution calculation performance last week. At present, the inference of face model with Paddle can reach 400ms, better than their own implementation(750ms).
Convolution Reconstruction and Mobile Optimization
Review
- https://github.com/PaddlePaddle/Paddle/pull/2709#pullrequestreview-48091788
- https://github.com/PaddlePaddle/Paddle/pull/2776#pullrequestreview-48798896
- https://github.com/PaddlePaddle/Paddle/pull/2776#pullrequestreview-49188400
- https://github.com/PaddlePaddle/Paddle/pull/2776#pullrequestreview-49666837
- PaddlePaddle Refactoring
- implement recurrent group operator
- Issue:
- Whether the OperatorBase needs an initial interface?
- The usage of parent scope in the scope design. #2781
- Code Review
- pr
- mobilenet gpu acceleration https://github.com/PaddlePaddle/Paddle/pull/2776
- in process
- mobilent cpu acceleration.
- Fault Tolerant
- PServer recovery from checkpint file, https://github.com/PaddlePaddle/Paddle/pull/2741
- PR review: https://github.com/PaddlePaddle/Paddle/pull/2719
- Integration test for fault tolerant
- PaddleCloud
- DLNEL index page: https://github.com/PaddlePaddle/cloud/pull/194
- Bug fix: https://github.com/PaddlePaddle/cloud/pull/205
- Bug fix: https://github.com/PaddlePaddle/cloud/pull/202
-
paddle models
- DSSM (Deep Semantic Similarity Model)
- fix first PR
- DSSM (Deep Semantic Similarity Model)
-
paddle refactor
-
RecurrentOp with @qingqing and @luotao
- forward nearly finished
- reviews
-
RecurrentOp with @qingqing and @luotao
-
VisualDL with @longfei @yongfeng
- had a discussion with FE
- Compile Arm‘s ComputeLibrary, and try the calling of gemm implementation
- Benchmark several computing libraries benchmark
- Code review:
- auto pruning: https://github.com/PaddlePaddle/Paddle/pull/2603
- android compiling for armv8: https://github.com/PaddlePaddle/Paddle/pull/2816