[skip tests] [skip docs] start joss paper draft
avik-pal committed Oct 3, 2023
1 parent 1ee7e66 commit 1aa317f
Showing 3 changed files with 166 additions and 0 deletions.
23 changes: 23 additions & 0 deletions .github/workflows/JOSSPaper.yml
@@ -0,0 +1,23 @@
on: [push]

jobs:
  paper:
    runs-on: ubuntu-latest
    name: Paper Draft
    steps:
      - name: Checkout
        uses: actions/checkout@v3
      - name: Build draft PDF
        uses: openjournals/openjournals-draft-action@master
        with:
          journal: joss
          # This should be the path to the paper within your repo.
          paper-path: joss/paper.md
      - name: Upload
        uses: actions/upload-artifact@v1
        with:
          name: paper
          # This is the output path where Pandoc will write the compiled
          # PDF. Note, this should be the same directory as the input
          # paper.md
          path: joss/paper.pdf
71 changes: 71 additions & 0 deletions joss/paper.bib
@@ -0,0 +1,71 @@
@article{bai2019deep,
  title = {Deep equilibrium models},
  author = {Bai, Shaojie and Kolter, J Zico and Koltun, Vladlen},
  journal = {arXiv preprint arXiv:1909.01377},
  year = {2019}
}

@inproceedings{enzyme:2020,
  author = {Moses, William and Churavy, Valentin},
  booktitle = {Advances in Neural Information Processing Systems},
  editor = {H. Larochelle and M. Ranzato and R. Hadsell and M. F. Balcan and H. Lin},
  pages = {12472--12485},
  publisher = {Curran Associates, Inc.},
  title = {Instead of Rewriting Foreign Code for Machine Learning, Automatically Synthesize Fast Gradients},
  url = {https://proceedings.neurips.cc/paper/2020/file/9332c513ef44b682e9347822c2e457ac-Paper.pdf},
  volume = {33},
  year = {2020}
}

@inproceedings{enzyme:2021,
  author = {Moses, William S. and Churavy, Valentin and Paehler, Ludger and H\"{u}ckelheim, Jan and Narayanan, Sri Hari Krishna and Schanen, Michel and Doerfert, Johannes},
  title = {Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme},
  year = {2021},
  isbn = {9781450384421},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  url = {https://doi.org/10.1145/3458817.3476165},
  doi = {10.1145/3458817.3476165},
  booktitle = {Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis},
  articleno = {61},
  numpages = {16},
  keywords = {CUDA, LLVM, ROCm, HPC, AD, GPU, automatic differentiation},
  location = {St. Louis, Missouri},
  series = {SC '21}
}

@misc{innes2018fashionable,
  title = {Fashionable Modelling with Flux},
  author = {Michael Innes and Elliot Saba and Keno Fischer and Dhairya Gandhi and Marco Concetto Rudilosso and Neethu Mariya Joy and Tejan Karmali and Avik Pal and Viral Shah},
  year = {2018},
  eprint = {1811.01457},
  archiveprefix = {arXiv},
  primaryclass = {cs.PL}
}

@misc{pal2023continuous,
  title = {Continuous Deep Equilibrium Models: Training Neural ODEs Faster by Integrating Them to Infinity},
  author = {Avik Pal and Alan Edelman and Christopher Rackauckas},
  year = {2022},
  eprint = {2201.12240},
  archiveprefix = {arXiv},
  primaryclass = {cs.LG}
}

@misc{simplechains,
  author = {Elrod, Chris},
  title = {SimpleChains.jl},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  url = {https://github.com/PumasAI/SimpleChains.jl}
}

@inproceedings{yuret2016knet,
  title = {Knet: beginning deep learning with 100 lines of julia},
  author = {Yuret, Deniz},
  booktitle = {Machine Learning Systems Workshop at NIPS},
  volume = {2016},
  pages = {5},
  year = {2016}
}
72 changes: 72 additions & 0 deletions joss/paper.md
@@ -0,0 +1,72 @@
---
title: 'Lux.jl: Bridging Scientific Computing & Deep Learning'
tags:
  - Julia
  - Deep Learning
  - Scientific Computing
  - Neural Ordinary Differential Equations
  - Deep Equilibrium Models
authors:
  - name: Avik Pal
    orcid: 0000-0002-3938-7375
    affiliation: "1"
affiliations:
  - name: Electrical Engineering and Computer Science, CSAIL, MIT
    index: 1
date: 2 October 2023
bibliography: paper.bib
---

# Summary

Combining machine learning and scientific computing has recently led to the development of
methods such as Universal Differential Equations, Neural Differential Equations, and Deep
Equilibrium Models, which have been pushing the boundaries of the physical sciences.
However, every major deep learning framework requires existing numerical software to be
rewritten to satisfy its specific requirements. Lux.jl is a deep learning framework written
in Julia with the right abstractions to compose seamlessly with scientific computing
software. Lux uses pure functions to provide a compiler- and automatic-differentiation-friendly
interface without compromising on performance.
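
As a minimal sketch of this pure-function interface (the layer sizes and data below are
illustrative, not taken from the paper):

```julia
using Lux, Random

rng = Random.default_rng()
model = Chain(Dense(2 => 16, tanh), Dense(16 => 1))

# Parameters and states live outside the model: the forward pass is a pure
# function of the input, the parameters, and the states.
ps, st = Lux.setup(rng, model)
x = randn(rng, Float32, 2, 32)
y, st_new = Lux.apply(model, x, ps, st)
```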

# Statement of Need

Julia already has several well-established neural network frameworks, such as
Flux [@innes2018fashionable] and Knet [@yuret2016knet]. However, similar to PyTorch,
TensorFlow, etc., these frameworks were designed for typical deep learning workflows, and
scientific computing workflows had to be tailored to fit into them.

Having to rewrite these workflows, which are often highly optimized, is a major barrier to
research in this domain.

## Switching Automatic Differentiation Frameworks
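
Because the model is a pure function of its inputs, parameters, and states, the same model
definition can be differentiated by different automatic differentiation backends. As a
minimal sketch using Zygote, reusing `model`, `ps`, `st`, and `x` from the sketch in the
Summary (swapping in another backend such as Tracker is assumed to change only this call
site, not the model):

```julia
using Zygote

# Gradient of a scalar loss with respect to the parameters `ps`.
loss(p) = sum(abs2, first(Lux.apply(model, x, p, st)))
gs = Zygote.gradient(loss, ps)[1]
```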

## Support for CPU, NVIDIA GPUs and AMD GPUs
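
As an illustrative sketch (assuming the LuxCUDA backend package is installed; `gpu_device`
is provided by the LuxDeviceUtils.jl package), moving the parameters, states, and data to
an accelerator leaves the model definition untouched:

```julia
using Lux, LuxCUDA  # load LuxAMDGPU instead to target AMD GPUs

dev = gpu_device()  # falls back to the CPU when no GPU is available
ps_gpu, st_gpu, x_gpu = ps |> dev, st |> dev, x |> dev
y_gpu, _ = Lux.apply(model, x_gpu, ps_gpu, st_gpu)
```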

## Composability with Scientific Computing Softwares

In this section, we go over a couple of examples showing how Lux.jl can be used with other
scientific computing software. Lux.jl also has an extensive
[manual](https://lux.csail.mit.edu/dev/manual/interface),
[tutorials](https://lux.csail.mit.edu/dev/tutorials/), and
[API Reference](https://lux.csail.mit.edu/dev/api/), showcasing this composability in more
detail.

### Neural Ordinary Differential Equations
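
A hedged sketch of using a Lux model as the dynamics of a neural ODE through DiffEqFlux.jl
and OrdinaryDiffEq.jl (the exact keyword arguments may differ across versions):

```julia
using Lux, DiffEqFlux, OrdinaryDiffEq, Random

rng = Random.default_rng()
dudt = Chain(Dense(2 => 16, tanh), Dense(16 => 2))  # dynamics du/dt = f(u, p, t)
node = NeuralODE(dudt, (0.0f0, 1.0f0), Tsit5(); saveat = 0.1f0)

ps, st = Lux.setup(rng, node)
u0 = randn(rng, Float32, 2, 8)
sol, st_new = node(u0, ps, st)  # ODE solution, differentiable end-to-end
```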

### Deep Equilibrium Models

Deep Equilibrium Models [@bai2019deep; @pal2023continuous] are a class of neural networks
where the output of the model is the steady state of a dynamical system defined by an
internal neural network.
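
To illustrate the idea, here is a naive fixed-point iteration with a hypothetical
explicit-parameter layer `f` (not the solver-based implementation with implicit
differentiation used by packages such as DeepEquilibriumNetworks.jl):

```julia
using Lux, Random

# Hypothetical DEQ cell: iterate z ← f([z; x]) until z stops changing.
f = Dense(4 + 2 => 4, tanh)
rng = Random.default_rng()
ps, st = Lux.setup(rng, f)

function fixed_point(f, ps, st, x; maxiters = 100, tol = 1.0f-4)
    z = zeros(Float32, 4, size(x, 2))
    for _ in 1:maxiters
        z_new, st = Lux.apply(f, vcat(z, x), ps, st)
        maximum(abs, z_new .- z) < tol && return z_new, st
        z = z_new
    end
    return z, st
end

x = randn(rng, Float32, 2, 8)
zstar, _ = fixed_point(f, ps, st, x)  # the model output is this steady state
```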

## Ecosystem

# Limitations

Lux.jl is still in its early days of development and has the following known limitations:

* Training small neural networks on CPUs is not yet optimized. For small networks,
  [SimpleChains.jl](https://github.com/PumasAI/SimpleChains.jl) [@simplechains] is the fastest option!
* Nested automatic differentiation is currently not well supported. We hope to fix this soon,
  with a migration to the Enzyme automatic differentiation framework [@enzyme:2020; @enzyme:2021].

# References
