Merge pull request #25 from ogawa/devel-20190305
add system updates pages.
ogawa authored Mar 13, 2019
2 parents 2cc20dc + 4cbbf31 commit 2f196b1
Showing 6 changed files with 397 additions and 37 deletions.
14 changes: 11 additions & 3 deletions en/docs/01.md
@@ -35,21 +35,29 @@ The software available on the ABCI system is shown below.
| Category | Software | Version |
|:--|:--|:--|
| OS | CentOS | 7.4 |
| Development Environment | Intel Parallel Studio XE Cluster Edition | 2018.2.046|
| | PGI Professional Edition | 18.5 |
| Development Environment | Intel Parallel Studio XE Cluster Edition | 2017.8<br>2018.2<br>2018.3 |
| | PGI Professional Edition | 17.10<br>18.5 |
| | NVIDIA CUDA SDK | 8.0.61.2<br>9.0.176.2<br>9.0.176.3<br>9.0.176.4<br>9.1.85.3<br>9.2.88.1<br>9.2.148.1 |
| | GCC | 4.8.5 |
| | Python | 2.7.15<br>3.4.8<br>3.5.5<br>3.6.5 |
| | Ruby | 2.0.0.648-33 |
| | R | 3.5.0 |
| | Java | 1.8.0_131 |
| | Java | 1.6.0_41<br>1.7.0_141<br>1.8.0_131 |
| | Scala | 1.27-248 |
| | Lua | 5.1.4 |
| | Perl | 5.16.3 |
| File System | DDN GRIDScaler | 4.2.3-8 |
| | BeeOND | 6.18 |
| Container | Docker | 17.12.0 |
| | Singularity | 2.6.1 |
| MPI | Intel MPI | 2018.2.199 |
| | MVAPICH2 | 2.3rc2<br>2.3 |
| | MVAPICH2-GDR | 2.3a<br>2.3rc1<br>2.3 |
| | Open MPI | 1.10.7<br>2.1.3<br>2.1.5<br>2.1.6<br>3.0.3<br>3.1.0<br>3.1.2<br>3.1.3 |
| Library | cuDNN | 5.1.10<br>6.0.21<br>7.0.5<br>7.1.3<br>7.1.4<br>7.2.1<br>7.3.1<br>7.4.2<br>7.5.0 |
| | NCCL | 1.3.5-1<br>2.1.15-1<br>2.2.13-1<br>2.3.5-2<br>2.3.7-1<br>2.4.2-1 |
| | gdrcopy | 1.2 |
| | Intel MKL | 2017.8<br>2018.2<br>2018.3 |

## 1.4. Storage Configuration

169 changes: 169 additions & 0 deletions en/docs/system-updates.md
@@ -0,0 +1,169 @@
# System Updates

<!--
## 2019-XX-XX
| Add / Update / Delete | Software | Version | Previous version |
|:--|:--|:--|:--|
| Update | CentOS | 7.X | 7.4 |
| Add | CUDA | 10.0.X | |
-->

## 2019-03-14

| Add / Update / Delete | Software | Version | Previous version |
|:--|:--|:--|:--|
| Add | Intel Compiler | 2017.8, 2018.3 | |
| Add | PGI | 17.10 | |
| Add | Open MPI | 2.1.6 | |
| Add | cuDNN | 7.5.0 | |
| Add | NCCL | 2.4.2-1 | |
| Add | Intel MKL | 2017.8, 2018.3 | |

Other fixes are as follows:

* Add PGI 17.10 support to MVAPICH2-GDR 2.3
* Add PGI support to Open MPI 2.1.5, 2.1.6, 3.1.3
* Change the default version of Open MPI to 2.1.6
* Fix a typo in the MVAPICH2 modules (wrong top directory)
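
Because the default Open MPI version changed, a job script that loads Open MPI without an explicit version may now pick up 2.1.6. The following is a minimal sketch for spotting such lines. It assumes the module is named `openmpi` (as in `openmpi/3.1.3`) and that an unversioned `module load` follows the system default; `find_unpinned_openmpi` is a hypothetical helper name, not an ABCI tool.

```shell
# Hypothetical helper: read a job script on stdin and print, with line numbers,
# every "module load" line that names openmpi without a "/<version>" suffix.
# Such lines are assumed to follow the system default version.
find_unpinned_openmpi() {
  # "|| true" keeps the exit status zero when nothing matches.
  grep -nE 'module[[:space:]]+load[[:space:]]+(.*[[:space:]])?openmpi([[:space:]]|$)' || true
}
```

For example, `find_unpinned_openmpi < job.sh` (where `job.sh` is a placeholder for your own script) lists the lines to review; pinning a version such as `openmpi/3.1.3` keeps the script independent of the default.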

## 2019-01-31

### User/Group/Job names are now masked when displaying the result of 'qstat'

We changed the job scheduler configuration so that User/Group/Job names are masked in the output of the `qstat` command. These columns are shown only for your own jobs; for other users' jobs, they are masked with '*'. An example follows:

```
[username@es1 ~]$ qstat -u '*' | head
job-ID prior name user state submit/start at queue jclass slots ja-task-ID
------------------------------------------------------------------------------------------------------------------------------------------------
123456 0.28027 run.sh username r 01/31/2019 12:34:56 gpu@g0001 80
123457 0.28027 ********** ********** r 01/31/2019 12:34:56 gpu@g0002 80
123458 0.28027 ********** ********** r 01/31/2019 12:34:56 gpu@g0003 80
123450 0.28027 ********** ********** r 01/31/2019 12:34:56 gpu@g0004 80
```
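
Scripts that parse `qstat -u '*'` output can no longer rely on the name or user columns for other users' jobs. As a rough sketch, the function below counts the masked rows. It assumes the column layout shown above (user is the fourth whitespace-separated field, after two header lines) and that a masked field consists only of `*` characters; `count_masked_jobs` is a hypothetical helper name.

```shell
# Hypothetical helper: read qstat output on stdin and print the number of rows
# whose user column (assumed to be field 4) is fully masked with '*'.
count_masked_jobs() {
  # NR > 2 skips the header line and the dashed separator line.
  awk 'NR > 2 && $4 ~ /^\*+$/ { n++ } END { print n + 0 }'
}
```

On the system this could be used as `qstat -u '*' | count_masked_jobs`.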

## 2018-12-18

| Add / Update / Delete | Software | Version | Previous version |
|:--|:--|:--|:--|
| Add | NCCL | 2.3.7-1 | |
| Add | cuDNN | 7.4.2 | |
| Add | Open MPI | 3.0.3, 3.1.3 | |
| Add | MVAPICH2-GDR | 2.3 | |
| Add | Hadoop | 2.9.2 | |
| Add | Spark | 2.3.2, 2.4.0 | |
| Add | Go | 1.11.2 | |
| Add | Intel MKL | 2018.2.199 | |

### NCCL 2.3.7-1

The NVIDIA Collective Communications Library (NCCL) 2.3.7-1 was installed.

The release notes are available at: [NCCL Release 2.3.7](https://docs.nvidia.com/deeplearning/sdk/nccl-release-notes/index.html)

To set up user environment:

```
$ module load cuda/9.2/9.2.148.1
$ module load nccl/2.3/2.3.7-1
```

### cuDNN 7.4.2

The NVIDIA CUDA Deep Neural Network library (cuDNN) 7.4.2 was installed.

The release notes are available at: [cuDNN Release Notes v7.4.2](https://docs.nvidia.com/deeplearning/sdk/cudnn-release-notes/rel_742.html)

To set up user environment:

```
$ module load cuda/9.2/9.2.148.1
$ module load cudnn/7.4/7.4.2
```

### Open MPI 3.0.3, 3.1.3

Open MPI 3.0.3 and 3.1.3 (without --cuda option) were installed.

To set up user environment:

```
$ module load openmpi/3.1.3
```

### MVAPICH2-GDR 2.3

MVAPICH2-GDR 2.3 was installed.

To set up user environment:

```
$ module load cuda/9.2/9.2.148.1
$ module load mvapich/mvapich2-gdr/2.3
```

### Hadoop 2.9.2

Apache Hadoop 2.9.2 was installed.

To set up user environment:

```
$ module load openjdk/1.8.0.131
$ module load hadoop/2.9.2
```

### Spark 2.3.2, 2.4.0

Apache Spark 2.3.2 and 2.4.0 were installed.

To set up user environment:

```
$ module load spark/2.4.0
```

### Go 1.11.2

Go Programming Language 1.11.2 was installed.

To set up user environment:

```
$ module load go/1.11.2
```

### Intel MKL 2018.2.199

Intel Math Kernel Library (MKL) 2018.2.199 was installed.

To set up user environment:

```
$ module load intel-mkl/2018.2.199
```

## 2018-12-14

| Add / Update / Delete | Software | Version | Previous version |
|:--|:--|:--|:--|
| Update | Singularity | 2.6.1 | 2.6.0 |
| Delete | Singularity | 2.5.2 | |

Singularity 2.6.1 was installed. The usage is as follows:

```
$ module load singularity/2.6.1
$ singularity run image_path
```

The release notes are available at:

[Singularity 2.6.1](https://github.com/sylabs/singularity/releases/tag/2.6.1)

In addition, we uninstalled versions 2.5.2 and 2.6.0 because a severe security issue ([CVE-2018-19295](https://cve.mitre.org/cgi-bin/cvename.cgi?name=2018-19295)) was reported. If your job scripts specify a Singularity version number, such as `singularity/2.5.2` or `singularity/2.6.0`, please modify them to specify `singularity/2.6.1`.

```
ex) module load singularity/2.5.2 -> module load singularity/2.6.1
```
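
The rewrite above can also be done mechanically. This is a minimal sketch, assuming job scripts reference the removed versions only on `module load` lines; `update_singularity_module` is a hypothetical helper name.

```shell
# Hypothetical helper: read a job script on stdin and write it to stdout with
# pinned singularity/2.5.2 and singularity/2.6.0 loads rewritten to 2.6.1.
# Only lines containing "module load" are touched.
update_singularity_module() {
  sed -E '/module[[:space:]]+load/ s#singularity/(2\.5\.2|2\.6\.0)#singularity/2.6.1#g'
}
```

For example, `update_singularity_module < job.sh > job.new.sh` (both file names are placeholders) leaves the original script untouched so the result can be reviewed before it replaces the old one.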
32 changes: 17 additions & 15 deletions en/mkdocs.yml
@@ -8,21 +8,23 @@ copyright: 'Copyright &copy; 2018 National Institute of Advanced Industrial Scie
google_analytics: ['UA-118371652-2', 'portal.abci.ai']
nav:
- 'Introduction': 'index.md'
- '1. ABCI System Overview': '01.md'
- '2. ABCI System User Environment': '02.md'
- '3. Job Execution Environment': '03.md'
- '4. Storage': '04.md'
- '5. Use Environment Modules': '05.md'
- '6. Use Python': '06.md'
- '7. Use GPU': '07.md'
- '8. Use MPI': '08.md'
- '9. Linux Container': '09.md'
- '10. Software Development Environment': '10.md'
- '11. Application Framework': '11.md'
- '12. Remote desktop': '12.md'
- '13. AWS': '13.md'
- 'Appendix 1. Configuration of Installed Software': 'appendix1.md'
- 'Appendix 2. Use of ABCI System for HPCI ': 'appendix2.md'
- Users Guide:
- '1. ABCI System Overview': '01.md'
- '2. ABCI System User Environment': '02.md'
- '3. Job Execution Environment': '03.md'
- '4. Storage': '04.md'
- '5. Use Environment Modules': '05.md'
- '6. Use Python': '06.md'
- '7. Use GPU': '07.md'
- '8. Use MPI': '08.md'
- '9. Linux Container': '09.md'
- '10. Software Development Environment': '10.md'
- '11. Application Framework': '11.md'
- '12. Remote desktop': '12.md'
- '13. AWS': '13.md'
- 'Appendix 1. Configuration of Installed Software': 'appendix1.md'
- 'Appendix 2. Use of ABCI System for HPCI ': 'appendix2.md'
- 'System Updates': 'system-updates.md'
- 'Privacy Policy': 'https://portal.abci.ai/docs/privacy-policy/'
theme:
name: 'material'
14 changes: 11 additions & 3 deletions ja/docs/01.md
@@ -33,21 +33,29 @@ ABCIシステムで利用可能なソフトウェア一覧を以下に示しま
| Category | Software | Version |
|:--|:--|:--|
| OS | CentOS | 7.4 |
| Development Environment | Intel Parallel Studio XE Cluster Edition | 2018.2.046 |
| | PGI Professional Edition | 18.5 |
| Development Environment | Intel Parallel Studio XE Cluster Edition | 2017.8<br>2018.2<br>2018.3 |
| | PGI Professional Edition | 17.10<br>18.5 |
| | NVIDIA CUDA SDK | 8.0.61.2<br>9.0.176.2<br>9.0.176.3<br>9.0.176.4<br>9.1.85.3<br>9.2.88.1<br>9.2.148.1 |
| | GCC | 4.8.5 |
| | Python | 2.7.15<br>3.4.8<br>3.5.5<br>3.6.5 |
| | Ruby | 2.0.0.648-33 |
| | R | 3.5.0 |
| | Java | 1.8.0_131 |
| | Java | 1.6.0_41<br>1.7.0_141<br>1.8.0_131 |
| | Scala | 1.27-248 |
| | Lua | 5.1.4 |
| | Perl | 5.16.3 |
| File System | DDN GRIDScaler | 4.2.3-8 |
| | BeeOND | 6.18 |
| Container | Docker | 17.12.0 |
| | Singularity | 2.6.1 |
| MPI | Intel MPI | 2018.2.199 |
| | MVAPICH2 | 2.3rc2<br>2.3 |
| | MVAPICH2-GDR | 2.3a<br>2.3rc1<br>2.3 |
| | Open MPI | 1.10.7<br>2.1.3<br>2.1.5<br>2.1.6<br>3.0.3<br>3.1.0<br>3.1.2<br>3.1.3 |
| Library | cuDNN | 5.1.10<br>6.0.21<br>7.0.5<br>7.1.3<br>7.1.4<br>7.2.1<br>7.3.1<br>7.4.2<br>7.5.0 |
| | NCCL | 1.3.5-1<br>2.1.15-1<br>2.2.13-1<br>2.3.5-2<br>2.3.7-1<br>2.4.2-1 |
| | gdrcopy | 1.2 |
| | Intel MKL | 2017.8<br>2018.2<br>2018.3 |

## 1.4. Storage Configuration
