Skip to content

Commit

Permalink
Merge pull request #1 from Sydney-Informatics-Hub/details
Browse files Browse the repository at this point in the history
Small changes
  • Loading branch information
georgiesamaha authored Nov 28, 2024
2 parents 88a8f4a + 642b6d9 commit fdcb636
Show file tree
Hide file tree
Showing 4 changed files with 43 additions and 41 deletions.
18 changes: 9 additions & 9 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# USyd NCI Gadi guide
# USyd NCI Gadi guide

## Contributing to this repository

Expand All @@ -14,37 +14,37 @@

## How to contribute to the guide

You can choose to work either locally or on your Nimbus VM to make changes to this guide. Please work in VS code to easily edit the markdown files. No changes are to be made to the html files directly.
You can choose to work either locally or on your Nimbus VM to make changes to this guide. Please work in VS code to easily edit the markdown files. No changes are to be made to the html files directly.

Grab a copy of the repository by cloning it to your local machine or Nimbus VM:
Grab a copy of the repository by cloning it to your local machine or Nimbus VM:

```bash
git clone https://github.com/Sydney-Informatics-Hub/ANZAC_neurogenetics_Gadi_guide.git
git clone https://github.com/Sydney-Informatics-Hub/usyd-gadi-onboarding-guide.git
```

1. Edit `index.qmd` to change the main landing page. This is a markdown file.
2. Edit or create `setup.qmd` to change the Setup instruction pages. Same - basically a md file.
3. Edit `_quarto.yml` to reorder/rename/remove/replace the dropdown menu options.
4. Add additional `*.qmd` files to the root dir to have them converted to html files (and add them to `_quarto.yml` to make them navigable), if you'd like.
5. Add any figures to `figs/` and give them a descriptive name.
5. Add any figures to `figs/` and give them a descriptive name.

You can browse the result locally by exploring the html files created (note: sometimes figures display locally but not on web and the other way around too) by running:
You can browse the result locally by exploring the html files created (note: sometimes figures display locally but not on web and the other way around too) by running:

```bash
quarto preview
```

Once you've made changes to the materials, run the following on the commandline:
Once you've made changes to the materials, run the following on the commandline:

```bash
quarto render
# First time you create the file, add them to be tracked by github, e.g.
git add docs/*
git commit -am "your comments"
git push origin main
git push origin main
```

This repository has been set up to auto publish updates using a github action in `.github/workflows/quarto_render_publish.yml`. Once you push changes to main, you should see the updates on the [website](https://sydney-informatics-hub.github.io/ANZAC_neurogenetics_Gadi_guide/) within a couple of minutes.
This repository has been set up to auto publish updates using a github action in `.github/workflows/quarto_render_publish.yml`. Once you push changes to main, you should see the updates on the [website](https://sydney-informatics-hub.github.io/usyd-gadi-onboarding-guide/) within a couple of minutes.

### Themes, Aesthetic and Branding

Expand Down
22 changes: 11 additions & 11 deletions index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -7,25 +7,25 @@ output:

---

This document contains instructions for how to access and utilise NCI's HPC, Gadi for University of Sydney researchers. Please be mindful this is not an exhausive resource for using the NCI Gadi HPC. It is only intended to orientate you to the system and navigate the Gadi user documentation.
This document contains instructions for how to access and utilise NCI's HPC, Gadi for University of Sydney researchers. Please be mindful this is not an exhaustive resource for using the NCI Gadi HPC. It is only intended to orientate you to the system and navigate the Gadi user documentation.

NCI’s HPC systems are designed to handle complex computational tasks, such as simulations, data analysis, and modeling, for a wide range of scientific disciplines, including climate modeling, genomics, astronomy, and materials science.
NCI’s HPC systems are designed to handle complex computational tasks, such as simulations, data analysis, and modeling, for a wide range of scientific disciplines, including climate modeling, genomics, astronomy, and materials science.

The University of Sydney provides its researchers with subsidised access to NCI’s infrastructure. To arrange user credits (service units) for your account, please contact SIH's Research Computing team through our [website](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub/schemes.html).
The University of Sydney provides its researchers with subsidised access to NCI’s infrastructure. After creating an NCI account and project, visit [https://nci.sydney.edu.au/](https://nci.sydney.edu.au/) to request Service Units for your project (note: you must be on the University of Sydney Network/VPN).

## Support
## Support

SIH is limited in the support it can provide for NCI Gadi users. If you are new to HPC and Gadi, we expect you will attend [SIH's Intro to HPC](https://sydney-informatics-hub.github.io/training.artemis/) and NCI's Intro to Gadi courses. Additionally, familiarise yourself with Gadi using the [NCI Gadi user guide](https://opus.nci.org.au/display/Help/Gadi+User+Guide):
SIH is limited in the support it can provide for NCI Gadi users. If you are new to HPC and Gadi, we expect you will attend [SIH's Intro to HPC](https://sydney-informatics-hub.github.io/training.artemis/) and [NCI's Intro to Gadi](https://opus.nci.org.au/spaces/Help/pages/48497461/NCI+Training+and+Educational+Events) courses. Additionally, familiarise yourself with Gadi using the [NCI Gadi user guide](https://opus.nci.org.au/display/Help/Gadi+User+Guide):

* [SIH training calendar](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub/workshops-and-training/training-calendar.html)
* [NCI training calendar](https://opus.nci.org.au/display/Help/NCI+Training+and+Educational+Events)
* [NCI training calendar](https://opus.nci.org.au/display/Help/NCI+Training+and+Educational+Events)

For additional support, please contact the following people depending on your needs:
For additional support, please contact the following people depending on your needs:

| Type of issue | Who | How | Details |
|:---------|:----|:----|:---------|
| Service unit allocation for running jobs | SIH | [Log a ticket](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub.html) | [SIH schemes](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub/schemes.html) |
| An issue with NCI Gadi | NCI Helpdesk | [Log a ticket](https://nci.org.au/users/nci-helpdesk) | Provide your error, log file, jobid |
| Any error returned in running a job | NCI Helpdesk | [Log a ticket](https://nci.org.au/users/nci-helpdesk) | Provide your error, log file, jobid |
| Service unit allocation for running jobs | SIH | [Make a request](https://nci.sydney.edu.au/) | [SIH schemes](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub/schemes.html) |
| An issue with NCI Gadi | NCI Helpdesk | [Log an NCI ticket](https://nci.org.au/users/nci-helpdesk) | Provide your error, log file, jobid |
| Any error returned in running a job | NCI Helpdesk | [Log an NCI ticket](https://nci.org.au/users/nci-helpdesk) | Provide your error, log file, jobid |
| A bug in an SIH pipeline | SIH | Submit an issue on Github | Provide details e.g. [this issue](https://github.com/Sydney-Informatics-Hub/Germline-StructuralV-nf/issues/65) |
| Bioinformatics advice | SIH| [Log a ticket](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub.html) | Provide relevant context, errors, tool names, scripts |
| Bioinformatics advice | SIH| [Log an SIH ticket](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub.html) | Provide relevant context, errors, tool names, scripts |
4 changes: 3 additions & 1 deletion lesson.css
Original file line number Diff line number Diff line change
Expand Up @@ -618,13 +618,15 @@ div.error {
}
.footertext {
text-align: center;
font-size: 14px;
}
footer .license, footer .help-links {
font-size: inherit;
font-size: 16px;
margin-top: 10px;
margin-bottom: 10px;
font-weight: 500;
line-height: 1.1;
text-align: left;
}
img.navbar-logo {
height: 40px;
Expand Down
40 changes: 20 additions & 20 deletions notebooks/02_system_setup.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ This section will point you to the right sections of the NCI documentation and u

- [What is Gadi?](https://opus.nci.org.au/spaces/Help/pages/236881323/What+is+Gadi+...)

**NCI Gadi** is one of Australia's most powerful supercomputers, designed to support advanced computational research.
**NCI Gadi** is one of Australia's most powerful supercomputers, designed to support advanced computational research.

| **Component** | **Details** |
|---------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
Expand All @@ -27,11 +27,11 @@ This section will point you to the right sections of the NCI documentation and u
| **Cloud Systems** | - **Nirin Cloud**: High-availability and high-capacity zone integrated with Gadi and NCI's multi-Petabyte national research data collections, comprising Intel Broadwell and Sandy Bridge processors and NVIDIA K80 GPUs :contentReference[oaicite:5]{index=5} |


## Conditions of use
## Conditions of use

- [General conditions of use](https://opus.nci.org.au/spaces/Help/pages/230490621/Conditions+of+Use+and+Policies...)

All users of NCI agree that they will keep themselves informed of, and comply with, all relevant legislation and The Australian National University policies and rules.
All users of NCI agree that they will keep themselves informed of, and comply with, all relevant legislation and The Australian National University policies and rules.

All users must acknowledge and understand that a breach of these will result in not only a loss of access to NCI resources but the user may be subject to Federal criminal prosecution resulting in fines and/or gaol legislated under the Acts listed above.

Expand All @@ -46,7 +46,7 @@ See the [Gadi Resources Guide](https://opus.nci.org.au/pages/viewpage.action?pag

## Navigating the system

This section contains some example commands to run on the Gadi HPC system. To generalise across multiple groups we have used `<project>` in place of a specific project code. When running these commands, please replace `<project>` with your project code.
This section contains some example commands to run on the Gadi HPC system. To generalise across multiple groups we have used `<project>` in place of a specific project code. When running these commands, please replace `<project>` with your project code.

E.g. if your project code is `aa00`, you would run:

Expand All @@ -55,38 +55,38 @@ cd /scratch/aa00
```
:::

#### Login nodes
#### Login nodes

These nodes are the gateway for Gadi for users to access the resources of the HPC cluster. It is how you log in to Gadi, move around the filesystem, submit jobs to the scheduler, and do small tasks like view the contents of a file.
These nodes are the gateway for Gadi for users to access the resources of the HPC cluster. It is how you log in to Gadi, move around the filesystem, submit jobs to the scheduler, and do small tasks like view the contents of a file.

#### Compute nodes
#### Compute nodes

These nodes are the workhorses of any HPC. They are dedicated for executing computational tasks as delegated by the job scheduler when you submit a job. There are various types of compute nodes with different hardware, built for different purposes on Gadi. Depending on the resource requirements of your job (e.g. high memory, GPUs) and the queue you specify, your job will be sent to a specific type of compute node. You can find a breakdown of their technical specifications [here](https://nci.org.au/our-systems/hpc-systems).
These nodes are the workhorses of any HPC. They are dedicated for executing computational tasks as delegated by the job scheduler when you submit a job. There are various types of compute nodes with different hardware, built for different purposes on Gadi. Depending on the resource requirements of your job (e.g. high memory, GPUs) and the queue you specify, your job will be sent to a specific type of compute node. You can find a breakdown of their technical specifications [here](https://nci.org.au/our-systems/hpc-systems).

::: {.callout-warning}
### :bangbang: Pay Attention :bangbang:
Compute nodes on Gadi don't currently have access to external internet. If any tasks within a submitted job on the compute node need to access the internet, they will fail. These jobs should be run separtately on the `copyq` using the data mover nodes.
Compute nodes on Gadi don't currently have access to external internet. If any tasks within a submitted job on the compute node need to access the internet, they will fail. These jobs should be run separtately on the `copyq` using the data mover nodes. Or else use an ARE job.
:::

#### Data mover nodes
#### Data mover nodes

These nodes are designed specifically for fast data movement. You can use these nodes to transfer files to and from Gadi at high-speed. Steps outlined [here](https://opus.nci.org.au/pages/viewpage.action?pageId=236880317). A script for moving data between USyd RDS and Gadi is provided in `/g/data/scripts` and explained in the following section, [transferring data](./data_transfer.qmd).

### ***Filesystems***

#### `$HOME`

When you first log in to Gadi, you'll be placed in your personal $HOME directory (i.e. `/home/555/aa1234`). You are the only person who can access this directory. **No work** should be done in here, but you may wish to install things like custom R or Python libraries here. It is backed up but you have a 10Gb storage limit.
When you first log in to Gadi, you'll be placed in your personal $HOME directory (i.e. `/home/555/aa1234`). You are the only person who can access this directory. **No work** should be done in here, but you may wish to install things like custom R or Python libraries here. It is backed up but you have a 10Gb storage limit.

You can navigate back here at any point if required by running:
You can navigate back here at any point if required by running:

```bash
cd ~
```

#### `/scratch`

All Gadi projects have a dedicated `/scratch` allocation that is only accessible to members of your project. This is only intended for active work on big files and not for long-term storage. This is not backed up and any files not accessed for 100 days will be purged from the system, so be sure to back up your work to RDS. Your `/scratch` contains a directory for each user (denoted by their Gadi username), however you can organise things however you wish here.
All Gadi projects have a dedicated `/scratch` allocation that is only accessible to members of your project. This is only intended for active work on big files and not for long-term storage. This is not backed up and any files not accessed for 100 days will be purged from the system, so be sure to back up your work to RDS. Your `/scratch` contains a directory for each user (denoted by their Gadi username), however you can organise things however you wish here.

You can navigate to your `/scratch` space by running:

Expand All @@ -108,22 +108,22 @@ cd /gdata/<project>

You can also access a communally maintained bioinformatics software project at `/g/data/if89`. Users currently have to request access to if89, but it is a good place to find bioinformatics software previously installed by others. To request access to __if89__:

1. Log into [MyNCI](https://my.nci.org.au/mancini/login?next=/mancini/)
1. Log into [MyNCI](https://my.nci.org.au/mancini/login?next=/mancini/)
2. Navigate to __Projects and Groups__
3. Search for __if89__ and request access
4. Click join.
4. Click join.

Access needs to be manually approved. If you are experiencing delays of >24 hours, please contact SIH.
Access needs to be manually approved. If you are experiencing delays of >24 hours, please contact SIH.

#### `/apps`

This directory is accessible to all Gadi users. It is a read-only system containing centrally installed software applications and their module files. You can check what software is installed here:
This directory is accessible to all Gadi users. It is a read-only system containing centrally installed software applications and their module files. You can check what software is installed here:

```bash
ls /apps
```

You can use any software that is installed here by first loading the module file, e.g.:
You can use any software that is installed here by first loading the module file, e.g.:

```bash
module load samtools
Expand All @@ -137,6 +137,6 @@ samtools view -H sample.bam

### ***Queues***

Like on Artemis, the job scheduler is PBSPro, however it is implented in a slightly different way. To run jobs on Gadi, users should submit to a specific queue on a corresponding node. The queue and node you choose to run on will depend on the types of resources your job needs. Pipelines your group use have already been configured to run on specific queues.
Like on Artemis, the job scheduler is PBSPro, however it is implented in a slightly different way. To run jobs on Gadi, users should submit to a specific queue on a corresponding node. The queue and node you choose to run on will depend on the types of resources your job needs. Pipelines your group use have already been configured to run on specific queues.

For custom PBS scripts, you can work out what queue to run your job on by checking the [NCI queue documentation](https://opus.nci.org.au/pages/viewpage.action?pageId=236880996) and [queue limits explainer](https://opus.nci.org.au/pages/viewpage.action?pageId=236881198). Most jobs will be suitable for normal or normalbw queues. The normal queues have more nodes available for your jobs, and will allow users, and jobs that require a specialised queue, to get fair access to those resources. Express queues are designs to support work that needs a faster turnaround, but will be charged accordingly at a higher service unit charge.
For custom PBS scripts, you can work out what queue to run your job on by checking the [NCI queue documentation](https://opus.nci.org.au/pages/viewpage.action?pageId=236880996) and [queue limits explainer](https://opus.nci.org.au/pages/viewpage.action?pageId=236881198). Most jobs will be suitable for normal or normalbw queues. The normal queues have more nodes available for your jobs, and will allow users, and jobs that require a specialised queue, to get fair access to those resources. Express queues are designs to support work that needs a faster turnaround, but will be charged accordingly at a higher service unit charge.

0 comments on commit fdcb636

Please sign in to comment.