Sandbox Workshop
Agenda
Day 1
Time | Activity
---|---
9:00 | Morning coffee (optional)
9:30 | Introduction to the Sandbox project
10:00 | Introduction to HPC: the basics
10:30 | Coffee break
10:45 | DK HPC resources, access, and intro to UCloud
11:15 | UCloud demo: using apps and running jobs
12:00 | Lunch
13:00 | Proteomics App
14:15 | Coffee break
14:30 | Proteomics App
16:00 | End of day!
Day 2
Time | Activity
---|---
9:00 | Morning coffee (optional)
9:30 | RDM intro for health data science
10:30 | Coffee break
10:45 | Step-by-step guide: simple solutions
12:00 | Lunch
13:00 | Transcriptomics App
14:30 | Coffee break
14:45 | Transcriptomics App
15:45 | Wrap-up and feedback
16:00 | End of day and goodbye!
Workshop
The Health Data Science Sandbox aims to be a training resource for bioinformaticians, data scientists, and those generally curious about how to investigate large biomedical datasets. We are an active and developing project seeking interested users (both trainees and educators). All of our open-source materials are available on our GitHub page and can be used on a computing cluster! We work with UCloud, GenomeDK, and Computerome, the major Danish academic supercomputers. See our HPC Access page for more info on each setup.
Access Sandbox resources
Our first choice is to provide all the training materials, tutorials, and tools as interactive apps on UCloud, the supercomputer located at the University of Southern Denmark. To use these resources, you’ll need the following:
- a Danish university ID so you can sign on to UCloud via WAYF¹.
  For UCloud access, click here.
- basic ability to navigate in Linux/RStudio/Jupyter. You don’t need to be an expert, but teaching you to code from scratch while also teaching you to run analyses is beyond our ambitions (and our course material). We recommend taking a basic R or Python course before diving in.
For workshop participants: use our invite link to the correct UCloud workspace, which will be shared on the day of the workshop. This way, we can provide you with compute resources for the active sessions of the workshop². Click the link below after your first UCloud access.
Invitation link to UCloud workspace
Our apps can also run on other clusters, simply by pulling a Docker container. You only need to have either docker or singularity installed on the cluster. GenomeDK supports singularity and can therefore run our learning material as well. Ask us if you want to use the apps outside of UCloud.
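As a rough illustration of the "either docker or singularity" requirement, the minimal Python sketch below checks which of the two runtimes is on the PATH and builds the matching pull command. The function name `container_pull_command` and any image reference passed to it are hypothetical: this page does not list the Sandbox image names, so ask the Sandbox team for the actual ones.

```python
import shutil


def container_pull_command(image, which=shutil.which):
    """Build a pull command for the first container runtime found on PATH.

    `image` is a placeholder image reference (an assumption for this sketch);
    the real Sandbox image names are not given on this page.
    `which` is injectable so the lookup can be replaced in tests.
    """
    if which("docker"):
        return ["docker", "pull", image]
    if which("singularity"):
        # Singularity pulls Docker images through the docker:// prefix.
        return ["singularity", "pull", f"docker://{image}"]
    raise RuntimeError("Neither docker nor singularity was found on PATH")
```

The function only returns the command as a list of arguments, so it can be inspected first and then handed to `subprocess.run` once the real image reference is known.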
Using our modules
The agenda starts with an introduction to High Performance Computing (HPC) and UCloud. You will try two apps during the workshop:
Transcriptomics
Our Transcriptomics Sandbox covers bulk and single cell RNA sequencing analysis and visualization. It supports, amongst others, two regular workshops and provides stand-alone visualisation tools. In the next update we will introduce advanced tutorials for more complex single cell RNA sequencing analysis from some of our supported courses.
If you’re interested in bulk or single cell RNA sequencing analysis and visualization, join Sandbox Data Scientist Jose Alejandro Romero Herrera (Alex) in testing out our Transcriptomics Sandbox app. This app supports regular 3-4 day workshops at University of Copenhagen. Sign in to UCloud and then click this invite link.
Genomics
If you’re interested in NGS technologies and applications ranging from genome assembly to variant calling to metagenomics, join Sandbox Data Scientist Samuele Soraggi in testing out our Genomics Sandbox app. This app supports a semester-length course on NGS as well as a Population Genomics course run regularly at Aarhus University. Sign in to UCloud and then click this invite link.
Proteomics
Interested in modern methods for protein structure prediction? Join Sandbox Data Scientist Jacob Fredegaard Hansen as he walks you through how to use ColabFold on UCloud. Jacob can also demo our Proteomics Sandbox, which contains a suite of proteomics analysis tools that will support a future course in clinical proteomics but is already available on UCloud for interested users. Sign in to UCloud and then click this invite link.
Discussion and feedback
We hope you enjoyed the live demo. If you have broader questions, suggestions, or concerns, now is the time to raise them! If you are totally toast for the day, remember that you can check out longer versions of our tutorials as well as other topics and tools in each of the Sandbox modules, or join us for a multi-day in-person course (follow our news here).
As data scientists, we would also be really happy to get some quantifiable info and feedback: we want to build things that the Danish health data science community is excited to use. Please answer these 5 questions for us before you head out for the day (link activated on day of the workshop).
Nice meeting you and we hope to see you again!
Footnotes
1. Other institutions (e.g. hospitals, libraries, …) can log on through WAYF. See all institutions here.
2. To use Sandbox materials outside of the workshop: remember that each new user has 1000 kroner of free computing credit and around 50 GB of free storage, which can be used to run many hours of code from our workshop. If you run out of credit (which takes a long time), you’ll need to check with the local DeiC office at your university about how to request compute hours on UCloud. Contact us at the Sandbox if you need help or want more information.