|Harvard PGP Data Collections for Download|
|CWL Quick Start|
|wes jobs image|
|DREAM infrastructure challenge||
Arvados submissions for the GA4GH-DREAM Workflow Execution Challenge
|GATK Haplotype Caller Project||
This tutorial demonstrates how to run the GATK Haplotype Caller pipeline using GenomeAnalysisTK-3.2-2 from the Broad Institute. These pipelines currently support GATK version 3.2-2 only (md5 3163cbeef8fd50d8cb85096758b801a3, Keep content hash 2e98fdc8e90f4c48a0714b711767c9ce+76). You must obtain your own GATK jar file to run this pipeline; it is available from the Broad Institute’s GATK licensing site. Instructions for uploading your file can be found on the Arvados documentation page or in the tutorial below. If you run into any problems, please contact email@example.com.
This is a complete GATK workflow written in CWL v1.0.
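Before uploading, it can help to confirm that your copy is the supported release. The following is a minimal sketch, assuming the MD5 quoted above refers to the file you downloaded; the function names and the file name in the usage note are hypothetical and not part of Arvados or GATK.

```python
import hashlib

# Checksum quoted in the project description for GATK 3.2-2.
EXPECTED_MD5 = "3163cbeef8fd50d8cb85096758b801a3"


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_supported_file(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 matches the expected checksum."""
    return file_md5(path) == expected.lower()
```

For example, `is_supported_file("GenomeAnalysisTK.jar")` would return `True` only if that (hypothetical) local file hashes to the checksum listed above.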
|FoG Boston 2016 Work Project|
|Old Ancestry Mapper Runs|
|bcbio test runs||
bcbio CWL test runs: https://github.com/chapmanb/bcbio-nextgen/tree/master/cwl
|test parent project|
|GATK bcbio style|
|Mason Lab - Methylkit||
MethylKit is an R package for DNA methylation analysis from high-throughput bisulfite sequencing. Its features include coverage and methylation statistics, differential methylation analysis, feature annotation, and reading of methylation calls.
|Public Bioinformatics tools||
Binaries of some Bioinformatics tools
|lobSTR v.3 (Public)||
lobSTR is a tool for profiling Short Tandem Repeats (STRs) from high-throughput sequencing data.
|UMC Public Pipeline (BOSC 2015)||
A BWA-GATK pipeline by the UMC Utrecht community, used in the poster “Developing an Arvados BWA-GATK pipeline” at BOSC 2015.
|GATK2 Unified Genotyper (Public)||
Run GATK2 on paired-end reads and call variants using Unified Genotyper. To run this pipeline, click “Run a pipeline” and select “Demo GATK2 Pipeline”. Feel free to use the "PGP HU34D5B9 “FASTQ” exome" data set as input; it contains two sets of paired-end FASTQ files.
Complete Genomics whole genome sequencing raw data for Harvard Personal Genome Project participant hu826751 (2014-10-17).
|Outputs of PATHOMAP_P00553.vcf||
Mason Lab – Pathomap Output data for PATHOMAP_P00553.vcf
Mason Lab – Pathomap Docker Images
|Output Demo Data||
Mason Lab – Pathomap Output Data
|GATK3 Haplotype Caller (Public)||
Run the GATK3 Best Practices pipeline on paired-end reads and call variants using both Haplotype Caller and Unified Genotyper. To run this pipeline, click “Run a pipeline” and select “Demo GATK3 Haplotype Caller Pipeline” from the GATK3 Haplotype Caller Project. Feel free to use the "PGP HU34D5B9 “FASTQ” exome" data set as input; it contains two sets of paired-end FASTQ files.
The bcbio-nextgen project was created by Brad Chapman from the Harvard School of Public Health.
|Mason Lab - Pathomap / Ancestry Mapper (Public)|
|Public Datasets / Collections|
Input FASTQ files and call variants using Platypus.
|Sample Public Pipelines||
A list of all public pipelines currently running on Arvados.
|Public GA4GH Collection||
|PCA of 174 whole genomes from the Personal Genome Project||
Principal component analysis of 174 whole genome sequences (chromosomes 13 and 17) from the Personal Genome Project. From this project, you can explore the inputs (numpy files, path lengths, and human population data), the environment (Docker image), the code (under pipeline templates), the tests that were run (under pipelines), and their output. To rerun any analysis or alter inputs, sign in (creating an account if you do not have one) and create a copy of this project.