WIKI


EurOPDX Pipelines

EurOPDX Galaxy bioinformatics pipelines are designed to process raw sequencing data with standardized, well known workflows allowing to export the data into formats that can be easily processed within EurOPDX research infrastructure.

Instead of filling a .xlsx template with e.g. mutation and CNA values, a PDX data provider can upload raw experimental data files acquired according to standardized procedures (currently RNAseq and Mutation are supported), together with the required data on samples, patient, diagnosis, treatment etc. 

The computational pipeline processes the raw data files, deriving the quantities of interest (gene mutation, CNA values, gene expression etc.). The final step of the workflow is the upload of the results to the sandbox DataPortal, Data Hub and cBioPortal where the PDX data provider can analyze, visualize and test the data as a whole before making the resulting data set available to the other users.

EurOPDX Galaxy Pipelines are available at pipeline-prod.edirex.ics.muni.cz

Overview of bioinformatic pipeline integration




Start exploring the EurOPDX Pipelines potential now

Are you interrested in using EurOPDX Pipelines for PDX data preparation or want additional information?

Please contact us on it@europdx.eu



RNA-seq Pipeline

RNA-seq Pipeline is available in Galaxy and is integrated with sandbox EuroPDX Data Portal and cBioPortal.

Technical prerequisities

Following prerequisities are necessary to run RNA-seq Pipeline in Galaxy platform:



GALAXY CLUSTER (Openstack VM’s)

HOW TO ACCESS GALAXY

  • Go to this URL: http://pipeline-prod.edirex.ics.muni.cz
  • Log in as “galaxy_user” with password “galaxy_tutorial”
  • SMake sure you are using “RNASeq_Pipeline_Data” history (check the right panel)
  • PART I: Uploading Sample Data + Metadata to Galaxy

  • GFor the purpose of checking pipeline functionality only, the uploads should be skipped; go directly to slide #12 Workflow Selection
  • In the RNASeq_Pipeline_Data history, the input files are available as Expression_Sample_List collection
  • STEP 1: UPLOAD DATA FILES TO THE GALAXY FTP SERVER

    Use FTP Client (e.g., FileZilla) to upload data files to Galaxy server with following credentials:

  • Username: test@nonexistent.cz Password: galaxy_tutorial Server Address: pipeline-prod.edirex.ics.muni.cz
  • STEP 2: UPLOAD DATA FILES FROM FTP SERVER TO THE GALAXY (2/2)

  • Access Galaxy URL: http://pipeline-prod.edirex.ics.muni.cz:/
  • Log in as “ galaxy_user” with password “galaxy_tutorial”
  • STEP 3: CREATE A LIST OF DATASET PAIRS (1/2)

    STEP 3: CREATE A LIST OF DATASET PAIRS (2/2)

    STEP 4: Upload Metadata Files

    If not sure about these files than contact us for the help 😀

    PART II Workflow Selection & Execution

    WORKFLOW SELECTION

    -->Go to this URL: http://pipeline-prod.edirex.ics.muni.cz:/workflows/list

    RNA SEQ WORKFLOW

    INPUT PARAMETERS (1/2)

    INPUT PARAMETERS (2/2)

    WORKFLOW IN EXECUTION...

    RESULTS OF WORKFLOW EXECUTION

    EXPLORING IMPORTED DATA IN DATA PORTAL