Quick Start

From PileLine

(Difference between revisions)
Jump to: navigation, search
Line 31: Line 31:
Executing this code you will obtain 3 output files:
Executing this code you will obtain 3 output files:
-
* FileNAME1: Variants found in Case1 but not in Control1 (i.e. somatic point mutations or SNPs)
+
* [[FileNAME1]]: Variants found in Case1 but not in Control1 (i.e. somatic point mutations or SNPs)
-
* FileNAME2: Variants found in Control1 but not in Case1 (i.e. somatic point mutations or SNPs)
+
* [[FileNAME2]]: Variants found in Control1 but not in Case1 (i.e. somatic point mutations or SNPs)
-
* FileNAME3: Variants found in Case1 and in Control1 (i.e. germinal mutations or SNPs)
+
* [[FileNAME3]]: Variants found in Case1 and in Control1 (i.e. germinal mutations or SNPs)
'''3.'''  You can also compare several sample at variant level:
'''3.'''  You can also compare several sample at variant level:

Revision as of 15:35, 18 June 2010

PipeLine input files

PileLine is capable to handle, filter and compare genomic position files (GP) including standard pileup, BED,GFF, or VCF files.

Basically, GP are tabular files where the two first columns contain chromosome name and position coordinate respectively. Additional optional fields are accepted in PileLine, see an example of GP input file below:

10     118829     optional1     optional2     optional3     ...    
10     121207     optional1     optional2     optional3     ...
10     121337     optional1     optional2     optional3     ...
10     121636     optional1     optional2     optional3     ...

PipeLine Guided example

1. Download GP example files (pileup format) to your working directory:

File:Control1Files.zip
File:Control2Files.zip
File:Case1Files.zip
File:Case2Files.zip

Each .zip file contains 2 pileup files:

  • Whole pileup file
  • Variants (vs reference genome) pileup file.

2. You may compare Case1 vs Control1 at variant level using this command line:

$ cd DOWNLOADED_FILES_DIRECTORY 
$ sh YOUR_PATH_TO_PILELINE/cmd/pileline-2smc.sh 
–a ./Control1.pileup –b ./Case1.pileup
–v ./Control1varfilter.pileup –w ./Case1varfilter.pileup 
–o ./myoutput.txt

Executing this code you will obtain 3 output files:

  • FileNAME1: Variants found in Case1 but not in Control1 (i.e. somatic point mutations or SNPs)
  • FileNAME2: Variants found in Control1 but not in Case1 (i.e. somatic point mutations or SNPs)
  • FileNAME3: Variants found in Case1 and in Control1 (i.e. germinal mutations or SNPs)

3. You can also compare several sample at variant level:

$ sh YOUR_PATH_TO_PILELINE/cmd/pileline-nsmc.sh 
-a ./Control1varfilter.pileup -a ./Control2varfilter.pileup
-b ./Case1varfilter.pileup -b ./Case2varfilter.pileup 

4. At this point it could be useful to annotate SNPs. Execute this command line to this end :

Personal tools