:78 Function create_function() is deprecated [8192]

fixing names of files in exmaples

Marcelo Ponce [2019-07-22 16:33:43]
fixing names of files in exmaples
Filename
README
diff --git a/README b/README
index ccfbfe6..386bd06 100644
--- a/README
+++ b/README
@@ -454,13 +454,14 @@ showing the trend in use of working space.

 I) calling reads for genic regions (ORF)
 I.i) the following command will run the countReads.sh script using:
-	- 'data2/_1_MED1_INPUT_S25_L007_R1_001.fasta.gz'  as the file with the INPUT reads
-	- 'data2/_3_MED1_IP_S27_L007_R1_001.fasta.gz' as the file with IP reads
-	- 'T_thermophila_June2014_assembly.fasta' as the reference genome for the T.thermophila organism
-	- 'T_thermophila_June2014.gff3' as the annotation file for the T.thermophila
+	- 'dataDIR/chipseq_INPUT_file.fasta.gz'  as the file with the INPUT reads
+	- 'dataDIR/chipseq_IP_file.fasta.gz' as the file with IP reads
+	- 'REFfiles/T_thermophila_June2014_assembly.fasta' as the reference genome for the T.thermophila organism
+	- 'REFfiles/T_thermophila_June2014.gff3' as the annotation file for the T.thermophila
 	- '/tmp/' as working space
+	- notice that 'dataDIR' and 'REFfiles' are the directories where the data is located and it can be given as absolute or relative paths

-	PATHtoRACSrepo/core/countReads.sh   data2/_1_MED1_INPUT_S25_L007_R1_001.fastq.gz  data2/_3_MED1_IP_S27_L007_R1_001.fastq.gz  T_thermophila_June2014_assembly.fasta  T_thermophila_June2014.gff3  /tmp/
+	PATHtoRACSrepo/core/countReads.sh   dataDIR/chipseq_INPUT_file.fastq.gz  dataDIR/chipseq_IP_file.fastq.gz  REFfiles/T_thermophila_June2014_assembly.fasta  REFfiles/T_thermophila_June2014.gff3  /tmp/

 In this case, beucase we haven't specfied the number of cores to use, the
 script will try to automatically detect the number of cores in the computer and
@@ -473,21 +474,21 @@ parallel regions of the pipeline.
 Additionally it is using the system's 'time' command to time how long the
 pipeline takes to run.

-	time  PATHtoRACSrepo/core/countReads.sh   data2/_1_MED1_INPUT_S25_L007_R1_001.fastq.gz  data2/_3_MED1_IP_S27_L007_R1_001.fastq.gz  T_thermophila_June2014_assembly.fasta  T_thermophila_June2014.gff3  /dev/shm/  16
+	PATHtoRACSrepo/core/countReads.sh   dataDIR/chipseq_INPUT_file.fastq.gz  dataDIR/chipseq_IP_file.fastq.gz  REFfiles/T_thermophila_June2014_assembly.fasta  REFfiles/T_thermophila_June2014.gff3  /dev/shm/  16


 II) Determination of InterGenic Regions (IGR)
 II.i) the following command will determine the intergenic regions, using:
-	- 'combinedTABLES_MED1-MED2' as the table which was determined in the ORF part of the pipeline
-	- 'dataset/T_thermophila_June2014.sorted.gff3' as the micro-organism reference genome file
-	- 'interGENs_MED1-MED2.csv' is the name of the table that the IGR part of the pipeline will
+	- 'FINAL.table.chipseq_INPUT_file-chipseq_IP_file' as the table which was determined in the ORF part of the pipeline; this file is located in the output subdirectory ORF_RACS_results-YYYYMMDD-HHMM
+	- 'REFfiles/T_thermophila_June2014.gff3' as the micro-organism reference genome file
+	- 'interGENs_chipseqExp_INPUTid-IPid.csv' is the name of the table that the IGR part of the pipeline will
 		generate, ie. this will be the output of this part of the pipeline
 	- 'samples.file' is a text file containing the name of the BAM output file, also generated when
 		running the ORF part from RACS; they are usually named as
                 alnDATASET_INDICATORS_ChIP_SXX_RY_ZZZ.fastq.gz-sorted.bam
                 alnDATASET_INDICATORS_Input_SXX_RY_ZZZ.fastq.gz-sorted.bam

-	PATHtoRACSrepo/core/intergenic/det-interGenes.sh  combinedTABLES_MED1-MED2  dataset/T_thermophila_June2014.sorted.gff3  interGENs_MED1-MED2.csv  samples.file
+	PATHtoRACSrepo/core/intergenic/det-interGenes.sh  FINAL.table.chipseq_INPUT_file-chipseq_IP_file  REFfiles/T_thermophila_June2014.sorted.gff3  interGENs_chipseqExp_INPUTid-IPid.csv  samples.file


 II.ii) we included a submission script in the 'hpc' directory, named "IGR-jobs_parallel.gnu",
@@ -552,11 +553,11 @@ filters have to be specified; eg.

 For instance, the following command will select the micro-RNA for the Tetrahymena thermophila,

-	PATHtoRACSrepo/core/table.sh  T_thermophila_June2014.gff3  PATHtoRACSrepo/core/defns/TT_mRNA.id
+	PATHtoRACSrepo/core/table.sh  REFfiles/T_thermophila_June2014.gff3  PATHtoRACSrepo/core/defns/TT_mRNA.id

 or, the "genes" for the Oxytricha trifallax,

-	PATHtoRACSrepo/core/table.sh  Oxytricha_trifallax_022112.gff3  PATHtoRACSrepo/core/defns/OXY_gene.id
+	PATHtoRACSrepo/core/table.sh  REFfiles/Oxytricha_trifallax_022112.gff3  PATHtoRACSrepo/core/defns/OXY_gene.id


 IV.ii) ORF determination
@@ -564,9 +565,9 @@ IV.ii) ORF determination
 information passed to the "table.sh" script as explained in the previous
 examples for the analysis of ORFs,

-	PATHtoRACSrepo/core/countReads.sh   data2/_1_MED1_INPUT_S25_L007_R1_001.fastq.gz  data2/_3_MED1_IP_S27_L007_R1_001.fastq.gz  T_thermophila_June2014_assembly.fasta  T_thermophila_June2014.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/TT_mRNA.id
+	PATHtoRACSrepo/core/countReads.sh   dataDIR/chipseq_INPUT_file.fastq.gz  dataDIR/chipseq_IP_file.fastq.gz  REFfiles/T_thermophila_June2014_assembly.fasta  REFfiles/T_thermophila_June2014.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/TT_mRNA.id

-        PATHtoRACSrepo/core/countReads.sh  SRX483016_1.fastq.gz  SRX483017_1.fastq.gz  Oxytricha_trifallax_022112_assembly.fasta  Oxytricha_trifallax_022112.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/OXY_gene.id
+        PATHtoRACSrepo/core/countReads.sh  dataDIR/SRX483016_1.fastq.gz  dataDIR/SRX483017_1.fastq.gz  REFfiles/Oxytricha_trifallax_022112_assembly.fasta  REFfiles/Oxytricha_trifallax_022112.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/OXY_gene.id

 where "SRX483017_1.fastq.gz" corresponds to the data for the ChIP-Seq
 run/experiment #1, and "SRX483016_1.fast1.gz" corresponds to the data for the
@@ -578,7 +579,7 @@ solely difference that the script will try to automatically detect and use all
 the cores available in the computer; this is achieved by setting the 6th
 argument to "", which is needed when indicating the seventh argument:

-	PATHtoRACSrepo/core/countReads.sh  SRX483016_1.fastq.gz  SRX483017_1.fastq.gz  Oxytricha_trifallax_022112_assembly.fasta  Oxytricha_trifallax_022112.gff3  /dev/shm/  ""  PATHtoRACSrepo/core/defns/OXY_gene.id
+	PATHtoRACSrepo/core/countReads.sh  dataDIR/SRX483016_1.fastq.gz  dataDIR/SRX483017_1.fastq.gz  REFfiles/Oxytricha_trifallax_022112_assembly.fasta  REFfiles/Oxytricha_trifallax_022112.gff3  /dev/shm/  ""  PATHtoRACSrepo/core/defns/OXY_gene.id


 IV.iii) IGR determination
@@ -586,7 +587,7 @@ IV.iii) IGR determination
 considerations other than using the "FINAL.table.*" file generated by the RACS'
 ORF tool. Eg.

-	PATHtoRACSrepo/core/intergenic/det-interGenes.sh FINAL.table.SRX483016_1-SRX483017_1 Oxytricha_trifallax_022112.gff3 interGENs_run1_OXY.csv  sample.input
+	PATHtoRACSrepo/core/intergenic/det-interGenes.sh  FINAL.table.SRX483016_1-SRX483017_1  REFfiles/Oxytricha_trifallax_022112.gff3  interGENs_run1_OXY.csv  sample.input

 where the file "sample.input" contains the following files:
ViewGit