:78 Function create_function() is deprecated [8192]

rewritting a little bit the countReads.sh for ORF, more details about the output; normalizing PATHtoRACSrepo

Marcelo Ponce [2019-07-09 01:04:05]
rewritting a little bit the countReads.sh for ORF, more details about the output; normalizing PATHtoRACSrepo
Filename
README
diff --git a/README b/README
index d1def8c..debf06d 100644
--- a/README
+++ b/README
@@ -179,13 +179,26 @@ to the packages SAMtools and BWA.
 This script can utilize multi-core parallelization when possible via threads
 which can be explicitly input by the user or automatically detected by the script.

-At the end of the execution the script will produce a table indicating XXXXX.
-The resulting files, some intermediate files and the final tables, are located
-in a directory created in the directory from where the script is executed named
-'ORF_RACS_results-YYYYMMDD-HHMMSS' where 'YYYYMMDD-HHMMSS' indicates the year-month-day
-and time when the script was executed.
-
-
+At the end of the execution, the script will create a directory named
+'ORF_RACS_results-YYYYMMDD-HHMMSS' where 'YYYYMMDD-HHMMSS' indicates the
+year-month-day and time when the script was executed.
+The resulting files generated when executing the ORF countReads script,
+including some intermediate files and the final table, will be located in this
+"ORF_RACS_results-..." directory.
+Some of these intermediate files located in the "ORF_RACS_results-..." directory,
+e.g. bam and bai files, could be useful to keep for additional analysis, such as,
+visually inspecting some of them with IGV.
+
+In particular within this results' directory, there will be a file named
+"FINAL.table....", which is a plain text file presenting the results for the
+reads in the ORF in tabulated format with the following columns:
+
+	location  name  description  GeneSize  INPUT_reads  IP_reads
+
+This "FINAL.table...." file, which summarizes the results for the reads in the
+ORF, is one of the mandatory 'inputs' for the IGR scripts of the RACS pipeline.
+
+Summary of "Inputs" and "Outputs" of the ORF 'countReads.sh" script:
 Inputs:
 	- INPUT file (fastq.gz)
 	- IP file (fastq.gz)
@@ -205,15 +218,15 @@ located and find the different tools needed during the execution.
 Also, please do not "source" the scripts, they are set to be run as executable scripts,
 ie. execute the scripts in this way,

-	PATHtoRACS/core/../scriptNAME.sh  args
+	PATHtoRACSrepo/core/../scriptNAME.sh  args

 do *NOT* execute them in this way,

-	. PATHtoRACS/core/../scriptNAME.sh  args
+	. PATHtoRACSrepo/core/../scriptNAME.sh  args

 NOR in this way neither,

-	source PATHtoRACS/core/../scriptNAME.sh  args
+	source PATHtoRACSrepo/core/../scriptNAME.sh  args

 Our scripts will also try to detect this situation and prevent the user from doing this.
 Due to the modular implementation we designed for RACS, and in order to allow RACS
@@ -424,8 +437,8 @@ It requires 3 arguments:
    - 3rd argument: "PF-IP-value"  PF value corresponding to the IP file
    - 4th argument: 'A' or 'D' (OPTIONAL), when this 4th argument is specified, an additional table is created being ordered with respect to the IP/INPUT ratio, in "A"scending or "D"ecreasing order

-       PATHtoRACS/core/normalizeORF.sh  FINAL.table.XXXX  14694464  10148171
-       PATHtoRACS/core/normalizeORF.sh  FINAL.table.XXXX  14694464  10148171  A
+       PATHtoRACSrepo/core/normalizeORF.sh  FINAL.table.XXXX  14694464  10148171
+       PATHtoRACSrepo/core/normalizeORF.sh  FINAL.table.XXXX  14694464  10148171  A

 Alternatively one could use a couple of spreadsheets available in the 'datasets'
 directory, if the user prefers to proceed in an interactive manner.
@@ -448,11 +461,11 @@ filters have to be specified; eg.

 For instance, the following command will select the micro-RNA for the Tetrahymena thermophila,

-	PATHtoRACSrepo/core/table.sh  T_thermophila_June2014.gff3  PATHtoRACS/core/defns/TT_mRNA.id
+	PATHtoRACSrepo/core/table.sh  T_thermophila_June2014.gff3  PATHtoRACSrepo/core/defns/TT_mRNA.id

 or, the "genes" for the Oxytricha trifallax,

-	PATHtoRACSrepo/core/table.sh  Oxytricha_trifallax_022112.gff3  PATHtoRACS/core/defns/OXY_gene.id
+	PATHtoRACSrepo/core/table.sh  Oxytricha_trifallax_022112.gff3  PATHtoRACSrepo/core/defns/OXY_gene.id


 IV.ii) ORF determination
@@ -460,9 +473,9 @@ IV.ii) ORF determination
 passed to the "table.sh" script as explained in the previous examples for the analysis of
 ORFs,

-	PATHtoRACSrepo/core/countReads.sh   data2/_1_MED1_INPUT_S25_L007_R1_001.fastq.gz  data2/_3_MED1_IP_S27_L007_R1_001.fastq.gz  T_thermophila_June2014_assembly.fasta  T_thermophila_June2014.gff3  /dev/shm/  16  PATHtoRACS/core/defns/TT_mRNA.id
+	PATHtoRACSrepo/core/countReads.sh   data2/_1_MED1_INPUT_S25_L007_R1_001.fastq.gz  data2/_3_MED1_IP_S27_L007_R1_001.fastq.gz  T_thermophila_June2014_assembly.fasta  T_thermophila_June2014.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/TT_mRNA.id

-        PATHtoRACSrepo/core/countReads.sh  SRX483016_1.fastq.gz  SRX483017_1.fastq.gz  Oxytricha_trifallax_022112_assembly.fasta  Oxytricha_trifallax_022112.gff3  /dev/shm/  16  PATHtoRACS/core/defns/OXY_gene.id
+        PATHtoRACSrepo/core/countReads.sh  SRX483016_1.fastq.gz  SRX483017_1.fastq.gz  Oxytricha_trifallax_022112_assembly.fasta  Oxytricha_trifallax_022112.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/OXY_gene.id

 where "SRX483017_1.fastq.gz" corresponds to the data for run #1 ChIP-Seq, and
 "SRX483016_1.fast1.gz" corresponds to the data for the Input of the
@@ -493,10 +506,10 @@ IV.iv) Full analysis of Oxytricha trifallax
 	# 2.i) first the reference genome
 	PATHtoRACSrepo/datasets/get_GFF3-files.sh
 	# 2.ii) now the ChIP-seq data, you will the NCBI-SRA toolkit for this step!
-	PATHtoRACS/repo/datasets/get_OXYchIPseq-files.sh
+	PATHtoRACSrepo/datasets/get_OXYchIPseq-files.sh

 	# 3) determination of ORFs with RACS
-	PATHtoRACSrepo/core/countReads.sh  SRX483016_1.fastq.gz  SRX483017_1.fastq.gz  Oxytricha_trifallax_022112_assembly.fasta  Oxytricha_trifallax_022112.gff3  /dev/shm/  16  PATHtoRACS/core/defns/OXY_gene.idf
+	PATHtoRACSrepo/core/countReads.sh  SRX483016_1.fastq.gz  SRX483017_1.fastq.gz  Oxytricha_trifallax_022112_assembly.fasta  Oxytricha_trifallax_022112.gff3  /dev/shm/  16  PATHtoRACSrepo/core/defns/OXY_gene.idf

 	# 4) determination of the IGRs
 	# 4.i) first, cd into the ORF_... directory generated by the previous step where the results from the ORF part of the RACS pipeline were placed
ViewGit