Typing of S. epidermidis samples (HDMx samples)

gene_x 0 like s 534 view s

Tags: processing

SCCmec_HDMs

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2786320/

Classification of Staphylococcal Cassette Chromosome mec (SCCmec): Guidelines for Reporting Novel SCCmec Elements

mec gene complex.

IS1272 is part of the IS1182 family of insertion sequences

  • The mec gene complex is composed of mecA, its regulatory genes, and associated insertion sequences.
  • The class A mec gene complex (class A mec) is the prototype complex, which contains mecA, the complete mecR1 and mecI regulatory genes upstream of mecA, and the hypervariable region (HVR) and insertion sequence IS431 downstream of mecA.
    • The class B mec gene complex is composed of mecA, a truncated mecR1 resulting from the insertion of IS1272 upstream of mecA, and [HVR and IS431 downstream of mecA].
  • The class C mec gene complex contains mecA and truncated mecR1 by the insertion of IS431 upstream of mecA and HVR and IS431 downstream of mecA.
  • There are two distinct class C mec gene complexes; in the class C1 mec gene complex, the IS431 upstream of mecA has the same orientation as the IS431 downstream of mecA (next to HVR), while in the class C2 mec gene complex, the orientation of IS431 upstream of mecA is reversed.
  • C1 and C2 are regarded as different mec gene complexes since they have likely evolved independently.
  • The class D mec gene complex is composed of mecA and ΔmecR1 but does not carry an insertion sequence downstream of ΔmecR1 (as determined by PCR analysis).
  • Several variants within the major classes of the mec gene complex have been described, including insertions of IS431 or IS1182 upstream of mecA in the class A mec gene complex or insertion of Tn4001 upstream of mecA in the class B mec complex.

  • run with bengal3

    1. cd ~/DATA/Data_Denise_CalCov1
    2. cp bacto-0.1.json ../Data_Denise_CalCov2
    3. cp cluster.json ../Data_Denise_CalCov2
    4. cp Snakefile ../Data_Denise_CalCov2
    5. ln -s /home/jhuang/Tools/bacto/local .
    6. ln -s /home/jhuang/Tools/bacto/db .
    7. ln -s /home/jhuang/Tools/bacto/envs .
    8. mkdir raw_data; cd raw_data
    9. ln -s ../Alignment_Imported_1/20240913_174420/Fastq/HDM7_S1_L001_R1_001.fastq.gz HDM7_R1.fastq.gz
    10. ln -s ../Alignment_Imported_1/20240913_174420/Fastq/HDM7_S1_L001_R2_001.fastq.gz HDM7_R2.fastq.gz
    11. ln -s ../Alignment_Imported_1/20240913_174420/Fastq/HDM10_S2_L001_R1_001.fastq.gz HDM10_R1.fastq.gz
    12. ln -s ../Alignment_Imported_1/20240913_174420/Fastq/HDM10_S2_L001_R2_001.fastq.gz HDM10_R2.fastq.gz
    13. ln -s ../20240812_FS10003086_50_BSB09416-2831/Alignment_Imported_1/20240813_202730/Fastq/HDM1_S1_L001_R1_001.fastq.gz HDM1_R1.fastq.gz
    14. ln -s ../20240812_FS10003086_50_BSB09416-2831/Alignment_Imported_1/20240813_202730/Fastq/HDM1_S1_L001_R2_001.fastq.gz HDM1_R2.fastq.gz
    15. ln -s ../20240913/Alignment_Imported_1/20240913_174420/Fastq/HDM7_S1_L001_R1_001.fastq.gz HDM7_R1.fastq.gz
    16. ln -s ../20240913/Alignment_Imported_1/20240913_174420/Fastq/HDM7_S1_L001_R2_001.fastq.gz HDM7_R2.fastq.gz
    17. ln -s ../20240913/Alignment_Imported_1/20240913_174420/Fastq/HDM10_S2_L001_R1_001.fastq.gz HDM10_R1.fastq.gz
    18. ln -s ../20240913/Alignment_Imported_1/20240913_174420/Fastq/HDM10_S2_L001_R2_001.fastq.gz HDM10_R2.fastq.gz
    19. ln -s ../20240919_FS10003086_61_BSB09416-2735/Alignment_Imported_1/20240920_173408/Fastq/HDM11-SF1_S1_L001_R1_001.fastq.gz HDM11-SF1_R1.fastq.gz
    20. ln -s ../20240919_FS10003086_61_BSB09416-2735/Alignment_Imported_1/20240920_173408/Fastq/HDM11-SF1_S1_L001_R2_001.fastq.gz HDM11-SF1_R2.fastq.gz
    21. ln -s ../20240919_FS10003086_61_BSB09416-2735/Alignment_Imported_1/20240920_173408/Fastq/HDM15-SF2_S2_L001_R1_001.fastq.gz HDM15-SF2_R1.fastq.gz
    22. ln -s ../20240919_FS10003086_61_BSB09416-2735/Alignment_Imported_1/20240920_173408/Fastq/HDM15-SF2_S2_L001_R2_001.fastq.gz HDM15-SF2_R2.fastq.gz
    23. # only activate the steps assembly and mlst in bacto-0.1.json.
    24. (bengal3_ac3) jhuang@WS-2290C:~/Documents$ /home/jhuang/miniconda3/envs/snakemake_4_3_1/bin/snakemake --printshellcmds
    25. # -- Results --
    26. shovill/HDM1/contigs.fa sepidermidis 5 arcC(1) aroE(1) gtr(1) mutS(2) pyrR(2) tpiA(1) yqiL(1)
    27. HDM10.mlst.txt:shovill/HDM10/contigs.fa sepidermidis 59 arcC(2) aroE(1) gtr(1) mutS(1) pyrR(2) tpiA(1) yqiL(1)
    28. HDM7.mlst.txt:shovill/HDM7/contigs.fa sepidermidis 59 arcC(2) aroE(1) gtr(1) mutS(1) pyrR(2) tpiA(1) yqiL(1)
  • run with bakta

    1. #under env (bakta)
    2. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    3. bakta --db /mnt/nvme0n1p1/bakta_db shovill/${sample}/contigs.fa --prefix ${sample}
    4. done
  • mapping on assembly to calculate the coverage

    1. #samtools depth input.bam > depth.txt
    2. #samtools depth input.bam | awk '{sum+=$3} END { print "Average coverage:",sum/NR}'
    3. #bedtools coverage -a regions.bed -b input.bam > coverage.txt
    4. #bedtools coverage -a regions.bed -b input.bam -d > coverage_per_base.txt
    5. bwa index ./shovill/HDM1/contigs.fa
    6. bwa mem ./shovill/HDM1/contigs.fa fastq/HDM1_1.fastq fastq/HDM1_2.fastq > aligned.sam
    7. samtools view -Sb aligned.sam > aligned.bam
    8. samtools sort aligned.bam -o sorted.bam
    9. samtools index sorted.bam
    10. samtools depth sorted.bam > depth.txt
    11. awk '{sum+=$3} END { print "Average coverage:",sum/NR}' depth.txt
    12. bedtools coverage -a regions.bed -b sorted.bam > coverage.txt
    13. bedtools genomecov -ibam sorted.bam -d > coverage_per_base.txt
    14. # Step 1: Calculate depth using samtools
    15. samtools depth sorted.bam > depth.txt
    16. # Step 2: Calculate average depth using awk
    17. awk '{sum+=$3; count++} END {print "Average Coverage:", sum/count}' depth.txt
    18. # Step 1: Calculate coverage with bedtools for a BED file
    19. #bedtools coverage -a regions.bed -b input.bam > coverage.txt
    20. # Step 2: Process the output with awk
    21. #awk '{ sum+=$7 } END { print "Average coverage depth:", sum/NR }' coverage.txt
    22. bwa index ./shovill/HDM7/contigs.fa
    23. bwa mem ./shovill/HDM7/contigs.fa fastq/HDM7_1.fastq fastq/HDM7_2.fastq > aligned_HDM7.sam
    24. samtools view -Sb aligned_HDM7.sam > aligned_HDM7.bam
    25. samtools sort aligned_HDM7.bam -o sorted_HDM7.bam
    26. samtools index sorted_HDM7.bam
    27. # Step 1: Calculate depth using samtools
    28. samtools depth sorted_HDM7.bam > depth_HDM7.txt
    29. # Step 2: Calculate average depth using awk
    30. awk '{sum+=$3; count++} END {print "Average Coverage:", sum/count}' depth_HDM7.txt
    31. #Average Coverage: 380.079
    32. bwa index ./shovill/HDM10/contigs.fa
    33. bwa mem ./shovill/HDM10/contigs.fa fastq/HDM10_1.fastq fastq/HDM10_2.fastq > aligned_HDM10.sam
    34. samtools view -Sb aligned_HDM10.sam > aligned_HDM10.bam
    35. samtools sort aligned_HDM10.bam -o sorted_HDM10.bam
    36. samtools index sorted_HDM10.bam
    37. # Step 1: Calculate depth using samtools
    38. samtools depth sorted_HDM10.bam > depth_HDM10.txt
    39. # Step 2: Calculate average depth using awk
    40. awk '{sum+=$3; count++} END {print "Average Coverage:", sum/count}' depth_HDM10.txt
    41. #Average Coverage: 254.704
  • SCCmec typing and drawing with clinker

    1. #1. -- HDM1_contigs.fa --
    2. One SCCmec element detected.
    3. Prediction based on genes:
    4. Predicted SCCmec element: SCCmec_type_IV(2B)
    5. Prediction based on homology to whole cassette:
    6. Predicted whole cassette and %template coverage: SCCmec_type_IV(2B) 79.92%
    7. Predicted genes:
    8. Fasta header % Identity Query/HSP Length Contig Position in contig
    9. ccrA2:7:81108:AB096217 100.00 1350/1350 contig00032 3770..5119
    10. ccrB2:9:JCSC4469:AB097677 99.94 1650/1650 contig00032 5120..6769
    11. IS1272:3:AM292304 99.95 1844/1843 contig00032 8611..10454
    12. dmecR1:1:AB033763 100.00 987/987 contig00032 10443..11429
    13. mecA:12:AB505628 100.00 2010/2010 contig00032 11526..13535
    14. samtools faidx shovill/HDM1_contigs.fa
    15. samtools faidx shovill/HDM1_contigs.fa contig00032:1-13635 > HDM1_sub.fna
    16. bakta --db /mnt/nvme0n1p1/bakta_db HDM1_sub.fna
    17. #2. -- HDM7_contigs.fa --
    18. One SCCmec element detected.
    19. Prediction based on genes:
    20. Predicted SCCmec element: SCCmec_type_IVa(2B)
    21. Prediction based on homology to whole cassette:
    22. Predicted whole cassette and %template coverage: SCCmec_type_IVa(2B) 84.24%
    23. Predicted genes:
    24. Fasta header % Identity Query/HSP Length Contig Position in contig
    25. mecA:12:AB505628 100.00 2010/2010 contig00014 2800..4809
    26. dmecR1:1:AB033763 99.90 987/987 contig00014 4906..5892
    27. IS1272:3:AM292304 100.00 1843/1843 contig00014 5881..7723
    28. ccrB2:3:CA05:AB063172 100.00 1629/1629 contig00014 9565..11193
    29. ccrA2:3:CA05:AB063172 100.00 1350/1350 contig00014 11215..12564
    30. subtype-IVa(2B):1:CA05:AB063172 100.00 1491/1491 contig00014 16461..17951
    31. #IS1272:2:AB033763 91.06 1577/1585 contig00001 369260..370836
    32. samtools faidx shovill/HDM7_contigs.fa
    33. samtools faidx shovill/HDM7_contigs.fa contig00014:2700-18051 > HDM7_sub.fna
    34. bakta --db /mnt/nvme0n1p1/bakta_db HDM7_sub.fna
    35. mecA
    36. dmecR1
    37. Type I restriction enzyme HindI endonuclease subunit-like C-terminal domain-containing protein
    38. IS1272
    39. DUF1643 domain-containing protein
    40. Pyridoxal phosphate-dependent enzyme
    41. hypothetical protein
    42. ccrB2
    43. ccrA2
    44. DUF927 domain-containing protein
    45. hypothetical protein
    46. ACP synthase
    47. AAA family ATPase (= subtype-IVa(2B))
    48. #3. -- HDM10_contigs.fa --
    49. Prediction based on genes:
    50. Predicted SCCmec element: SCCmec_type_IV(2B&5)
    51. Prediction based on homology to whole cassette:
    52. Predicted whole cassette and % template coverage: SCCmec_type_IV(2B) 84.37%
    53. Predicted genes:
    54. Fasta header % Identity Query/HSP Length Contig Position in contig
    55. subtype-IVa(2B):1:CA05:AB063172 100.00 1491/1491 contig00020 4152..5642
    56. ccrA2:3:CA05:AB063172 100.00 1350/1350 contig00020 9539..10888
    57. ccrB2:3:CA05:AB063172 100.00 1629/1629 contig00020 10910..12538
    58. IS1272:3:AM292304 100.00 1843/1843 contig00020 14380..16222
    59. dmecR1:1:AB033763 100.00 987/987 contig00020 16211..17197
    60. mecA:12:AB505628 100.00 2010/2010 contig00020 17294..19303
    61. #IS1272:2:AB033763 90.75 1579/1585 contig00033 2..1580
    62. #ccrC1-allele-2:1:AB512767 90.95 1680/1680 contig00022 9836..11515
    63. samtools faidx shovill/HDM10_contigs.fa
    64. samtools faidx shovill/HDM10_contigs.fa contig00020:4052-19403 > HDM10_sub.fna
    65. bakta --db /mnt/nvme0n1p1/bakta_db HDM10_sub.fna
    66. #4. -- HDM11-SF1_contigs.fa --
    67. No SCCmec element was detected
    68. Prediction based on genes:
    69. Predicted SCCmec element: none
    70. Prediction based on homology to whole cassette:
    71. Predicted whole cassette and %template coverage: none
    72. #5. -- HDM15-SF2_contigs.fa --
    73. SCCmec_type_IV(2B)
    74. SCCmec_type_VI(4B)
    75. Following gene complexes based on prediction of genes was detected :
    76. ccr class 2
    77. ccr class 4
    78. mec class B
    79. Predicted genes:
    80. Fasta header % Identity Query/HSP Length Contig Position in contig
    81. ccrA2:7:81108:AB096217 100.00 1350/1350 contig00004 3823..5172
    82. ccrB2:9:JCSC4469:AB097677 99.94 1650/1650 contig00004 5173..6822
    83. IS1272:3:AM292304 99.95 1844/1843 contig00004 8664..10507
    84. dmecR1:1:AB033763 100.00 987/987 contig00004 10496..11482
    85. mecA:12:AB505628 100.00 2010/2010 contig00004 11579..13588
    86. subtyppe-Vc(5C2&5):10:AB505629 99.84 1935/1935 contig00004 20148..22082
    87. ccrA4:2:BK20781:FJ670542 90.53 1362/1362 contig00004 24570..25931
    88. ccrB4:2:BK20781:FJ670542 91.68 1635/1629 contig00004 25928..27562
    89. subtype-IVa(2B):1:CA05:AB063172 100.00 1491/1491 contig00015 52228..53718
    90. samtools faidx shovill/HDM7_contigs.fa
    91. samtools faidx shovill/HDM7_contigs.fa contig00014:2700-18051 > HDM7_sub.fna
    92. bakta --db /mnt/nvme0n1p1/bakta_db HDM7_sub.fna
    93. samtools faidx shovill/HDM10_contigs.fa
    94. samtools faidx shovill/HDM10_contigs.fa contig00020:4052-19403 > HDM10_sub.fna
    95. bakta --db /mnt/nvme0n1p1/bakta_db HDM10_sub.fna
    96. samtools faidx shovill/HDM15-SF2_contigs.fa
    97. samtools faidx shovill/HDM15-SF2_contigs.fa contig00004:1-27662 > HDM15-SF2_sub.fna
    98. samtools faidx shovill/HDM15-SF2_contigs.fa contig00015:52128-53818 >> HDM15-SF2_sub.fna
    99. bakta --db /mnt/nvme0n1p1/bakta_db HDM15-SF2_sub.fna
    100. #END
    101. #172.104.140.19
    102. mkdir gbff_sub
    103. mv *_sub.gbff gbff_sub
    104. cd gbff_sub
    105. for f in *_sub.gbff; do mv "$f" "${f/_sub.gbff/.gbff}"; done
    106. #mv HDM1_sub.gbff HDM1.gbff
    107. #mv HDM7_sub.gbff HDM7.gbff
    108. #mv HDM10_sub.gbff HDM10.gbff
    109. #mv HDM15-SF2_sub.gbff HDM15-SF2.gbff
    110. rm *.json
    111. clinker *.gbff -p plot_HDRNA.html --dont_set_origin -s session_HDRNA.json -o alignments_HDRNA.csv -dl "," -dc 4
    112. cp ./gbff_HDRNA_01/clinker.png HDRNA_01_clinker.png
  • Arg typing

    1. grep "agrD" *.gbff | sort
    2. HDM1.gbff: /gene="agrD"
    3. HDM1.gbff: /gene="agrD"
    4. HDM7.gbff: /gene="agrD"
    5. HDM7.gbff: /gene="agrD"
    6. HDM10.gbff: /gene="agrD"
    7. HDM10.gbff: /gene="agrD"
    8. HDM11-SF1.gbff: /gene="agrD"
    9. HDM11-SF1.gbff: /gene="agrD"
    10. HDM15-SF2.gbff: /gene="agrD"
    11. HDM15-SF2.gbff: /gene="agrD"
    12. MNLLGGLLLKIFSNFMAVIGNASKYNPCSNYLDEPQVPEELTKLDE
    13. MENIFNLFIKFFTTILEFIGTVAGDSVCASYFDEPEVPEELTKLYE
    14. MENIFNLFIKFFTTILEFIGTVAGDSVCASYFDEPEVPEELTKLYE
    15. MNLLGGLLLKIFSNFMAVIGNASKYNPCSNYLDEPQVPEELTKLDE
    16. MNLLGGLLLKIFSNFMAVIGNASKYNPCSNYLDEPQVPEELTKLDE
    17. #* The agr typing is not defined, as I have compared the sequence with the amino acid sequences of ArgD described in the paper available at https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4187671/. It does not correspond to Type I, Type II, or Type III. (For more details, see below).
    18. -- AgrD I --
    19. Query 1 MENIFNLFIKFFTTILEFIGTVAGDSVCASYFDEPEVPEELTKLYE 46
    20. M + L +K F+ + IG + + C Y DEP+VPEELTKL E
    21. Sbjct 926825 MNLLGGLLLKIFSNFMAVIGNASKYNPCVMYLDEPQVPEELTKLDE 926688
    22. -- AgrD II --
    23. Query 1 MNLLGGLLLKIFSNFMAVIGNASKYNPCSNYLDEPQVPEELTKLDE 46
    24. MNLLGGLLLKIFSNFMAVIGNASKYNPC YLDEPQVPEELTKLDE
    25. Sbjct 926825 MNLLGGLLLKIFSNFMAVIGNASKYNPCVMYLDEPQVPEELTKLDE 926688
    26. -- AgrD III --
    27. Query 1 MNLLGGLLLKLFSNFMAVIGNAAKYNPCASYLDEPQVPEELTKLDE 46
    28. MNLLGGLLLK+FSNFMAVIGNA+KYNPC YLDEPQVPEELTKLDE
    29. Sbjct 926825 MNLLGGLLLKIFSNFMAVIGNASKYNPCVMYLDEPQVPEELTKLDE 926688
  • calulate the presence-absence-matrix for predefined gene list

    1. #start codon: ATG, GTG und TTG
    2. #stop codon: 5'-UAA-3', 5'-UGA-3' und 5'-UAG-3' --> TAA, TGA, TAG
    3. ./Staphylococcus_aureus_MRSA252.fasta
    4. ./Staphylococcus_epidermidis_RP62A.fasta
    5. ./Enterococcus_faecium_isolate_E300_pathogenicity_island.fasta
    6. # -- Hause keeper: gyrB --
    7. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":2609812-2611743 > gyrB.fasta
    8. #revcomp gyrB.fasta > gyrB_revcomp.fasta
    9. gyrB_revcomp.fasta
    10. # -- Metabolic genes: fumC, gltA, icd --
    11. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":1444326-1445711 > fumC.fasta
    12. #revcomp fumC.fasta > fumC_revcomp.fasta
    13. ./fumC_revcomp.fasta
    14. ./gltA.fasta
    15. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":1296195-1297463 > icd.fasta
    16. #revcomp icd.fasta > icd_revcomp.fasta
    17. icd_revcomp.fasta
    18. # -- Virulence regulartors: apsS, sigB, sarA, agrC, yycG --
    19. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":316151-317191 > apsS.fasta
    20. apsS.fasta
    21. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":1722805-1723575 > sigB.fasta
    22. #revcomp sigB.fasta > sigB_revcomp.fasta
    23. sigB_revcomp.fasta
    24. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":279424-279798 > sarA.fasta
    25. #revcomp sarA.fasta > sarA_revcomp.fasta
    26. sarA_revcomp.fasta
    27. ./agrC.fasta
    28. ./yycG.fasta
    29. # -- Toxins: psmβ1, hlb --
    30. ./psm-beta.fasta
    31. #psm-beta1.fasta
    32. ./hlb_.fasta
    33. #./hlb.fasta
    34. # -- Biofilm formation: atlE, sdrG, sdrH, ebh, ebp, tagB --
    35. ./atlE.fasta
    36. ./sdrG.fasta
    37. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":1555024-1556469 > sdrH.fasta
    38. #revcomp sdrH.fasta > sdrH_revcomp.fasta
    39. sdrH_revcomp.fasta
    40. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":1023531-1053980 > ebh.fasta
    41. #revcomp ebh.fasta > ebh_revcomp.fasta
    42. ebh_revcomp.fasta
    43. #https://www.ncbi.nlm.nih.gov/gene/?term=(Elastin-binding+protein)+AND+%22Staphylococcus+aureus%22%5Bporgn%3A__txid1282%5D
    44. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":1094204-1095586 > ebpS.fasta
    45. #revcomp ebpS.fasta > ebpS_revcomp.fasta
    46. ebpS_revcomp.fasta
    47. ./tagB.fasta
    48. # -- Immune evasion & colonization: capC, sepA, dltA, fmtC, lipA, sceD, SE0760 --
    49. ./capC.fasta
    50. #./capBCA_ywtC.fasta
    51. ./sepA.fasta
    52. #./ORF123_sepA_ORF5.fasta
    53. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":503173-504630 > dltA.fasta
    54. dltA.fasta
    55. ./fmtC.fasta
    56. #samtools faidx Staphylococcus_epidermidis_RP62A.fasta "gi|57865352|ref|NC_002976.3|":498445-499359 > lipA.fasta
    57. lipA.fasta
    58. ./sceD.fasta
    59. #./sceDAE.fasta
    60. ./SE0760.fasta
    61. # -- Serine protease: esp, ecpA --
    62. ./esp.fasta
    63. ./ecpA_.fasta
    64. #./ecpA.fasta
    65. # -- Phage: PhiSepi-HH1, PI-Sepi-HH2, PhiSepi-HH3 (#HH1-HP1, HH3-HP2, HH3-TreR) --
    66. ./MT880870.fasta
    67. ./MT880871.fasta
    68. ./MT880872.fasta
    69. #Note that write a message to Holger, say "ebp gene does not exist, instead of it only ebpS gene exists!"
    70. makeblastdb -in HDM1_contigs.fa -dbtype nucl
    71. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    72. blastn -db ../shovill/${sample}_contigs.fa -query gyrB_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > gyrB_on_${sample}.blastn
    73. done
    74. ./fumC_revcomp.fasta
    75. ./gltA.fasta
    76. icd_revcomp.fasta
    77. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    78. blastn -db ../shovill/${sample}_contigs.fa -query fumC_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > fumC_on_${sample}.blastn
    79. done
    80. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    81. blastn -db ../shovill/${sample}_contigs.fa -query gltA.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > gltA_on_${sample}.blastn
    82. done
    83. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    84. blastn -db ../shovill/${sample}_contigs.fa -query icd_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > icd_on_${sample}.blastn
    85. done
    86. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    87. blastn -db ../shovill/${sample}_contigs.fa -query apsS.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > apsS_on_${sample}.blastn
    88. done
    89. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    90. blastn -db ../shovill/${sample}_contigs.fa -query sigB_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > sigB_on_${sample}.blastn
    91. done
    92. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    93. blastn -db ../shovill/${sample}_contigs.fa -query sarA_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > sarA_on_${sample}.blastn
    94. done
    95. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    96. blastn -db ../shovill/${sample}_contigs.fa -query agrC.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > agrC_on_${sample}.blastn
    97. done
    98. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    99. blastn -db ../shovill/${sample}_contigs.fa -query yycG.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > yycG_on_${sample}.blastn
    100. done
    101. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    102. blastn -db ../shovill/${sample}_contigs.fa -query psm-beta.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > psm-beta_on_${sample}.blastn
    103. done
    104. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    105. blastn -db ../shovill/${sample}_contigs.fa -query psm-beta1.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > psm-beta1_on_${sample}.blastn
    106. done
    107. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    108. blastn -db ../shovill/${sample}_contigs.fa -query hlb_.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > hlb_on_${sample}.blastn
    109. done
    110. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    111. blastn -db ../shovill/${sample}_contigs.fa -query atlE.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > atlE_on_${sample}.blastn
    112. done
    113. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    114. blastn -db ../shovill/${sample}_contigs.fa -query sdrG.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > sdrG_on_${sample}.blastn
    115. done
    116. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    117. blastn -db ../shovill/${sample}_contigs.fa -query sdrH_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > sdrH_on_${sample}.blastn
    118. done
    119. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    120. blastn -db ../shovill/${sample}_contigs.fa -query ebh_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > ebh_on_${sample}.blastn
    121. done
    122. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    123. blastn -db ../shovill/${sample}_contigs.fa -query ebpS_revcomp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > ebpS_on_${sample}.blastn
    124. done
    125. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    126. blastn -db ../shovill/${sample}_contigs.fa -query tagB.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > tagB_on_${sample}.blastn
    127. done
    128. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    129. blastn -db ../shovill/${sample}_contigs.fa -query capC.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > capC_on_${sample}.blastn
    130. done
    131. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    132. blastn -db ../shovill/${sample}_contigs.fa -query sepA.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > sepA_on_${sample}.blastn
    133. done
    134. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    135. blastn -db ../shovill/${sample}_contigs.fa -query dltA.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > dltA_on_${sample}.blastn
    136. done
    137. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    138. blastn -db ../shovill/${sample}_contigs.fa -query fmtC.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > fmtC_on_${sample}.blastn
    139. done
    140. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    141. blastn -db ../shovill/${sample}_contigs.fa -query lipA.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > lipA_on_${sample}.blastn
    142. done
    143. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    144. blastn -db ../shovill/${sample}_contigs.fa -query sceD.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > sceD_on_${sample}.blastn
    145. done
    146. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    147. blastn -db ../shovill/${sample}_contigs.fa -query SE0760.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > SE0760_on_${sample}.blastn
    148. done
    149. # -- Serine protease: esp, ecpA --
    150. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    151. blastn -db ../shovill/${sample}_contigs.fa -query esp.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > ./esp_on_${sample}.blastn
    152. done
    153. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    154. blastn -db ../shovill/${sample}_contigs.fa -query ecpA_.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > ecpA_on_${sample}.blastn
    155. done
    156. # -- Phage: PhiSepi-HH1, PI-Sepi-HH2, PhiSepi-HH3 (#HH1-HP1, HH3-HP2, HH3-TreR) --
    157. #34053 (3000,2510) 36164 (500) 147057 (6618, 15237+3812, 15237+3812, 15233+3814, 15230+3812)
    158. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    159. blastn -db ../shovill/${sample}_contigs.fa -query MT880870.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > ./MT880870_on_${sample}.blastn
    160. done
    161. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    162. blastn -db ../shovill/${sample}_contigs.fa -query MT880871.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > ./MT880871_on_${sample}.blastn
    163. done
    164. for sample in HDM1 HDM7 HDM10 HDM11-SF1 HDM15-SF2; do
    165. blastn -db ../shovill/${sample}_contigs.fa -query MT880872.fasta -evalue 1e-50 -num_threads 15 -outfmt 6 -strand both -max_target_seqs 1 > MT880872_on_${sample}.blastn
    166. done

like unlike

点赞本文的读者

还没有人对此文章表态


本文有评论

没有评论

看文章,发评论,不要沉默


© 2023 XGenes.com Impressum