匹配某一行并保留大写字母?
输入文件格式
C4 Alignment:
------------
Query: UN074481
Target: scaffold9929 [revcomp]
Model: est2genome
Raw score: 2379
Query range: 0 -> 510
Target range: 1114739 -> 1048547
1 : CGCACACCACACAACCACTCACGCCATGGAACACACATCACACAACCACCCACCAACTAACACATCCATGGCCACGGAACGCACACCACACAGCCACCCTCCAACACATCCATGGCCGGCGCGGGCAAGCAGGCCATCCGCGGGGGCGGGGAGCAGGGCGGCCGCACTTGGCGGAT : 176
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(44)
||||||
1114739 : CGCACACCACACAACCACTCACGCCATGGAACACACATCACACAACCACCCACCAACTAACACATCCATGGCCACGGAACGCACACCACACAGCCACCCTCCAACACATCCATGGCCGGCGCGGGCAAGCAGGCCATCCGCGGGGGCGGGGAGCAGGGCGGCCGCACTTGGCGGAT : 1114564
177 : GCACGAGCGGTGAGCAGGGCGGTGCCGCGGGCGGCGCCGCGGGCACGGAGCAGGGCCACCGCGCTGGCAGCGAGCTTGGCGGATGCTCGGGCGACGAGCTTGCCGGACGCGCGGGCGACGAGCATGGCGCGCAGCGGCGGCTCACTCCACCGTCGACTGCTCAGCGCAA >>>> : 346
|||||||||+-
1114563 : GCACGAGCGGTGAGCAGGGCGGTGCCGCGGGCGGCGCCGCGGGCACGGAGCAGGGCCACCGCGCTGGCAGCGAGCTTGGCGGATGCTCGGGCGACGAGCTTGCCGGACGCGCGGGCGACGAGCATGGCGCGCAGCGGCGGCTCACTCCACCGTCGACTGCTCAGCGCAAgg..... : 1114392
347 : Target Intron 1 >>>> GGGCGCGACGGATTCTTCCCTCGGGCGCGCGGCAGCCTCTTCGCTCGGGCGCGCGGTGGCATCTTTCCTAGAGCATGGCGCGTGACGGCCACTACAGAGGAGCTCCTCCCTCCGGCGTCGGCCACCCGACACTGCACTGGCGCCCGGCTGTCCC : 499
65682 bp +-||||| | |||
|
|||||| |||| ||| ||||||||
||
| || |||||||
1114391 : ....................aaGGGCGTGGCGGCTTCTTCCCTCGGGCGCGCGGCGGCCTCTTCGCTCGGGCGCGCGGTGGCCTCTTCCCTCGAGCATGGTGCGTGACGGCCACTACAGAGGAGCTCCTCCCTGCGGCGTCGGCCACCCGACACTGCACTGGCGCGCGACTGTCCC : 1048559
500 : CCCCCCCCCCC : 510
|| || | | |
1048558 : CCTCCTCTCTC : 1048548
# --- START OF GFF DUMP ---
#
#
##gff-version 2
##source-version exonerate:est2genome 2.2.0
##date 2016-06-22
##type DNA
#
#
# seqname source feature start end score strand frame attributes
#
scaffold9929 exonerate:est2genome gene 1048548 1114739 2379 - . gene_id 0 ; sequence UN074481 ; gene_orientation +
scaffold9929 exonerate:est2genome utr5 1114395 1114739 . - .
scaffold9929 exonerate:est2genome exon 1114395 1114739 . - . insertions 0 ; deletions 0
scaffold9929 exonerate:est2genome splice5 1114393 1114394 . - . intron_id 1 ; splice_site "GG"
scaffold9929 exonerate:est2genome intron 1048713 1114394 . - . intron_id 1
scaffold9929 exonerate:est2genome splice3 1048713 1048714 . - . intron_id 0 ; splice_site "AA"
scaffold9929 exonerate:est2genome exon 1048548 1048712 . - . insertions 0 ; deletions 0
scaffold9929 exonerate:est2genome similarity 1048548 1114739 2379 - . alignment_id 0 ; Query UN074481 ; Align 1114740 1 345 ; Align 1048713 346 165
# --- END OF GFF DUMP ---
#
-- completed exonerate analysis
Command line: [./exonerate INPUT/UN183704.fa INPUT/scaffold9929.fa --model est2genome --showtargetgff TRUE --showvulgar no --showalignment yes --alignmentwidth 200 --bestn 1 --verbose 2]
Hostname: [node009]
想要匹配竖线(|)下边的行,并保留这一行所有的大写字母
最后的结果
CGCACACCACACAACCACTCACGCCATGGAACACACATCACACAACCACCCACCAACTAACACATCCATGGCCACGGAACGCACACCACACAGCCACCCTCCAACACATCCATGGCCGGCGCGGGCAAGCAGGCCATCCGCGGGGGCGGGGAGCAGGGCGGCCGCACTTGGCGGAT
GCACGAGCGGTGAGCAGGGCGGTGCCGCGGGCGGCGCCGCGGGCACGGAGCAGGGCCACCGCGCTGGCAGCGAGCTTGGCGGATGCTCGGGCGACGAGCTTGCCGGACGCGCGGGCGACGAGCATGGCGCGCAGCGGCGGCTCACTCCACCGTCGACTGCTCAGCGCA
GGGCGTGGCGGCTTCTTCCCTCGGGCGCGCGGCGGCCTCTTCGCTCGGGCGCGCGGTGGCCTCTTCCCTCGAGCATGGTGCGTGACGGCCACTACAGAGGAGCTCCTCCCTGCGGCGTCGGCCACCCGACACTGCACTGGCGCGCGACTGTCCC
CCTCCTCTCTC