用算术中的算术进行故障排除

发布于 01-19 07:37 字数 1217 浏览 2 评论 0原文

我正在尝试编写一个循环，该循环从txt文件的第2列（以full_results.txt结尾）摘下测序指标，并将所有内容都写入组合的TSV（理想情况下也使用标头，但我无法做到这一点）。

以下是我想要的所有列的TSV，但停止循环。在添加最后两列之前，循环一直在工作，因此我不确定是否添加带有算术的字段更改了任何内容。让我知道是否有一种更干净的方式来写这篇文章！感谢您的输入。

{
for i in *full_results.txt;
do
[ -d "$i" ] && continue
[ -s "$1" ] && continue
                total=$(awk ' $1=="NORMALIZED_READ_COUNT" { print $2 } ' $i)
                trimmed=$(awk ' $1=="RAW_FRAGMENT_TOTAL" { print $2 } ' $i)
                aligned=$(awk ' $1=="RAW_FRAGMENT_FILTERED" { print $2 } ' $i)
                molbin=$(awk ' $1=="AVERAGE_UNIQUE_DNA_READS_PER_GSP2" { print $2 } ' $i)
                startsite=$(awk ' $1=="AVERAGE_UNIQUE_DNA_START_SITES_PER_GSP2" { print $2 } ' $i)
                dedup=$(awk ' $1=="ON_TARGET_DEDUPLICATION_RATIO" { print $2 } ' $i)
                printf "%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\n" "$i" $total $trimmed $aligned $molbin $startsite $dedup $((total - trimmed)) "$(( ( (total - trimmed) * 100) / total))"
done;
} > sequencing_metrics.tsv

输出：

nr02_31_s31_merged_r1_001_full_results.txt 7095319 6207119 6206544 1224.43 391.65 2.74：1 888200 12

预期输出预期输出：与上述所有文件相同

原文

I'm trying to write a loop that pulls sequencing metrics from column 2 of a txt file (ending in full_results.txt) and writes everything into a combined tsv (ideally with headers as well, but I haven't been able to do that).

The below makes a tsv with all of the columns I want--but stopped looping. The loop was working before I added the last two columns, so I'm not sure if adding the fields with arithmetic changed anything. Let me know if there is a cleaner way to write this! Thanks for your input.

{
for i in *full_results.txt;
do
[ -d "$i" ] && continue
[ -s "$1" ] && continue
                total=$(awk ' $1=="NORMALIZED_READ_COUNT" { print $2 } ' $i)
                trimmed=$(awk ' $1=="RAW_FRAGMENT_TOTAL" { print $2 } ' $i)
                aligned=$(awk ' $1=="RAW_FRAGMENT_FILTERED" { print $2 } ' $i)
                molbin=$(awk ' $1=="AVERAGE_UNIQUE_DNA_READS_PER_GSP2" { print $2 } ' $i)
                startsite=$(awk ' $1=="AVERAGE_UNIQUE_DNA_START_SITES_PER_GSP2" { print $2 } ' $i)
                dedup=$(awk ' $1=="ON_TARGET_DEDUPLICATION_RATIO" { print $2 } ' $i)
                printf "%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\n" "$i" $total $trimmed $aligned $molbin $startsite $dedup $((total - trimmed)) "$(( ( (total - trimmed) * 100) / total))"
done;
} > sequencing_metrics.tsv

Output: