如何保留两个文本文件中的唯一行并丢弃重复项？

发布于 2024-10-20 04:49:26 字数 255 浏览 4 评论 0原文

我有2个文件。

例如，文件 #1 的内容是：

hi1
hi2
hi4

... 文件 #2 的内容是：

hi1
hi4
hi3
hi5

我想整理这些文档，以便第三个文件只包含：

hi2
hi3
hi5

有人能把我扔到正确的方向吗？我急需！需要 Perl，但也接受 C/C++。

原文

I have 2 files.

For example, the content of file #1 is:

hi1
hi2
hi4

… of file #2 is:

hi1
hi4
hi3
hi5

I would like to sort out these documents so that a third file would contain just:

hi2
hi3
hi5

Can anyone toss me in the right direction? I'm in dire need! Perl is wanted, but C/C++ is accepted.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

醉生梦死 2024-10-27 04:49:26

我知道您要求使用 perl 或 C，但在 Unix 中（或使用 MKS 或 Windows 上的等效 Unix 工具包）：

sort file1 file2 | uniq -u > file3

没有比这更简单的了。

I know you asked for perl or C, but in Unix (or with MKS or equivalent Unix on Windows toolkit):

sort file1 file2 | uniq -u > file3

It doesn't get much simpler than that.

回复收藏 0 原文

木有鱼丸 2024-10-27 04:49:26

这里有一些快速的代码可以完成您想要的操作。没有错误检查，并且我假设您的文本文件不会太大，以至于通过将所有文本加载到哈希数组中会耗尽内存。

open(FILE1, "< file1.txt");
open(FILE2, "< file2.txt");

@file1 = <FILE1>;
@file2 = <FILE2>;

foreach $line (@file1, @file2)
{
    chomp($line);
    $TEXT{$line}++;
}

foreach $line (sort keys %TEXT)
{
    if ($TEXT{$line} == 1)
    {
         print $line . "\n";
    }
}

Here's a quick bit of code to do what you want. There's no error checking, and I'm assuming that your text files are not so huge that you'll run out of memory by loading all the text into a hash array.

open(FILE1, "< file1.txt");
open(FILE2, "< file2.txt");

@file1 = <FILE1>;
@file2 = <FILE2>;

foreach $line (@file1, @file2)
{
    chomp($line);
    $TEXT{$line}++;
}

foreach $line (sort keys %TEXT)
{
    if ($TEXT{$line} == 1)
    {
         print $line . "\n";
    }
}

回复收藏 0 原文

囍笑 2024-10-27 04:49:26

计算每一行的数量，然后打印出计数为 1 的行：

#!/usr/bin/perl
use warnings;
use strict;

local @ARGV = ('file.1', 'file.2');
my %lines;
while (<>) {
    $lines{$_}++;
}

print sort grep $lines{$_} == 1, keys %lines;

Count each line, then print out the ones where the count is one:

#!/usr/bin/perl
use warnings;
use strict;

local @ARGV = ('file.1', 'file.2');
my %lines;
while (<>) {
    $lines{$_}++;
}

print sort grep $lines{$_} == 1, keys %lines;

回复收藏 0 原文