寻找.txt词频列表来测试程序

发布于 2024-07-20 06:51:24 字数 143 浏览 11 评论 0原文

我想要一个包含 200-1000 个左右最常用英语单词的文件。 我已经能够找到包含 200,000 个单词或其他内容的荒谬列表,但没有找到包含更少量更常用单词的列表。

最好每行一个单词,但如果不是,我可以对其进行格式化。

谢谢!

I'd like a file of the 200-1000 or so most frequently used words in the English language. I've been able to find ridiculous lists of 200,000 words or whatever, but nothing with a smaller set of the more frequently used words.

Preferably the words would be one per line but if it's not then I can format it.

THANKS!

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(4

红颜悴 2024-07-27 06:51:24

我在谷歌上搜索“按频率排列的英语单词”,找到了一些很好的来源。 这是wiktionary.org 上的一个

I searched Google for "english words by frequency" and found a number of good sources. Here is one on wiktionary.org.

‖放下 2024-07-27 06:51:24

这里是前 500 名。您可以从 HTML 中抓取该列表。

Here's the top 500. You can probably scrape out the list from the HTML.

半﹌身腐败 2024-07-27 06:51:24

这是来自 McWafflestix 链接的前 250 个(您强调少即是多),直接向上,没有多余的空格等,这要归功于 emacs 中的kill-rectangle。 我不得不说,这是一个非常琐碎且与编程无关的问题。

the
of
to
and
a
in
is
it
you
that
he
was
for
on
are
with
as
I
his
they
be
at
one
have
this
from
or
had
by
hot
but
some
what
there
we
can
out
other
were
all
your
when
up
use
word
how
said
an
each
she
which
do
their
time
if
will
way
about
many
then
them
would
write
like
so
these
her
long
make
thing
see
him
two
has
look
more
day
could
go
come
did
my
sound
no
most
number
who
over
know
water
than
call
first
people
may
down
side
been
now
find
any
new
work
part
take
get
place
made
live
where
after
back
little
only
round
man
year
came
show
every
good
me
give
our
under
name
very
through
just
form
much
great
think
say
help
low
line
before
turn
cause
same
mean
differ
move
right
boy
old
too
does
tell
sentence
set
three
want
air
well
also
play
small
end
put
home
read
hand
port
large
spell
add
even
land
here
must
big
high
such
follow
act
why
ask
men
change
went
light
kind
off
need
house
picture
try
us
again
animal
point
mother
world
near
build
self
earth
father
head
stand
own
page
should
country
found
answer
school
grow
study
still
learn
plant
cover
food
sun
four
thought
let
keep
eye
never
last
door
between
city
tree
cross
since
hard
start
might
story
saw
far
sea
draw
left
late
run
don't
while
press
close
night
real
life
few
stop

Here's the top 250 (you emphasized less is more) from McWafflestix's link, straight up, no extraneous spaces, etc, thanks to kill-rectangle in emacs. I have to say, this is a pretty trivial and non-programming-related.

the
of
to
and
a
in
is
it
you
that
he
was
for
on
are
with
as
I
his
they
be
at
one
have
this
from
or
had
by
hot
but
some
what
there
we
can
out
other
were
all
your
when
up
use
word
how
said
an
each
she
which
do
their
time
if
will
way
about
many
then
them
would
write
like
so
these
her
long
make
thing
see
him
two
has
look
more
day
could
go
come
did
my
sound
no
most
number
who
over
know
water
than
call
first
people
may
down
side
been
now
find
any
new
work
part
take
get
place
made
live
where
after
back
little
only
round
man
year
came
show
every
good
me
give
our
under
name
very
through
just
form
much
great
think
say
help
low
line
before
turn
cause
same
mean
differ
move
right
boy
old
too
does
tell
sentence
set
three
want
air
well
also
play
small
end
put
home
read
hand
port
large
spell
add
even
land
here
must
big
high
such
follow
act
why
ask
men
change
went
light
kind
off
need
house
picture
try
us
again
animal
point
mother
world
near
build
self
earth
father
head
stand
own
page
should
country
found
answer
school
grow
study
still
learn
plant
cover
food
sun
four
thought
let
keep
eye
never
last
door
between
city
tree
cross
since
hard
start
might
story
saw
far
sea
draw
left
late
run
don't
while
press
close
night
real
life
few
stop
韬韬不绝 2024-07-27 06:51:24

可以编写一个简单的解决方案,虽然未经测试,但应该是 99% 好的。

<?php
$fh = fopen('http://domain.tld/path/tofile.txt', 'r');
$wordList = array();
for($i=0;$i<100;$i++)
    $wordList[] = fread($fh, 1024);
print_r($wordList);
?>

A simple solution could be writen this is untested but should be 99% good.

<?php
$fh = fopen('http://domain.tld/path/tofile.txt', 'r');
$wordList = array();
for($i=0;$i<100;$i++)
    $wordList[] = fread($fh, 1024);
print_r($wordList);
?>
~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文