在这种情况下使用生成器有什么好处？

发布于 2024-11-10 09:27:08 字数 864 浏览 2 评论 0原文

我正在从这张幻灯片中学习Python的生成器： http://www.dabeaz.com/generators/Generators .pdf
里面有一个例子，可以这样描述：
你有一个名为log.txt的日志文件，编写一个程序来观察它的内容，如果有新行添加到其中，则打印它们。两种解决方案：

1. with generator:  

    import time

    def follow(thefile):
        while True:
            line = thefile.readline()
            if not line:
                time.sleep(0.1)
                continue
            yield line

    logfile = open("log.txt")
    loglines = follow(logfile)
    for line in loglines:
        print line


2. Without generator:  

    import time

    logfile = open("log.txt")

    while True:
        line = logfile.readline()
        if not line:
            time.sleep(0.1)
            continue
        print line

这里使用生成器有什么好处？

原文

I'm learning Python's generator from this slide: http://www.dabeaz.com/generators/Generators.pdf
There is an example in it, which can be describe like this:
You have a log file called log.txt, write a program to watch the content of it, if there are new line added to it, print them. Two solutions:

1. with generator:  

    import time

    def follow(thefile):
        while True:
            line = thefile.readline()
            if not line:
                time.sleep(0.1)
                continue
            yield line

    logfile = open("log.txt")
    loglines = follow(logfile)
    for line in loglines:
        print line


2. Without generator:  

    import time

    logfile = open("log.txt")

    while True:
        line = logfile.readline()
        if not line:
            time.sleep(0.1)
            continue
        print line

What's the benefit of using generator here?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

玉环 2024-11-17 09:27:08

如果你只有一把锤子，那么一切看起来都像钉子

我几乎只想用上面的引文来回答这个问题。仅仅因为你可以这样做并不意味着你需要一直这样做。

但从概念上讲，生成器版本分离了功能，follow 函数的目的是封装在等待新输入时从文件中连续读取的内容。这使您可以在循环中使用您想要的新行执行任何操作。在第二个版本中，从文件读取并打印的代码与控制循环混合在一起。在这个小例子中，这可能并不是真正的问题，但这是您可能需要考虑的问题。

回复收藏 0 原文

℉絮湮 2024-11-17 09:27:08

一个好处是能够传递生成器（例如不同的函数）并通过调用 .next() 手动迭代。这是初始生成器示例的稍作修改的版本：

import time

def follow(file_name):
    with open(file_name, 'rb') as f:
        for line in f:
            if not line:
                time.sleep(0.1)
                continue
            yield line

loglines = follow(logfile)
first_line = loglines.next()
second_line = loglines.next()
for line in loglines:
    print line

首先，我使用上下文管理器打开了文件（with 语句，当您完成使用它时，它会自动关闭文件，或者在例外）。接下来，在底部我演示了如何使用 .next() 方法，允许您手动单步执行。如果您需要从简单的 for item in gen 循环中打破逻辑，有时这会很有用。

One benefit is the ability to pass your generator around (say to different functions) and iterate manually by calling .next(). Here is a slightly modified version of your initial generator example:

import time

def follow(file_name):
    with open(file_name, 'rb') as f:
        for line in f:
            if not line:
                time.sleep(0.1)
                continue
            yield line

loglines = follow(logfile)
first_line = loglines.next()
second_line = loglines.next()
for line in loglines:
    print line

First of all I opened the file with a context manager (with statement, which auto-closes the file when you're done with it, or on exception). Next, at the bottom I've demonstrated using the .next() method, allowing you to manually step through. This can be useful sometimes if you need to break logic out from a simple for item in gen loop.

回复收藏 0 原文

蓝戈者 2024-11-17 09:27:08

生成器函数的定义与普通函数类似，但每当它需要生成一个值时，它都会使用yield关键字而不是return来生成。它的主要优点是它允许代码随着时间的推移生成一系列值，而不是立即计算它们并像列表一样将它们发送回。例如，

# A Python program to generate squares from 1
# to 100 using yield and therefore generator

# An infinite generator function that prints
# next square number. It starts with 1
def nextSquare():
    i = 1;

    # An Infinite loop to generate squares 
    while True:
        yield i*i                
        i += 1  # Next execution resumes 
                # from this point     

# Driver code to test above generator 
# function
for num in nextSquare():
    if num > 100:
         break   
    print(num)

Return 将指定的值发送回其调用者，而 Yield 可以生成一系列值。当我们想要迭代一个序列，但又不想将整个序列存储在内存中时，我们应该使用yield。

A generator function is defined like a normal function, but whenever it needs to generate a value, it does so with the yield keyword rather than return. Its main advantage is it allows its code to produce a series of values over time, rather than computing them at once and sending them back like a list.For example

# A Python program to generate squares from 1
# to 100 using yield and therefore generator

# An infinite generator function that prints
# next square number. It starts with 1
def nextSquare():
    i = 1;

    # An Infinite loop to generate squares 
    while True:
        yield i*i                
        i += 1  # Next execution resumes 
                # from this point     

# Driver code to test above generator 
# function
for num in nextSquare():
    if num > 100:
         break   
    print(num)

Return sends a specified value back to its caller whereas Yield can produce a sequence of values. We should use yield when we want to iterate over a sequence, but don’t want to store the entire sequence in memory.

回复收藏 0 原文

梦幻之岛 2024-11-17 09:27:08

理想情况下，大多数循环大致具有以下形式：

for element in get_the_next_value():
     process(element)

但是有时（如示例#2 中），循环实际上更复杂，因为您有时会获得一个元素，有时则不会。这意味着在没有该元素的示例中，您将用于生成元素的代码与用于处理该元素的代码混合在一起。它在示例中没有显示得太清楚，因为生成下一个值的代码实际上并不太复杂，并且处理只是一行，但示例 1 更清晰地分离了这两个概念。

一个更好的例子可能是处理文件中的可变长度段落，并用空行分隔每个段落：尝试使用和不使用生成器编写代码，您应该会看到好处。

Ideally most loops are roughly of the form:

for element in get_the_next_value():
     process(element)

However sometimes (as in your example #2), the loop is actually more complex as you sometimes get an element and sometimes don't. That means in your example without the element you have mixed up code for generating an element with the code for processing it. It doesn't show too clearly in the example because the code to generate the next value isn't actually too complex and the processing is just one line, but example number 1 is separating these two concepts more cleanly.

A better example might be one that processes variable length paragraphs from a file with blank lines separating each paragraph: try writing code for that with and without generators and you should see the benefit.

回复收藏 0 原文