A better, faster Scheme function?

Posted on 2025-01-05 06:19:05

So finding the maximum element in a list takes O(n) time (if the list has n elements). I tried to implement an algorithm that looks like it should be faster.

(define (clever-max lst)
  ;; Every other element of a-list, starting with the first.
  (define (odd-half a-list)
    (cond ((null? a-list) (list))
          ((null? (cdr a-list))
           (cons (car a-list) (list)))
          (else
           (cons (car a-list)
                 (odd-half (cdr (cdr a-list)))))))
  ;; Every other element of a-list, starting with the second.
  (define (even-half a-list)
    (if (null? a-list)
        (list)
        (odd-half (cdr a-list))))
  (cond ((null? lst) (error "no elements in list!"))
        ((null? (cdr lst)) (car lst))
        (else
         (let ((l1 (even-half lst))
               (l2 (odd-half lst)))
           (max (clever-max l1) (clever-max l2))))))

Is this actually faster?! What would you say the asymptotic time complexity is (tight bound)?


Comments (1)

聆听风音 2025-01-12 06:19:05

Given a list of data which you know nothing about, there is no way to find the maximum element without examining every element, and thus it takes O(n) time: if you skip an element, you might miss the maximum. So no, your algorithm isn't faster than O(n); it is in fact O(n log n), because you are basically just running merge sort.
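
A quick sketch of why, with n the list length: every call to clever-max walks the whole list once (odd-half and even-half between them touch every element) and then recurses on two lists of half the size, which is exactly the merge-sort recurrence:

T(n) = 2T(n/2) + Θ(n)
     = Θ(n log n)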

Here is more data on the selection problem.

I thought about it and realized I should probably do something a bit more than just state this as fact, so I've coded up a quick speed test. Full disclosure: I'm not a Scheme programmer, so this is in Common Lisp, but I think I converted your algorithm faithfully.

;; Direct "iteration" method -- theoretical O(n)
(defun find-max-001 ( list )
  (labels ((fm ( list cur )
             (if (null list) cur
               (let ((head (car list))
                     (rest (cdr list)))
                 (fm rest (if (> head cur) head cur))))))
    (fm (cdr list) (car list))))

;; Your proposed method
(defun find-max-002 ( list )
  ;; odd-half/even-half cons up two fresh half-lists on every call;
  ;; the recursion is not in tail position because of the cons.
  (labels ((odd-half ( list )
             (cond ((null list) list)
                   ((null (cdr list)) (list (car list)))
                   (T (cons (car list) (odd-half (cddr list))))))
           (even-half ( list )
             (if (null list) list (odd-half (cdr list)))))
    (cond ((null list) list)
          ((null (cdr list)) (car list))
          (T (let ((l1 (even-half list))
                   (l2 (odd-half list)))
               (max (find-max-002 l1) (find-max-002 l2)))))))

;; Simplistic speed test              
(let ((list (loop for x from 0 to 10000 collect (random 10000))))
  (progn
    (print "Running find-max-001")
    (time (find-max-001 list))
    (print "Running find-max-002")
    (time (find-max-002 list))))

Now you may be asking yourself why I am only using 10000 for the list size, since that is really fairly small for asymptotic measurements. The truth is that SBCL recognizes that the first function is tail recursive and compiles it into a loop, whereas it doesn't with the second, so this is about as big a list as I could use without blowing the stack. As you can see from the results below, though, it is large enough to illustrate the point.
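
(In case the tail-position point is not obvious, here is the difference, annotated; these are the two recursive calls from the functions above:)

;; find-max-001: the recursive call is the last thing fm does, so the
;; current stack frame can be reused -- effectively a jump:
(fm rest (if (> head cur) head cur))

;; find-max-002: the result of the recursive call is consumed by cons,
;; so a stack frame stays live for every element of the half-list:
(cons (car list) (odd-half (cddr list)))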

"Running find-max-001"
Evaluation took:
  0.000 seconds of real time
  0.000000 seconds of total run time (0.000000 user, 0.000000 system)
  100.00% CPU
  128,862 processor cycles
  0 bytes consed

"Running find-max-002"
Evaluation took:
  0.012 seconds of real time
  0.012001 seconds of total run time (0.012001 user, 0.000000 system)
  [ Run times consist of 0.008 seconds GC time, and 0.005 seconds non-GC time. ]
  100.00% CPU
  27,260,311 processor cycles
  2,138,112 bytes consed

Even at this size we are talking about a massive slowdown. The list has to grow to about one million items before the direct check-every-item-once method takes as long as your algorithm took on 10,000 items:

(let ((x (loop for x from 0 to 1000000 collect (random 1000000))))
  (time (find-max-001 x)))

Evaluation took:
  0.007 seconds of real time
  0.008000 seconds of total run time (0.008000 user, 0.000000 system)
  114.29% CPU
  16,817,949 processor cycles
  0 bytes consed
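
Some rough per-element arithmetic from the two runs backs this up:

27,260,311 cycles / 10,001 items    ≈ 2,700 cycles per element  (find-max-002, 10k list)
16,817,949 cycles / 1,000,001 items ≈ 17 cycles per element     (find-max-001, 1M list)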

Final Thoughts and Conclusions

So the next question that has to be asked is why the second algorithm really takes that much longer. Without going into too much detail about tail-call elimination, there are a few things that really jump out.

The first is cons. Yes, cons is O(1), but it's still another operation for the system to go through, and it requires the system to allocate and free memory (the garbage collector has to be fired up). The second thing that really jumps out is that you are basically running a merge sort, except rather than grabbing the lower and upper halves of the list you are grabbing the even- and odd-indexed nodes (which also takes longer, because you have to walk the list every time to build them). What you have here is at best an O(n log n) algorithm (mind you, it's merge sort, which is really good for sorting), and it carries a lot of extra overhead.
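
As a footnote, if you just want the maximum of a non-empty list in Common Lisp, the direct O(n) scan is a one-liner; a minimal sketch, equivalent in spirit to find-max-001:

;; One O(n) pass, no consing: fold max over a non-empty list.
(let ((xs (loop for i from 0 to 10000 collect (random 10000))))
  (reduce #'max xs))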
