Hpricot 解析 URI 中的特殊字符时出错

发布于 2024-08-20 04:03:34 字数 796 浏览 9 评论 0原文

我正在编写一个 ruby 脚本来从雅虎获取历史股票价格，使用 Hpricot 来解析页面。这基本上是直截了当的：网址是“http://finance.yahoo.com/q/ hp?s=TickerSymbol" 例如，要查找 Google，我会使用“http://finance.yahoo.com/q/hp?s=GOOG"

不幸的是，当我查找指数价格时，它崩溃了。索引以插入符号为前缀，例如“http://finance.yahoo.com/ q/hp?s=^DJI”为道指。

该行：

ticker_symbol = '^DJI'
doc = Hpricot(open("http://finance.yahoo.com/q/hp?s=#{ticker_symbol}"))

抛出此异常：

bad URI(is not URI?): http://finance.yahoo.com/q/hp?s=^DJI

Hpricot 在插入符号上被阻塞（我认为是因为底层 Ruby URI 库确实如此）。有没有办法逃避该角色或迫使图书馆尝试它？

原文

I'm working on a ruby script to grab historical stock prices from Yahoo, using Hpricot to parse the pages. This is mostly straighforward: the url is "http://finance.yahoo.com/q/hp?s=TickerSymbol" For example, to look up Google, I would use "http://finance.yahoo.com/q/hp?s=GOOG"

Unfortunately, it breaks down when I'm looking up the price of an index. The indexes are prefixed with a caret, such as "http://finance.yahoo.com/q/hp?s=^DJI" for the Dow.

The line:

ticker_symbol = '^DJI'
doc = Hpricot(open("http://finance.yahoo.com/q/hp?s=#{ticker_symbol}"))

throws this exception:

bad URI(is not URI?): http://finance.yahoo.com/q/hp?s=^DJI

Hpricot chokes on the caret (I think because the underlying Ruby URI library does). Is there a way to escape that character or force the library to try it?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

云淡风轻 2024-08-27 04:03:34

嗯，我不觉得自己很蠢吗？再过五分钟，我就开始工作了：

doc = Hpricot(open(URI.encode("http://finance.yahoo.com/q/hp?s=#{ticker_symbol}")))

所以如果其他人想知道，那就这么做吧。捂脸

Well, don't I feel dumb. Five more minutes and I got this working:

doc = Hpricot(open(URI.encode("http://finance.yahoo.com/q/hp?s=#{ticker_symbol}")))

So if anyone else is wondering, that's how you do it. facepalm

回复收藏 0 原文

盛夏已如深秋| 2024-08-27 04:03:34

^ 的转义为 %5E；您可以直接替换 URL。

http://finance.yahoo.com/q/hp?s=%5EDJI< /a>

回复收藏 0 原文

~没有更多了~

关于作者

恰似旧人归

暂无简介

文章

27 人气

关注发私信

Promise

文章 0 评论 0

关注

qq_lbRlsh

文章 0 评论 0

关注

待＂谢繁草

文章 0 评论 0

关注

yy2010hell

文章 0 评论 0

关注

漫无边际

文章 0 评论 0

关注

傲娇萝莉攻

文章 0 评论 0

友情链接

文江博客

Hpricot 解析 URI 中的特殊字符时出错

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

Promise

qq_lbRlsh

待＂谢繁草

yy2010hell

漫无边际

傲娇萝莉攻

友情链接

Hpricot 解析 URI 中的特殊字符时出错

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

Promise

qq_lbRlsh

待＂谢繁草

yy2010hell

漫无边际

傲娇萝莉攻

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。