从网页获取中心内容
获取网页中心内容的可能方法有哪些?
我所说的中心内容是指页面中最重要的内容。
例如:在网页 http: //techcrunch.com/2011/05/27/iphone-app-notizes-you-when-your-laundrys-done/
中心内容是:
<p><img src="http://tctechcrunch.files.wordpress.com/2011/05/screen-shot-2011-05-27-at-10-11-36-pm.png" alt=""><br>
The folks that brought you <a href="http://itsthisforthat.com/">It’sthisforthat</a> have created another way to make your life just a little bit easier and funnier. Meet <a href="http://www.dryerbro.com">DryerBro</a>, an app that uses an accelerometer to let you know when your laundry’s done.</p>
<p>With DryerBro you put your iPhone or iTouch on your laundry machine and it texts you and the remaining members of your laundry party when your laundry’s done. I’m thinking this is going to be HUGE. I mean Facebook took off at colleges right?</p>
<p>Once set up, DryerBro uses an accelerometer and Twilio to send a SMS, email or call to multiple phones when your unmentionables are ready to be picked up.</p>
<p>Says creator Eric Kerr, “We live in a house with 11 dudes, and we’re seriously unorganized about laundry. We all want to use the machine on the weekends, but no one ever knows when the last load was done. It bothered me as hackers that we had the tools (accelerometer, Twilio) to solve the problem, but didn’t do anything about it.”</p>
<p>So they built DryerBro. “We originally looked to see if an app already used the accelerometer to detect when your laundry is done but we couldn’t find anything – it’s a blue ocean strategy,” he says.</p>
<p>Kerr and company are completely ridiculous, but their thing apparently works. When asked about future plans for DryerBro he told TechCrunch:</p>
<p>“Ultimately we want to build out a hyper-local group buying ad platform for laundry detergents. Rough back of the napkin calculations indicate that we’d need roughly $41 million in financing, so we’re asking friends and family to help pony up the dough. We also want to build out the map of every active dryer in the world to hang on the wall of our office.”</p>
<p>Both the DryerBro<a href="http://dryerbro.com/"> FAQ</a> and Promo video are awesome. You can download the iPhone <a href="http://itunes.apple.com/us/app/dryer-bro/id425920156?mt=8">app here.</a> Promo video below.</p>
<div style="text-align:center;">
<object type="application/x-shockwave-flash" width="620" height="300" data="http://www.vimeo.com/moogaloop.swf?clip_id=20732587&server=www.vimeo.com&fullscreen=1&show_title=1&show_byline=0&show_portrait=0&color=01AAEA">
<param name="quality" value="best">
<param name="allowfullscreen" value="true">
<param name="scale" value="showAll">
<param name="movie" value="http://www.vimeo.com/moogaloop.swf?clip_id=20732587&server=www.vimeo.com&fullscreen=1&show_title=1&show_byline=0&show_portrait=0&color=01AAEA">
<param name="wmode" value="opaque">
</object>
</div>
这方面的任何指针都是有帮助。
谢谢
What are the possible ways to get the central content of a web page?
By central content I mean the content which is most important in the page.
Eg: in the web page http://techcrunch.com/2011/05/27/iphone-app-notifies-you-when-your-laundrys-done/
the central content would be:
<p><img src="http://tctechcrunch.files.wordpress.com/2011/05/screen-shot-2011-05-27-at-10-11-36-pm.png" alt=""><br>
The folks that brought you <a href="http://itsthisforthat.com/">It’sthisforthat</a> have created another way to make your life just a little bit easier and funnier. Meet <a href="http://www.dryerbro.com">DryerBro</a>, an app that uses an accelerometer to let you know when your laundry’s done.</p>
<p>With DryerBro you put your iPhone or iTouch on your laundry machine and it texts you and the remaining members of your laundry party when your laundry’s done. I’m thinking this is going to be HUGE. I mean Facebook took off at colleges right?</p>
<p>Once set up, DryerBro uses an accelerometer and Twilio to send a SMS, email or call to multiple phones when your unmentionables are ready to be picked up.</p>
<p>Says creator Eric Kerr, “We live in a house with 11 dudes, and we’re seriously unorganized about laundry. We all want to use the machine on the weekends, but no one ever knows when the last load was done. It bothered me as hackers that we had the tools (accelerometer, Twilio) to solve the problem, but didn’t do anything about it.”</p>
<p>So they built DryerBro. “We originally looked to see if an app already used the accelerometer to detect when your laundry is done but we couldn’t find anything – it’s a blue ocean strategy,” he says.</p>
<p>Kerr and company are completely ridiculous, but their thing apparently works. When asked about future plans for DryerBro he told TechCrunch:</p>
<p>“Ultimately we want to build out a hyper-local group buying ad platform for laundry detergents. Rough back of the napkin calculations indicate that we’d need roughly $41 million in financing, so we’re asking friends and family to help pony up the dough. We also want to build out the map of every active dryer in the world to hang on the wall of our office.”</p>
<p>Both the DryerBro<a href="http://dryerbro.com/"> FAQ</a> and Promo video are awesome. You can download the iPhone <a href="http://itunes.apple.com/us/app/dryer-bro/id425920156?mt=8">app here.</a> Promo video below.</p>
<div style="text-align:center;">
<object type="application/x-shockwave-flash" width="620" height="300" data="http://www.vimeo.com/moogaloop.swf?clip_id=20732587&server=www.vimeo.com&fullscreen=1&show_title=1&show_byline=0&show_portrait=0&color=01AAEA">
<param name="quality" value="best">
<param name="allowfullscreen" value="true">
<param name="scale" value="showAll">
<param name="movie" value="http://www.vimeo.com/moogaloop.swf?clip_id=20732587&server=www.vimeo.com&fullscreen=1&show_title=1&show_byline=0&show_portrait=0&color=01AAEA">
<param name="wmode" value="opaque">
</object>
</div>
Any pointers in this regard would be helpful.
Thanks
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
metaoptimize 上有关于此主题的广泛讨论。
其中一篇文章指向 资源列表和方法概述,都在 Tomaz Kovacic 的博客上,而且都非常好。
There is an extensive discussion about this topic at metaoptimize.
One of the posts there directs to a list of resources and an overview of approaches, both on Tomaz Kovacic's blog, and both very good.
我认为你需要自动摘要它是通过描述的算法从文本中提取最中心的句子。有关其工作原理的示例,您可以查看我的几种算法的实现
I think you need automatic summarisation it is extract most central sentenences from text by described algorithms. For examples how it works you can check my implementation of several algorithms