在哪里可以找到 Twitter 消息存档或搜索旧推文?
此前,谷歌推出了一款应用程序,可以像新闻时间线一样按时间搜索 Twitter 消息。但现在似乎只能提供实时(当前)消息索引,而不能提供旧的和历史上的所有推文。我想对推文进行研究,但不知道在哪里下载或访问基于时间线、地理位置、人口统计或主题列表的此类数据。
Previously Google launched an application that can search Twitter message OVER Time like in news timeline. But it seems it now can only provide real time (current) message index, not the old and all tweets in history. I want to do research on tweets, but do not know where to download or access to such data based on timeline or geography or demographic or topic list.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
旧的推文不能公开访问——即使是对那些写它们的人来说也是如此。
也许你应该联系 Twitter。美国国会图书馆显然也在存档这些数据。
如果这是合法的(基于大学的)研究活动,您可能可以从其中任何一个获得访问权限。
补充:Twitter 曾制作过一些语料库,但应 Twitter 的要求将其从分发中删除。流式 API 使您可以在几个小时/几天内轻松构建自己的语料库,其大小相当不错,但我不知道有任何可用于分发的语料库。根据您的应用程序,社交媒体和博客国际会议拥有(TB 级)数据可用于研究,但我不知道是否包含来自 Twitter 的任何内容。
The old tweets are not publicly accessible - even to the people who wrote them.
Perhaps you should contact Twitter. The US Library of Congress apparently is archiving this data too.
You might be able to get access from either of these if it's a legitimate (university based) research activity.
Addition: There have been a few corpora made from Twitter, but they were removed from distribution at Twitter's request. The streaming API makes it pretty easy to build your own corpus in a few hours/days of a pretty decent size, but I don't know of any that are available for distribution. Depending on your application, the International Conference on Social Media and Weblogs has (terabytes of) data available for research, but I don't know if anything from twitter is included.