有人用过pubchemdb吗?有类似的API吗?
更新:答案中的链接既有趣又有用,但不幸的是没有解决对 java API 的需求,所以我仍然期待任何输入。
我正在构建一个化学数据库化合物。我需要所有同义词(IUPAC 和通用名称)以及每个同义词的安全数据。
我将使用 PubChem (http://pubchem.ncbi.nlm.nih.gov/) 上免费提供的数据,
有一种使用简单的 HTTP 获取来查询每个化合物的简单方法。例如,要获取甘油数据,URL 为:
http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=753
以下 URL 将返回易于解析的格式:
http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=753&disopt=DisplaySDF
但它只会响应非常基本的信息,缺乏安全数据,只有一些常见名称。
有一个 JAVA 公共域 API,看起来非常完整,由 Scripps 的一组开发(引用)。代码位于此处。
不幸的是,这个 API 没有很好的文档记录,并且由于所涉及的数据的复杂性,很难遵循。 据我收集的信息,pubchemdb 使用 PubChem 高级用户网关 (PUG) XML API< /a>
有人使用过这个 API(或任何其他可用的 API)吗?我希望获得有关如何开始使用它的简短描述或教程。
Update: The link in the answer is both interesting and useful, but unfortunately does not address the need for a java API, so I am still looking forward to any input.
I'm building a database of chemical compounds. I need all the synonyms (IUPAC and common names) as well as safety data for each.
I'll be using the freely available data at PubChem (http://pubchem.ncbi.nlm.nih.gov/)
There's an easy way of querying each compound with simple HTTP gets. For example, to obtain glycerol data, the URL is:
http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=753
And the following URL would return an easy to parse format:
http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=753&disopt=DisplaySDF
but it will respond only very basic info, lacking safety data and only a few common names.
There is one public domain API for JAVA that seems a very complete, developed by a group at Scripps (citation). The code is here.
Unfortunately, this API is not very well documented and it's quite difficult to follow due to the complexity of the data involved.
For what I gathered, pubchemdb is using the PubChem Power User Gateway (PUG) XML API
Has anyone used this API (or any other one available)? I would appreciate a short description or tutorial on how to start with it.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
Cactvs Chemoinformatics 工具包(免费供学术/教育使用)具有完整的 PubChem 集成。使用脚本环境,您可以轻松地执行类似的操作
,这隐藏了所有 PUG 的丑陋之处 - 但无论如何,我敢说 PUG 有详细记录。该工具包远远超出了简单的数据下载 - 如果您愿意,您甚至可以像本地 SD 文件一样打开和查询 PubChem。
不过,PubChem 不包含安全数据。安全数据取决于国家/地区,受到严格监管,您应该非常小心,不要承担责任。请法律人员检查您的做法!
The Cactvs Chemoinformatics toolkit (free for academic/educational use) has full PubChem integration. Using the scripting environment, you can easily do something like
This hides all PUG ugliness - but in any case, I dare say that PUG is well documented. The toolkit goes much beyond simple data downloads - you can even open and query PubChem like a local SD file if you want to.
PubChem does not contain safety data, though. And safety data is country/region-dependent, strictly regulated, and you should be really careful not to be hit with liabilities. Have your approach checked by legal personnel!