寻找人工智能 (AI) 食谱读者研究
我正在寻找有关阅读食谱的人工智能技术的研究(已发表)。菜谱是一个非常有限的领域,可以在自然语言识别引擎中以一定程度的准确性实现。
我想编写一个程序,允许将食谱从网络浏览器复制/粘贴到人工智能中,并让它通过“阅读”食谱来确定标题、作者、成分、说明、营养信息等。我还希望能够处理 PDF 文件(我有一个很大的收藏),也许也只是使用复制/粘贴。
输出将是某种(标准)基于 XML 的格式,可供菜谱组织者读取。
我想到的是博士或硕士水平的工作。
I am looking for research (published) on AI techniques for reading cookbook recipes. Recipes are a very limited domain that might be doable in a natural language recognition engine with some degree of accuracy.
I have in mind writing a program that would allow copy/pasting a recipe from a web browser into the AI and having it determine the title, author, ingredients, instructions, nutritional information, etc. by "reading" the recipe. I would also like to be able to process PDF files (I have a large collection), maybe also just using copy/paste.
The output will be some kind of (standard) XML-based format that can be read by a recipe organizer.
I have in mind PhD or Masters-level work.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可能会发现相关的人工智能子领域是信息提取。
信息提取算法通常通过使用规则(例如正则表达式)来识别文本中的实体和关系。这些规则可以手动定义(即 Suiseki 算法),也可以通过监督机器学习算法(即 RAPIER、Wrapper Induction、条件随机场)学习。
例如,信息提取算法可能会从职位发布中获取数据:
职位
:高级 DBMS 顾问位置
:德克萨斯州达拉斯职责
:DBMS 应用顾问与项目团队合作,定义基于 DBMS 的解决方案,支持电子商务、销售人员自动化和客户服务应用程序的企业部署。所需要求
:3-5 年经验。使用 Visual Basic、C/C++、Powerbuilder、Progress 或类似工具开发 Oracle 或 SQL Server 应用程序。最近在两个开发环境中安装和配置 Oracle 或 SQL Server 的经验。和部署环境。所需技能
:了解 UNIX 或 NT、脚本语言。了解结构化软件工程和项目管理的原理...并将其提炼到此模板中:
标题
:高级 DBMS 顾问状态
:TX城市
:达拉斯国家/地区
:美国语言
:Powerbuilder、Progress、C、C++、Visual Basic平台
:UNIX、NT应用程序
:SQL Server、Oracle领域
:电子商务、客户服务所需经验年数
:3所需经验年限
:5Ray Mooney和他在德克萨斯大学奥斯汀分校的团队在信息提取方面做了一些伟大的工作。以下是一些可能成为良好起点的参考文献:
One subfield of AI that you might find relevant is information extraction.
Information extraction algorithms often work by using rules (e.g. regular expressions) to identify entities and relations in text. These rules can either be defined by hand (i.e. the Suiseki algorithm) or learned with supervised machine learning algorithms (i.e. RAPIER, Wrapper Induction, Conditional Random Fields).
For example, an information extraction algorithm might grab data from a job posting:
Job Title
: Senior DBMS ConsultantLocation
: Dallas,TXResponsibilities
: DBMS Applications consultant works with project teams to define DBMS based solutions that support the enterprise deployment of Electronic Commerce, Sales Force Automation, and Customer Service applications.Desired Requirements
: 3-5 years exp. developing Oracle or SQL Server apps using Visual Basic, C/C++, Powerbuilder, Progress, or similar. Recent experience related to installing and configuring Oracle or SQL Server in both dev. and deployment environments.Desired Skills
: Understanding of UNIX or NT, scripting language. Know principles of structured software engineering and project management...and distill it into this template:
title
: Senior DBMS Consultantstate
: TXcity
: Dallascountry
: USlanguage
: Powerbuilder, Progress, C, C++, Visual Basicplatform
: UNIX, NTapplication
: SQL Server, Oraclearea
: Electronic Commerce, Customer Servicerequired years of experience
: 3desired years of experience
: 5Ray Mooney and his group at the University of Texas at Austin have done some great work in information extraction. Here are some references that might make good jumping-off points: