Looking for research on AI recipe readers

Posted on 2024-09-15 18:36:55

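I'm looking for published research on AI techniques for reading recipes. Recipes are a very limited domain, so a natural language recognition engine should be able to handle them with a reasonable degree of accuracy.

I'd like to write a program that lets me copy and paste a recipe from a web browser into the AI and have it "read" the recipe to determine the title, author, ingredients, instructions, nutrition information, and so on. I'd also like to be able to process PDF files (I have a large collection), perhaps also just by copy and paste.

The output would be some sort of (standard) XML-based format that recipe organizers can read.

What I have in mind is PhD- or Masters-level work.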

尾戒 2024-09-22 18:36:56

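One subfield of AI that you might find relevant is information extraction.

Information extraction algorithms often work by using rules (e.g. regular expressions) to identify entities and relations in text. These rules can either be defined by hand (e.g. the Suiseki algorithm) or learned with supervised machine learning algorithms (e.g. RAPIER, Wrapper Induction, Conditional Random Fields).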
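To make the idea of a hand-defined rule concrete for the recipe domain in the question, here is a minimal sketch of a single regular-expression rule for ingredient lines. The unit list, the `INGREDIENT_RULE` pattern, and the `parse_ingredient` helper are all assumptions made for illustration; they are not taken from Suiseki, RAPIER, or any other system named above.

```python
import re
from fractions import Fraction

# Hypothetical, deliberately incomplete list of units for illustration.
UNITS = r"cups?|tablespoons?|tbsp|teaspoons?|tsp|grams?|g|kg|ml|l|oz|ounces?|pounds?|lbs?"

# Hand-written rule: a leading quantity, an optional unit, then the ingredient name.
INGREDIENT_RULE = re.compile(
    rf"^\s*(?P<quantity>\d+(?:\s+\d+/\d+|/\d+|\.\d+)?)\s*"  # 2, 1/2, 1 1/2, 2.5
    rf"(?:(?P<unit>{UNITS})\b\.?\s+)?"                       # optional unit, e.g. "tsp."
    rf"(?P<name>.+?)\s*$",                                   # rest of the line
    re.IGNORECASE,
)


def parse_ingredient(line):
    """Return quantity, unit, and name for one ingredient line, or None if the rule does not fire."""
    match = INGREDIENT_RULE.match(line)
    if match is None:
        return None
    # "1 1/2" is split into "1" and "1/2" and summed as exact fractions.
    quantity = float(sum(Fraction(part) for part in match.group("quantity").split()))
    unit = match.group("unit")
    return {
        "quantity": quantity,
        "unit": unit.lower() if unit else None,
        "name": match.group("name"),
    }
```

A line such as "salt to taste" has no leading quantity, so this rule returns `None` for it. Gaps like that are typical of hand-written rules and are one motivation for learning the rules from labeled examples instead.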

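For example, an information extraction algorithm might grab data from a job posting:

Job Title: Senior DBMS Consultant
Location: Dallas, TX
Responsibilities: DBMS Applications consultant works with project teams to define DBMS based solutions that support the enterprise deployment of Electronic Commerce, Sales Force Automation, and Customer Service applications.
Desired Requirements: 3-5 years exp. developing Oracle or SQL Server apps using Visual Basic, C/C++, Powerbuilder, Progress, or similar. Recent experience related to installing and configuring Oracle or SQL Server in both dev. and deployment environments.
Desired Skills: Understanding of UNIX or NT, scripting language. Know principles of structured software engineering and project management

...and distill it into this template:

title: Senior DBMS Consultant
state: TX
city: Dallas
country: US
language: Powerbuilder, Progress, C, C++, Visual Basic
platform: UNIX, NT
application: SQL Server, Oracle
area: Electronic Commerce, Customer Service
required years of experience: 3
desired years of experience: 5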
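As a rough sketch of how hand-written rules could fill part of that template, the Python below applies a few regular expressions and small term lists to a shortened copy of the posting. The term lists (`LANGUAGES`, `PLATFORMS`, `APPLICATIONS`) and the helper names are hypothetical and chosen only to fit this one posting; a learned extractor would induce comparable rules from labeled examples instead. The `country` and `area` slots are left out, since `country` in particular needs background knowledge that is not in the text.

```python
import re

# Shortened copy of the job posting above.
POSTING = """\
Job Title: Senior DBMS Consultant
Location: Dallas, TX
Desired Requirements: 3-5 years exp. developing Oracle or SQL Server apps using Visual Basic, C/C++, Powerbuilder, Progress, or similar.
Desired Skills: Understanding of UNIX or NT, scripting language.
"""

# Hypothetical term lists (gazetteers), written by hand for this example.
LANGUAGES = ["Visual Basic", "C++", "C", "Powerbuilder", "Progress"]
PLATFORMS = ["UNIX", "NT"]
APPLICATIONS = ["Oracle", "SQL Server"]


def find_terms(text, terms):
    """Return the terms that occur in text as whole tokens, in list order."""
    found = []
    for term in terms:
        # Lookarounds instead of \b so that both "C" and "C++" match in "C/C++".
        pattern = rf"(?<![\w+]){re.escape(term)}(?![\w+])"
        if re.search(pattern, text):
            found.append(term)
    return found


def extract(posting):
    """Fill a partial template from one posting using hand-written rules."""
    template = {}

    title = re.search(r"^Job Title:\s*(.+)$", posting, re.MULTILINE)
    if title:
        template["title"] = title.group(1).strip()

    location = re.search(r"^Location:\s*([^,\n]+),\s*([A-Z]{2})\s*$", posting, re.MULTILINE)
    if location:
        template["city"] = location.group(1).strip()
        template["state"] = location.group(2)

    # "3-5 years" is read as required = 3, desired = 5, as in the template above.
    years = re.search(r"(\d+)\s*-\s*(\d+)\s+years", posting)
    if years:
        template["required years of experience"] = int(years.group(1))
        template["desired years of experience"] = int(years.group(2))

    template["language"] = find_terms(posting, LANGUAGES)
    template["platform"] = find_terms(posting, PLATFORMS)
    template["application"] = find_terms(posting, APPLICATIONS)
    return template
```

Calling `extract(POSTING)` returns a dictionary whose keys mirror the template slots. Rules like these are brittle: a posting that writes "three to five years" or spells out the state name would slip past them.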

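Ray Mooney and his group at the University of Texas at Austin have done some great work in information extraction. Here are some references that might make good jumping-off points:

Raymond J. Mooney and Razvan Bunescu. Mining Knowledge from Text Using Information Extraction. SIGKDD Explorations, 7:1 (2005), pp. 3-10.
Stephen Soderland. Learning Information Extraction Rules for Semi-Structured and Free Text. Machine Learning, 34:1 (1999), pp. 233-272.
C. Blaschke and A. Valencia. The Frame-Based Module of the Suiseki Information Extraction System. IEEE Intelligent Systems, 17:14-20 (2002).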
