在 Vertica 中，您是否应该将字符串分解到它们自己的逻辑表中？

发布于 2024-12-07 05:00:02 字数 670 浏览 8 评论 0原文

假设你的逻辑表是：

CREATE TABLE employee(
  name VARCHAR,
  university VARCHAR
);

现在你只有几所大学。因此，您可以提取出大学名称：

CREATE TABLE employee(
  name VARCHAR,
  university integer references university(university)
);

CREATE TABLE university(
  university identity,
  name varchar
);

您有这样的查询：

SELECT employee 
FROM employee as e1 
WHERE EXISTS 
      (SELECT employee 
       FROM employee as e2 
       WHERE e1.name = e2.name AND e1.university <> e2.university)

我想知道的是：第二个逻辑模式（名称被“提取出”）是否可以加快速度？也许是因为那里，e1.university <> e2.university 是整数而不是字符串的比较。

原文

Suppose your logical table is:

CREATE TABLE employee(
  name VARCHAR,
  university VARCHAR
);

Now you have only a few universities. Therefore, you could factor out the university name:

CREATE TABLE employee(
  name VARCHAR,
  university integer references university(university)
);

CREATE TABLE university(
  university identity,
  name varchar
);

You have queries of the sort:

SELECT employee 
FROM employee as e1 
WHERE EXISTS 
      (SELECT employee 
       FROM employee as e2 
       WHERE e1.name = e2.name AND e1.university <> e2.university)

What I'm wondering about is: does the second logical schema, where the name is "factored out", speed up things? Perhaps because there, e1.university <> e2.university is a comparison of integers rather than of strings.

分享到QQ

分享到微博