Cassandra Design for Product ID Management

Posted on 2024-11-04 07:00:39

I'm new to Cassandra and wanted to start by trying a simple test.

Our data model in a traditional RDBMS is as follows:

Table Company (Id, Name)

Table Product (Id, Name, CompanyID), where CompanyID is an FK reference to the Company table

Table ProductInstance (Id, ProductID), where ProductID is an FK reference to the Product table

Table ProductInstanceRating (Id, ProductInstanceID, Comment), where ProductInstanceID is an FK reference to the ProductInstance table

Any suggestions on how this should be designed in Cassandra?

Update:

I tried to look at it from the querying perspective.

Data to be captured:

1) The Product Reference is a composite key consisting of Product Name, Product Lot Number, and Customer ID:
Product Name: 12456
Product Lot Number: PQ23
Customer ID: 879456

The unique Product Reference will then be something like 12456|PQ23|879456

2) Product Instances will be a set of unique hash numbers, one for every instance of the aforementioned Product.

Each instance of the product will get a unique number (784A, 876T, etc.), and this number will be unique within a particular Product Reference.

The Product Instance reference will then be something like 12456|PQ23|879456|784A

3) Each unique Product Instance number can receive more than one rating.
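
To make the key scheme in 1) and 2) concrete, here is a minimal Python sketch of how the references could be built and split. The function names (`product_reference`, `instance_reference`, `split_instance_reference`) are hypothetical, and the sketch assumes that no component ever contains the `|` delimiter.

```python
DELIMITER = "|"


def _check(part: str) -> str:
    """Reject components that would make the joined key ambiguous."""
    if DELIMITER in part:
        raise ValueError(f"component {part!r} must not contain {DELIMITER!r}")
    return part


def product_reference(product_name: str, lot_number: str, customer_id: str) -> str:
    """Join the three components into one product reference key."""
    return DELIMITER.join(_check(p) for p in (product_name, lot_number, customer_id))


def instance_reference(product_ref: str, instance_number: str) -> str:
    """Append the per-product instance number to a product reference."""
    return f"{product_ref}{DELIMITER}{_check(instance_number)}"


def split_instance_reference(instance_ref: str) -> tuple[str, str]:
    """Split an instance reference back into (product reference, instance number)."""
    product_ref, _, instance_number = instance_ref.rpartition(DELIMITER)
    return product_ref, instance_number


ref = product_reference("12456", "PQ23", "879456")
inst = instance_reference(ref, "784A")
```

With the values from this post, `ref` is the product reference from 1) and `inst` is the instance reference from 2).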

In that case, the queries will be something like:

Query 1) To add a rating for a Product Instance, fetch the row for the Product, i.e. 12456|PQ23|879456

Query 2) Fetch the ProductInstance ID within this row (maybe as the name of a column family)

Query 3) Add the rating information as a column:value pair

Should the design be something like this?

12456|PQ23|879456 {
      784A{timestamp1:{rating:valueA
                      person name:valueX}
           timestamp2:{rating:valueB
                      person name:valueY}}

      876T{timestamp1:{rating:valueC
                      person name:valueX}
           timestamp2:{rating:valueB
                      person name:valueY}}
}
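
To illustrate the shape of that layout and the write path of Queries 1 to 3, here is a rough in-memory Python sketch using nested dictionaries. It is not Cassandra code. The `add_rating` helper and the `ratings` store are hypothetical names, and the sketch assumes the timestamp is unique per instance (a second rating with the same timestamp would overwrite the first).

```python
from collections import defaultdict
from datetime import datetime, timezone

# product reference (row key) -> instance number -> timestamp -> rating fields
ratings = defaultdict(lambda: defaultdict(dict))


def add_rating(store, product_ref, instance_id, rating, person_name, timestamp=None):
    """Locate the product row and instance, then store the rating under its timestamp."""
    if timestamp is None:
        # Assumption: a UTC ISO-8601 string is an acceptable timestamp key.
        timestamp = datetime.now(timezone.utc).isoformat()
    store[product_ref][instance_id][timestamp] = {
        "rating": rating,
        "person name": person_name,
    }
    return timestamp


# Placeholder values mirroring the layout above.
add_rating(ratings, "12456|PQ23|879456", "784A", "valueA", "valueX", timestamp="timestamp1")
add_rating(ratings, "12456|PQ23|879456", "784A", "valueB", "valueY", timestamp="timestamp2")
add_rating(ratings, "12456|PQ23|879456", "876T", "valueC", "valueX", timestamp="timestamp1")
```

Note that in this shape, adding a rating needs only the product reference and the instance number; there is no separate read step before the write.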

Thereafter, we would want to:

Query 4) fetch all products that have instances
Query 5) fetch all product instances that have ratings
Query 6) fetch the highest rating for a product
Query 7) fetch the average rating for a product
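
To pin down what Queries 4 to 7 are expected to return, here is a small Python sketch that evaluates them over the nested layout held in memory. It only illustrates the intended semantics and is not a Cassandra access pattern. The function names are hypothetical, ratings are assumed to be numeric, and the sample values (including the second product reference) are made up for illustration.

```python
from statistics import mean

# Same nesting as the proposed layout; all values are made-up sample data.
store = {
    "12456|PQ23|879456": {
        "784A": {
            "timestamp1": {"rating": 4, "person name": "X"},
            "timestamp2": {"rating": 2, "person name": "Y"},
        },
        "876T": {},  # an instance that has no ratings yet
    },
    "99999|ZZ01|111111": {},  # hypothetical product with no instances
}


def products_with_instances(store):
    """Query 4: product references that have at least one instance."""
    return [ref for ref, instances in store.items() if instances]


def instances_with_ratings(store):
    """Query 5: (product reference, instance number) pairs with at least one rating."""
    return [
        (ref, inst)
        for ref, instances in store.items()
        for inst, rated in instances.items()
        if rated
    ]


def product_ratings(store, product_ref):
    """Collect every rating value across all instances of one product."""
    return [
        entry["rating"]
        for rated in store.get(product_ref, {}).values()
        for entry in rated.values()
    ]


def highest_rating(store, product_ref):
    """Query 6: highest rating for a product, or None if it has no ratings."""
    values = product_ratings(store, product_ref)
    return max(values) if values else None


def average_rating(store, product_ref):
    """Query 7: average rating for a product, or None if it has no ratings."""
    values = product_ratings(store, product_ref)
    return mean(values) if values else None
```

Queries 4 and 5 walk every row in this sketch, while Queries 6 and 7 touch only one product's row, which is worth keeping in mind when judging how the layout would serve each query.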

Is there a better and more efficient way to implement this?
