如何建模CQL表,以便可以通过zip_code或zip_code和hash查询它?
大家好,我有一个包含哈希作为主键的Cassandra表,另一列包含列表。我想添加另一列名为ZipCode,以便我可以根据Zipcode或ZipCode查询Cassandra,并可以查询
Hash |列表| Zipcode
select * from table where zip_code = '12345';
select * from table where zip_code = '12345' && hash='abcd';
有什么办法可以做到这一点?
Hi all I have a cassandra Table containing Hash as Primary key and another column containing List. I want to add another column named Zipcode such that I can query cassandra based on either zipcode or zipcode and hash
Hash | List | zipcode
select * from table where zip_code = '12345';
select * from table where zip_code = '12345' && hash='abcd';
Is there any way that I could do this?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
卡桑德拉(Cassandra)的建议是,您根据访问模式设计数据表。例如,在您的情况下,您希望通过zipcode 和
获得zipcode and hash
的,因此理想情况下,您可以拥有两个这样的表
,
然后可能需要重新设计您的表根据您的数据。我建议您在Cassandra 中了解
数据模型设计,然后再继续进行。
允许过滤
可以使用构造,但其用法取决于您的数据大/小。如果您的数据很大,则避免使用此构造,因为它需要对数据库进行完整的扫描,这在资源和时间方面非常昂贵。Recommendation in Cassandra is that you design your data tables based on your access patterns. For example in your case you would like to get results
by zipcode
andby zipcode and hash
, so ideally you can have two tables like thisand
Then you may be required to redesign your tables based on your data. I recommend you understand
data model design in cassandra
before proceeding further.ALLOW FILTERING
construct can be used but its usage depends on how big/small is your data. If you have a very large data then avoid using this construct as it will require complete scan of the database which is quite expensive in terms of resources and time.可以设计一个可以满足两个应用程序查询的单个表。
在此示例架构中,该表由Hash作为聚类密钥对邮政编码进行分区:
使用此设计,每个邮政编码可以具有
hash
的一行或多个行。这是其中包含一些测试数据的表:该表包含两个分区
zipcode = 123
和zipcode = 456
。第一个邮政编码有三个行(abc
,def
,ghi
),第二行有两个行(tuv
,xyz
)。您可以仅使用分区密钥(
ZipCode
)查询表格,例如:也可以使用分区键
Zipcode
和clustering key查询表格。哈希
,例如:欢呼!
It is possible to design a single table that will satisfy both app queries.
In this example schema, the table is partitioned by zip code with hash as the clustering key:
With this design, each zip code can have one or more rows of
hash
. Here's the table with some test data in it:The table contains two partitions
zipcode = 123
andzipcode = 456
. The first zip code has three rows (abc
,def
,ghi
) and the second has two rows (tuv
,xyz
).You can query the table using just the partition key (
zipcode
), for example:It is also possible to query the table with the partition key
zipcode
and clustering keyhash
, for example:Cheers!