返回介绍

代码

调试技术

数据库

前端

GOLANG

JAVA

网络

心理学

PYTHON

创业

UNIX/LINUX

分布式

系统设计

搜索引擎

开发工具

思维

源码

网站架构微服务

软件工程/项目管理

运维

金融理财

写作

互联网

区块链

技术演讲网课

职场

《Getting started with impala》

发布于 2023-09-17 00:26:21 字数 1365 浏览 0 评论 0 收藏 0

1. Why Impala

flexibliity for your big data flow high-performance analytics exploratory business intelligence

extrac-transform-load(ETL) BI: Business intelligence

2. Getting Up and Running with Impala

Cloudera live demo

A view is an alias for a longer query, and takes no time or storage to set up

3. Impala for the database developer

OLTP-style(online transaction processing)

impala implements SQL-92 standard features for queries, with som enhancements from later SQL standards Hadoop Distributed File System(HDFS)

  • Impala currently doesn't have OLTP-style data manipulation language (DML) such as DELETE or UPDATE.
  • Impala also does not have indexes, constraints or foreign keys.
  • No transactions

impala can very effeciently perform full table scans of large tables.

HDFS Storage Model: CDH: Cloudera Distribution with Hadoop Parquet File Format: binary file format

4. Common Developer Tasks for Impala

ETL(Extract-trnasform-load)

Make sure always close query handles when finished(release memory) JDBC or ODBC

with Impala, the biggest I/O savings com from using partitioned tables and choosing the most appropriate file format

Impala partitioned tables are just HDFS directories UDF(user defined functions)

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。
列表为空,暂无数据
    我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
    原文