HBaseOnHive (即通过hive操作hbase表数据)
1、功能说明
1、hbase作为存储,hive作为hbase的访问对象
2、hive内部表关联hbase表=》hive和hbase同时享有数据,同时hive表和hbase表生命周期一致,即删除hive表活hbase表,同步删除另一个表。(ps:创建hive表可以同步创建hbase表)
3、hive外部表关联hbase表=》hive表可以获取操作hbase表,但生命周期和hbase表的无关,即删除hive表,hbase不受影响(ps:通过hive分析hbase,享受hiveql的sql化分析福利)
2、案列展示
①、【hive内部表】创建hive表:hive内部表关联hbase表
CREATE TABLE hive_hbase_emp_table(
empno int,
ename string)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table");
0: jdbc:hive2://localhost:10000> CREATE TABLE hive_hbase_emp_table(
. . . . . . . . . . . . . . . .> empno int,
. . . . . . . . . . . . . . . .> ename string)
. . . . . . . . . . . . . . . .> STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
. . . . . . . . . . . . . . . .> WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
. . . . . . . . . . . . . . . .> TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table");
INFO : Compiling command(queryId=work_20210720163827_26c0541c-a557-4d48-a812-34343fbc171a): CREATE TABLE hive_hbase_emp_table(
empno int,
ename string)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table")
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:null, properties:null)
INFO : Completed compiling command(queryId=work_20210720163827_26c0541c-a557-4d48-a812-34343fbc171a); Time taken: 0.006 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720163827_26c0541c-a557-4d48-a812-34343fbc171a): CREATE TABLE hive_hbase_emp_table(
empno int,
ename string)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table")
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720163827_26c0541c-a557-4d48-a812-34343fbc171a); Time taken: 9.895 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
No rows affected (9.911 seconds)
- 【hive内部表】hive和hbase中表创建情况查询(hive建立,同步hbase建立)
0: jdbc:hive2://localhost:10000> show tables;
INFO : Compiling command(queryId=work_20210720163841_79e0f006-3a04-4c81-a44c-36901a15cd93): show tables
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:tab_name, type:string, comment:from deserializer)], properties:null)
INFO : Completed compiling command(queryId=work_20210720163841_79e0f006-3a04-4c81-a44c-36901a15cd93); Time taken: 0.006 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720163841_79e0f006-3a04-4c81-a44c-36901a15cd93): show tables
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720163841_79e0f006-3a04-4c81-a44c-36901a15cd93); Time taken: 0.007 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------------------+
| tab_name |
+-----------------------+
| dh |
| hive_hbase_emp_table |
+-----------------------+
2 rows selected (0.026 seconds)
hbase:016:0> list
TABLE
dhtest
hbase_emp_table
student
t1
teacher
wangwu
yangmi
zhaosi
demo:student1
9 row(s)
Took 0.0154 seconds
=> ["dhtest", "hbase_emp_table", "student", "t1", "teacher", "wangwu", "yangmi", "zhaosi", "demo:student1"]
- 【hive内部表】通过hive表插入数据,hbase表同步插入数据
0: jdbc:hive2://localhost:10000> insert into table hive_hbase_emp_table values(1,'dinghao');
INFO : Compiling command(queryId=work_20210720164557_288fb52e-0068-462a-853c-0226e6cdf401): insert into table hive_hbase_emp_table values(1,'dinghao')
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:col1, type:int, comment:null), FieldSchema(name:col2, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720164557_288fb52e-0068-462a-853c-0226e6cdf401); Time taken: 0.361 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720164557_288fb52e-0068-462a-853c-0226e6cdf401): insert into table hive_hbase_emp_table values(1,'dinghao')
WARN : Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
INFO : Query ID = work_20210720164557_288fb52e-0068-462a-853c-0226e6cdf401
INFO : Total jobs = 1
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Starting task [Stage-1:DDL] in serial mode
INFO : Launching Job 1 out of 1
INFO : Starting task [Stage-2:MAPRED] in serial mode
INFO : Number of reduce tasks is set to 0 since there's no reduce operator
INFO : number of splits:1
INFO : Submitting tokens for job: job_local376335399_0003
INFO : Executing with tokens: []
INFO : The url to track the job: http://localhost:8080/
INFO : Job running in-process (local Hadoop)
INFO : 2021-07-20 16:46:02,525 Stage-2 map = 100%, reduce = 0%
INFO : Ended Job = job_local376335399_0003
INFO : MapReduce Jobs Launched:
INFO : Stage-Stage-2: HDFS Read: 5 HDFS Write: 0 SUCCESS
INFO : Total MapReduce CPU Time Spent: 0 msec
INFO : Completed executing command(queryId=work_20210720164557_288fb52e-0068-462a-853c-0226e6cdf401); Time taken: 4.412 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
No rows affected (4.783 seconds)
0: jdbc:hive2://localhost:10000> select * from hive_hbase_emp_table;
INFO : Compiling command(queryId=work_20210720164833_d121fd07-7622-4820-b014-896fa7834dc5): select * from hive_hbase_emp_table
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:hive_hbase_emp_table.empno, type:int, comment:null), FieldSchema(name:hive_hbase_emp_table.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720164833_d121fd07-7622-4820-b014-896fa7834dc5); Time taken: 0.162 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720164833_d121fd07-7622-4820-b014-896fa7834dc5): select * from hive_hbase_emp_table
INFO : Completed executing command(queryId=work_20210720164833_d121fd07-7622-4820-b014-896fa7834dc5); Time taken: 0.0 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------------------------+-----------------------------+
| hive_hbase_emp_table.empno | hive_hbase_emp_table.ename |
+-----------------------------+-----------------------------+
| 1 | dinghao |
+-----------------------------+-----------------------------+
1 row selected (0.989 seconds)
hbase:017:0> scan "hbase_emp_table";
ROW COLUMN+CELL
1 column=info:ename, timestamp=2021-07-20T16:46:01.937, value=dinghao
1 row(s)
Took 0.7799 seconds
②、【hive外部表】创建hive外部表,数据关联hbase表
CREATE EXTERNAL TABLE relevance_hbase_emp(
empno int,
ename string)
STORED BY
'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table");
0: jdbc:hive2://localhost:10000> CREATE EXTERNAL TABLE relevance_hbase_emp(
. . . . . . . . . . . . . . . .> empno int,
. . . . . . . . . . . . . . . .> ename string)
. . . . . . . . . . . . . . . .> STORED BY
. . . . . . . . . . . . . . . .> 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
. . . . . . . . . . . . . . . .> WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
. . . . . . . . . . . . . . . .> TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table");
INFO : Compiling command(queryId=work_20210720165348_4c758276-318a-4bab-89ed-38e083606772): CREATE EXTERNAL TABLE relevance_hbase_emp(
empno int,
ename string)
STORED BY
'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table")
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:null, properties:null)
INFO : Completed compiling command(queryId=work_20210720165348_4c758276-318a-4bab-89ed-38e083606772); Time taken: 1.026 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720165348_4c758276-318a-4bab-89ed-38e083606772): CREATE EXTERNAL TABLE relevance_hbase_emp(
empno int,
ename string)
STORED BY
'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,info:ename")
TBLPROPERTIES ("hbase.table.name" = "hbase_emp_table")
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720165348_4c758276-318a-4bab-89ed-38e083606772); Time taken: 0.732 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
No rows affected (1.768 seconds)
0: jdbc:hive2://localhost:10000> select * from relevance_hbase_emp;
INFO : Compiling command(queryId=work_20210720165517_11244894-7b6c-4bad-80de-1880c71243b4): select * from relevance_hbase_emp
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:relevance_hbase_emp.empno, type:int, comment:null), FieldSchema(name:relevance_hbase_emp.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720165517_11244894-7b6c-4bad-80de-1880c71243b4); Time taken: 0.148 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720165517_11244894-7b6c-4bad-80de-1880c71243b4): select * from relevance_hbase_emp
INFO : Completed executing command(queryId=work_20210720165517_11244894-7b6c-4bad-80de-1880c71243b4); Time taken: 0.0 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+----------------------------+----------------------------+
| relevance_hbase_emp.empno | relevance_hbase_emp.ename |
+----------------------------+----------------------------+
| 1 | dinghao |
+----------------------------+----------------------------+
1 row selected (0.389 seconds)
0: jdbc:hive2://localhost:10000> insert into table relevance_hbase_emp values(2,'zyh');
INFO : Compiling command(queryId=work_20210720165915_f5879fec-0614-40d7-9030-461a2b5df4b2): insert into table relevance_hbase_emp values(2,'zyh')
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:col1, type:int, comment:null), FieldSchema(name:col2, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720165915_f5879fec-0614-40d7-9030-461a2b5df4b2); Time taken: 1.238 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720165915_f5879fec-0614-40d7-9030-461a2b5df4b2): insert into table relevance_hbase_emp values(2,'zyh')
WARN : Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
INFO : Query ID = work_20210720165915_f5879fec-0614-40d7-9030-461a2b5df4b2
INFO : Total jobs = 1
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Starting task [Stage-1:DDL] in serial mode
INFO : Launching Job 1 out of 1
INFO : Starting task [Stage-2:MAPRED] in serial mode
INFO : Number of reduce tasks is set to 0 since there's no reduce operator
INFO : number of splits:1
INFO : Submitting tokens for job: job_local60016057_0004
INFO : Executing with tokens: []
INFO : The url to track the job: http://localhost:8080/
INFO : Job running in-process (local Hadoop)
INFO : 2021-07-20 16:59:18,808 Stage-2 map = 100%, reduce = 0%
INFO : Ended Job = job_local60016057_0004
INFO : MapReduce Jobs Launched:
INFO : Stage-Stage-2: HDFS Read: 5 HDFS Write: 0 SUCCESS
INFO : Total MapReduce CPU Time Spent: 0 msec
INFO : Completed executing command(queryId=work_20210720165915_f5879fec-0614-40d7-9030-461a2b5df4b2); Time taken: 2.341 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
No rows affected (3.589 seconds)
hbase:018:0> scan "hbase_emp_table";
ROW COLUMN+CELL
1 column=info:ename, timestamp=2021-07-20T16:46:01.937, value=dinghao
2 column=info:ename, timestamp=2021-07-20T16:59:17.933, value=zyh
2 row(s)
Took 0.0125 seconds
0: jdbc:hive2://localhost:10000> select * from relevance_hbase_emp;
INFO : Compiling command(queryId=work_20210720170115_35af9cc9-c42d-403e-93b9-90d3a01fb1b3): select * from relevance_hbase_emp
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:relevance_hbase_emp.empno, type:int, comment:null), FieldSchema(name:relevance_hbase_emp.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720170115_35af9cc9-c42d-403e-93b9-90d3a01fb1b3); Time taken: 0.899 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170115_35af9cc9-c42d-403e-93b9-90d3a01fb1b3): select * from relevance_hbase_emp
INFO : Completed executing command(queryId=work_20210720170115_35af9cc9-c42d-403e-93b9-90d3a01fb1b3); Time taken: 0.0 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+----------------------------+----------------------------+
| relevance_hbase_emp.empno | relevance_hbase_emp.ename |
+----------------------------+----------------------------+
| 1 | dinghao |
| 2 | zyh |
+----------------------------+----------------------------+
2 rows selected (1.465 seconds)
0: jdbc:hive2://localhost:10000> select * from hive_hbase_emp_table;
INFO : Compiling command(queryId=work_20210720170151_e12ca46b-167d-4fa2-9a98-ef31a939e4e9): select * from hive_hbase_emp_table
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:hive_hbase_emp_table.empno, type:int, comment:null), FieldSchema(name:hive_hbase_emp_table.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720170151_e12ca46b-167d-4fa2-9a98-ef31a939e4e9); Time taken: 0.156 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170151_e12ca46b-167d-4fa2-9a98-ef31a939e4e9): select * from hive_hbase_emp_table
INFO : Completed executing command(queryId=work_20210720170151_e12ca46b-167d-4fa2-9a98-ef31a939e4e9); Time taken: 0.001 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------------------------+-----------------------------+
| hive_hbase_emp_table.empno | hive_hbase_emp_table.ename |
+-----------------------------+-----------------------------+
| 1 | dinghao |
| 2 | zyh |
+-----------------------------+-----------------------------+
2 rows selected (1.396 seconds)
③、【hbase表】调整hbase表数据观察hive内部表和外部表的变化
hbase:019:0> put 'hbase_emp_table','3','info:ename','dz'
Took 0.0148 seconds
hbase:020:0> scan "hbase_emp_table";
ROW COLUMN+CELL
1 column=info:ename, timestamp=2021-07-20T16:46:01.937, value=dinghao
2 column=info:ename, timestamp=2021-07-20T16:59:17.933, value=zyh
3 column=info:ename, timestamp=2021-07-20T17:04:54.739, value=dz
3 row(s)
Took 0.0050 seconds
0: jdbc:hive2://localhost:10000> select * from relevance_hbase_emp;
INFO : Compiling command(queryId=work_20210720170526_1d60b71b-056f-4c68-8f0b-5933557f00f8): select * from relevance_hbase_emp
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:relevance_hbase_emp.empno, type:int, comment:null), FieldSchema(name:relevance_hbase_emp.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720170526_1d60b71b-056f-4c68-8f0b-5933557f00f8); Time taken: 0.132 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170526_1d60b71b-056f-4c68-8f0b-5933557f00f8): select * from relevance_hbase_emp
INFO : Completed executing command(queryId=work_20210720170526_1d60b71b-056f-4c68-8f0b-5933557f00f8); Time taken: 0.001 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+----------------------------+----------------------------+
| relevance_hbase_emp.empno | relevance_hbase_emp.ename |
+----------------------------+----------------------------+
| 1 | dinghao |
| 2 | zyh |
| 3 | dz |
+----------------------------+----------------------------+
3 rows selected (0.801 seconds)
0: jdbc:hive2://localhost:10000> select * from hive_hbase_emp_table;
INFO : Compiling command(queryId=work_20210720170542_05a85c64-ded6-414c-a8fa-80667410192c): select * from hive_hbase_emp_table
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:hive_hbase_emp_table.empno, type:int, comment:null), FieldSchema(name:hive_hbase_emp_table.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720170542_05a85c64-ded6-414c-a8fa-80667410192c); Time taken: 0.153 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170542_05a85c64-ded6-414c-a8fa-80667410192c): select * from hive_hbase_emp_table
INFO : Completed executing command(queryId=work_20210720170542_05a85c64-ded6-414c-a8fa-80667410192c); Time taken: 0.0 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------------------------+-----------------------------+
| hive_hbase_emp_table.empno | hive_hbase_emp_table.ename |
+-----------------------------+-----------------------------+
| 1 | dinghao |
| 2 | zyh |
| 3 | dz |
+-----------------------------+-----------------------------+
3 rows selected (1.089 seconds)
④、【删除hive外部表】观察hbase表和hive内部表状态
0: jdbc:hive2://localhost:10000> drop table relevance_hbase_emp
INFO : Compiling command(queryId=work_20210720170740_e68314a0-9699-436f-a442-b17a2a4eabe9): drop table relevance_hbase_emp
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:null, properties:null)
INFO : Completed compiling command(queryId=work_20210720170740_e68314a0-9699-436f-a442-b17a2a4eabe9)
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170740_e68314a0-9699-436f-a442-b17a2a4eabe9): drop table relevance_hbase_emp
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720170740_e68314a0-9699-436f-a442-b17a2a4eabe9)
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
No rows affected (3.253 seconds)
0: jdbc:hive2://localhost:10000> show tables
INFO : Compiling command(queryId=work_20210720170808_524bbbb7-5cb8-4f16-ace6-cac83a45657d): show tables
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:tab_name, type:string, comment:from deserializer)], properties:null)
INFO : Completed compiling command(queryId=work_20210720170808_524bbbb7-5cb8-4f16-ace6-cac83a45657d)
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170808_524bbbb7-5cb8-4f16-ace6-cac83a45657d): show tables
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720170808_524bbbb7-5cb8-4f16-ace6-cac83a45657d)
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------------------+
| tab_name |
+-----------------------+
| dh |
| hive_hbase_emp_table |
+-----------------------+
2 rows selected (0.02 seconds)
0: jdbc:hive2://localhost:10000> select * from hive_hbase_emp_table;
INFO : Compiling command(queryId=work_20210720170900_f3228fcd-873b-432f-904b-008d3e6bc8d9): select * from hive_hbase_emp_table
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:hive_hbase_emp_table.empno, type:int, comment:null), FieldSchema(name:hive_hbase_emp_table.ename, type:string, comment:null)], properties:null)
INFO : Completed compiling command(queryId=work_20210720170900_f3228fcd-873b-432f-904b-008d3e6bc8d9); Time taken: 0.126 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720170900_f3228fcd-873b-432f-904b-008d3e6bc8d9): select * from hive_hbase_emp_table
INFO : Completed executing command(queryId=work_20210720170900_f3228fcd-873b-432f-904b-008d3e6bc8d9); Time taken: 0.0 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------------------------+-----------------------------+
| hive_hbase_emp_table.empno | hive_hbase_emp_table.ename |
+-----------------------------+-----------------------------+
| 1 | dinghao |
| 2 | zyh |
| 3 | dz |
+-----------------------------+-----------------------------+
3 rows selected (0.551 seconds)
hbase:021:0> scan "hbase_emp_table";
ROW COLUMN+CELL
1 column=info:ename, timestamp=2021-07-20T16:46:01.937, value=dinghao
2 column=info:ename, timestamp=2021-07-20T16:59:17.933, value=zyh
3 column=info:ename, timestamp=2021-07-20T17:04:54.739, value=dz
3 row(s)
Took 0.1460 seconds
⑤、【删除hive内部表】观察hbase表状态
0: jdbc:hive2://localhost:10000> drop table hive_hbase_emp_table;
INFO : Compiling command(queryId=work_20210720171120_dac76802-57ff-4bd8-9962-e4d41593e62e): drop table hive_hbase_emp_table
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:null, properties:null)
INFO : Completed compiling command(queryId=work_20210720171120_dac76802-57ff-4bd8-9962-e4d41593e62e); Time taken: 0.013 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720171120_dac76802-57ff-4bd8-9962-e4d41593e62e): drop table hive_hbase_emp_table
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720171120_dac76802-57ff-4bd8-9962-e4d41593e62e); Time taken: 7.43 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
No rows affected (7.451 seconds)
0: jdbc:hive2://localhost:10000> show tables;
INFO : Compiling command(queryId=work_20210720171132_6a5478b2-1ee8-4a3d-8f44-94b761e9903e): show tables
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:tab_name, type:string, comment:from deserializer)], properties:null)
INFO : Completed compiling command(queryId=work_20210720171132_6a5478b2-1ee8-4a3d-8f44-94b761e9903e); Time taken: 0.004 seconds
INFO : Concurrency mode is disabled, not creating a lock manager
INFO : Executing command(queryId=work_20210720171132_6a5478b2-1ee8-4a3d-8f44-94b761e9903e): show tables
INFO : Starting task [Stage-0:DDL] in serial mode
INFO : Completed executing command(queryId=work_20210720171132_6a5478b2-1ee8-4a3d-8f44-94b761e9903e); Time taken: 0.007 seconds
INFO : OK
INFO : Concurrency mode is disabled, not creating a lock manager
+-----------+
| tab_name |
+-----------+
| dh |
+-----------+
1 row selected (0.022 seconds)
hbase:022:0> list
TABLE
dhtest
student
t1
teacher
wangwu
yangmi
zhaosi
demo:student1
8 row(s)
Took 0.0114 seconds
=> ["dhtest", "student", "t1", "teacher", "wangwu", "yangmi", "zhaosi", "demo:student1"]
3、结论
- hive内部表关联hbase,hive生命周期和hbase一致,且数据同步
- hive外部表关联hbase,hive的操作同步hbase,但hive删除不影响hbase表
- 可以通过hive操作hbase