Using MySQL as the Hive Metastore

Source: tec.5lulu.com

First, make sure that both Hive and MySQL have been installed successfully.

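Add the following properties to hive-site.xml (inside its <configuration> element) to specify the address of the metastore database and how to connect to it: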

<property>
  <name>javax.jdo.option.ConnectionURL</name>
  <value>jdbc:mysql://10.20.151.10:3306/hive?characterEncoding=UTF-8</value>
  <description>JDBC connect string for a JDBC metastore</description>
</property>
<property>
  <name>javax.jdo.option.ConnectionDriverName</name>
  <value>com.mysql.jdbc.Driver</value>
  <description>Driver class name for a JDBC metastore</description>
</property>
<property>
  <name>javax.jdo.option.ConnectionUserName</name>
  <value>hive_user</value>
  <description>username to use against metastore database</description>
</property>
<property>
  <name>javax.jdo.option.ConnectionPassword</name>
  <value>123</value>
  <description>password to use against metastore database</description>
</property>
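
Before starting Hive, it can help to read the four properties back from the file and confirm none is missing or empty. The following is only a sketch: the helper names `read_props` and `missing_metastore_props` are hypothetical, and `SAMPLE` assumes the fragment above sits inside the `<configuration>` root element of hive-site.xml.

```python
import xml.etree.ElementTree as ET

# The four JDO connection properties set in the fragment above.
REQUIRED = [
    "javax.jdo.option.ConnectionURL",
    "javax.jdo.option.ConnectionDriverName",
    "javax.jdo.option.ConnectionUserName",
    "javax.jdo.option.ConnectionPassword",
]

def read_props(xml_text):
    """Return {name: value} for every <property> element in the config text."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props

def missing_metastore_props(xml_text):
    """List the required properties that are absent or have an empty value."""
    props = read_props(xml_text)
    return [name for name in REQUIRED if not props.get(name)]

# Assumed shape of hive-site.xml: the fragment wrapped in <configuration>.
SAMPLE = """<configuration>
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:mysql://10.20.151.10:3306/hive?characterEncoding=UTF-8</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <value>com.mysql.jdbc.Driver</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>hive_user</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>123</value>
  </property>
</configuration>"""

print(missing_metastore_props(SAMPLE))
```

To check a real installation, the same function could be given the text of the actual hive-site.xml instead of `SAMPLE`; an empty list means all four settings are present.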

Then start the Hive client and try it out:

[gpadmin1@hadoop5 hive-0.6.0]$ bin/hive
Hive history file=/tmp/gpadmin1/hive_job_log_gpadmin1_201106081130_1156785421.txt
hive> show tables;
FAILED: Error in metadata: javax.jdo.JDOFatalDataStoreException: Unknown database 'hive'
NestedThrowables:
com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: Unknown database 'hive'
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask

It fails, and the message is clear: MySQL has no database named hive. Does it have to be created by hand? Let's try:

[Intranet root@hadoop6 /var/lib/mysql]
#mysql -u root -p
Enter password: 
Welcome to the MySQL monitor.  Commands end with ; or \g.
Your MySQL connection id is 41
Server version: 5.5.12 MySQL Community Server (GPL)
Copyright (c) 2000, 2010, Oracle and/or its affiliates. All rights reserved.
Oracle is a registered trademark of Oracle Corporation and/or its
affiliates. Other names may be trademarks of their respective
owners.
Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
mysql> show databases;
+--------------------+
| Database           |
+--------------------+
| information_schema |
| mysql              |
| performance_schema |
| test               |
+--------------------+
4 rows in set (0.00 sec)
mysql> create database hive;
Query OK, 1 row affected (0.00 sec)
mysql> show databases;
+--------------------+
| Database           |
+--------------------+
| information_schema |
| hive               |
| mysql              |
| performance_schema |
| test               |
+--------------------+
5 rows in set (0.00 sec)
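
The database name that Hive looks for is the path component of the ConnectionURL setting, so the name to create can be derived from that value rather than typed by hand. A minimal sketch, assuming a plain `jdbc:mysql://host:port/db?params` URL; the helpers `metastore_db_name` and `create_database_sql` are hypothetical names used only for illustration:

```python
from urllib.parse import urlsplit

def metastore_db_name(jdbc_url):
    """Extract the database name from a jdbc:mysql:// connection URL."""
    prefix = "jdbc:"
    if not jdbc_url.startswith(prefix):
        raise ValueError("not a JDBC URL: %r" % jdbc_url)
    # Drop the "jdbc:" prefix so the rest parses like an ordinary URL.
    parts = urlsplit(jdbc_url[len(prefix):])
    if parts.scheme != "mysql":
        raise ValueError("not a MySQL JDBC URL: %r" % jdbc_url)
    name = parts.path.lstrip("/")
    if not name or "`" in name:
        raise ValueError("no usable database name in %r" % jdbc_url)
    return name

def create_database_sql(jdbc_url):
    """Build the statement to run in the mysql client for this URL."""
    # Backticks quote the identifier; IF NOT EXISTS makes a rerun harmless.
    return "CREATE DATABASE IF NOT EXISTS `%s`;" % metastore_db_name(jdbc_url)

URL = "jdbc:mysql://10.20.151.10:3306/hive?characterEncoding=UTF-8"
print(create_database_sql(URL))
```

The printed statement is meant to be pasted into the mysql session shown above; the sketch itself does not connect to the server.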

Now start Hive again and take another look:

[gpadmin1@hadoop5 hive-0.6.0]$ bin/hive
Hive history file=/tmp/gpadmin1/hive_job_log_gpadmin1_201106081130_544334815.txt
hive> show table;                       
FAILED: Parse Error: line 0:-1 mismatched input '<EOF>' expecting EXTENDED in show statement
hive> show tables;
OK
Time taken: 5.173 seconds
hive>         CREATE TABLE u_tmp1 (id1 INT,
    >            id2 int
    >            )
    >          ROW FORMAT DELIMITED
    >          FIELDS TERMINATED BY ',';
OK
Time taken: 0.266 seconds
hive> show tables;                         
OK
u_tmp1
Time taken: 0.197 seconds
hive>

It works now, so the missing database was indeed the problem.

Hive also creates a set of tables in this database to store its metadata. Let's see which ones:

mysql> use hive;
Reading table information for completion of table and column names
You can turn off this feature to get a quicker startup with -A
Database changed
mysql> show tables;
+----------------+
| Tables_in_hive |
+----------------+
| BUCKETING_COLS |
| COLUMNS        |
| DBS            |
| PARTITION_KEYS |
| SDS            |
| SD_PARAMS      |
| SEQUENCE_TABLE |
| SERDES         |
| SERDE_PARAMS   |
| SORT_COLS      |
| TABLE_PARAMS   |
| TBLS           |
+----------------+
12 rows in set (0.00 sec)
mysql> select * from TBLS;     
+--------+-------------+-------+------------------+----------+-----------+-------+----------+---------------+--------------------+--------------------+
| TBL_ID | CREATE_TIME | DB_ID | LAST_ACCESS_TIME | OWNER    | RETENTION | SD_ID | TBL_NAME | TBL_TYPE      | VIEW_EXPANDED_TEXT | VIEW_ORIGINAL_TEXT |
+--------+-------------+-------+------------------+----------+-----------+-------+----------+---------------+--------------------+--------------------+
|      1 |  1307504073 |     1 |                0 | gpadmin1 |         0 |     1 | u_tmp1   | MANAGED_TABLE | NULL               | NULL               |
+--------+-------------+-------+------------------+----------+-----------+-------+----------+---------------+--------------------+--------------------+
1 row in set (0.00 sec)
mysql>

The table u_tmp1 that we just created shows up here as well.
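
The CREATE_TIME value in the TBLS row reads as a Unix timestamp in seconds. A short sketch converting it to a readable time; the function name `create_time_utc` is hypothetical, and the UTC+8 offset is an assumption about the server's local time zone, made only because the result then lines up with the 201106081130 stamp in the Hive history file name:

```python
from datetime import datetime, timedelta, timezone

def create_time_utc(epoch_seconds):
    """Interpret a TBLS.CREATE_TIME value as seconds since the Unix epoch."""
    return datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)

created = create_time_utc(1307504073)  # value from the TBLS row above
print(created.isoformat())  # 2011-06-08T03:34:33+00:00

# Assumed local zone (UTC+8); not stated anywhere in the transcript.
local = created.astimezone(timezone(timedelta(hours=8)))
print(local.isoformat())  # 2011-06-08T11:34:33+08:00
```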

One more thing to note: the JDBC driver jar mysql-connector-java-5.1.15-bin.jar must be copied into Hive's lib directory. Otherwise statements fail with an error like this:

hive> show tables;
FAILED: Error in metadata: javax.jdo.JDOFatalInternalException: Error creating transactional connection factory
NestedThrowables:
java.lang.reflect.InvocationTargetException
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask

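I had installed MySQL from RPM packages, and mysql-connector-java-5.1.15-bin.jar was not among the installed files (the driver, MySQL Connector/J, is distributed separately from the server). A colleague later sent me an archive, and the jar was inside once I unpacked it.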
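
Whether the driver jar is in place can be checked before starting Hive. The sketch below is illustrative only: `find_connector_jars` is a hypothetical helper, the file pattern `mysql-connector-java-*.jar` is an assumption based on the jar name above, and the demo runs against a throwaway directory rather than a real Hive installation.

```python
import tempfile
from pathlib import Path

def find_connector_jars(hive_home):
    """Return the names of MySQL connector jars found in <hive_home>/lib."""
    lib_dir = Path(hive_home) / "lib"
    if not lib_dir.is_dir():
        return []
    return sorted(p.name for p in lib_dir.glob("mysql-connector-java-*.jar"))

# Demo: a temporary directory stands in for the Hive installation directory.
with tempfile.TemporaryDirectory() as tmp:
    lib = Path(tmp) / "lib"
    lib.mkdir()
    before = find_connector_jars(tmp)   # nothing copied yet
    (lib / "mysql-connector-java-5.1.15-bin.jar").touch()
    after = find_connector_jars(tmp)    # jar now present

print(before, after)
```

For a real check, the function would be pointed at the actual Hive directory (hive-0.6.0 in the transcripts above); an empty result suggests the jar still needs to be copied.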

