SerDe library 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
I have a Hive external table created on a CSV file using the SerDe 'org.apache.hadoop.hive.serde2.OpenCSVSerde'. The external table was created without problems and I am able to query it from Hive.
But when I try to access the same table from Impala, it throws the error: "SerDe library 'org.apache.hadoop.hive.serde2.OpenCSVSerde' is not supported."
It looks like Impala does not support Hive's OpenCSVSerde.
To get around the problem, I followed these steps:
- Create Hive External Table - stored as TextFile
- Create Hive table ( managed ) using CTAS - stored as PARQUET
- Access the Parquet table from Impala
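The steps above can be sketched roughly as follows. This is only an illustration: the table names (`sales_csv`, `sales_parquet`), the columns and the HDFS location are hypothetical, and it assumes the external table keeps using OpenCSVSerde on the Hive side. Since OpenCSVSerde exposes every column as STRING, the CTAS is also a convenient place to cast columns to the types you actually want.

```sql
-- Run in Hive. All table names, columns and paths here are hypothetical.

-- 1. External table over the CSV file, stored as text and read via OpenCSVSerde.
--    OpenCSVSerde treats every column as STRING.
CREATE EXTERNAL TABLE sales_csv (
  id     STRING,
  name   STRING,
  amount STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
STORED AS TEXTFILE
LOCATION '/data/sales_csv';

-- 2. Managed Parquet copy created with CTAS, casting columns as needed.
CREATE TABLE sales_parquet
STORED AS PARQUET
AS
SELECT
  CAST(id AS INT)        AS id,
  name,
  CAST(amount AS DOUBLE) AS amount
FROM sales_csv;

-- Run in Impala.

-- 3. Make the table created in Hive visible to Impala, then query it.
INVALIDATE METADATA sales_parquet;
SELECT COUNT(*) FROM sales_parquet;
```

Note that the Parquet table is a snapshot of the CSV data at the time the CTAS ran, so it has to be rebuilt or reloaded whenever the CSV file changes.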
I hope anyone facing the same issue finds this solution useful.