by robin Published September 20, 2016 Updated September 20, 2016
Lately I tried installing the xz/lzma codec on my local VM setup. The compression ratios are pretty awesome. I won't do a benchmark here; try it out yourself :wink:
Steps

1. Download the codec JAR from https://github.com/yongtang/hadoop-xz or https://mvnrepository.com/artifact/io.sensesecure/hadoop-xz
2. Copy the downloaded JAR into HDP's lib folders (every directory that already holds the Snappy JAR):

```bash
find /usr/hdp/ -name '*snappy*jar' | xargs -L1 dirname | xargs -L1 sudo cp ~/hadoop-xz-1.4.jar
```

3. Set up the compression codec in the HDFS config using Ambari:

Ambari -> HDFS -> Configs -> Advanced core-site -> io.compression.codecs -> add 'io.sensesecure.hadoop.xz.XZCodec'

Testing with Hive

First, create a big sample file in the local dir /tmp/sample.txt.
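A small sketch of how you might check that Ambari actually pushed the codec to the node and knock together a sample file. The config path and the /etc/services trick are assumptions on my part; any standard client config location and any large text file will do.

```bash
# Assumption: /etc/hadoop/conf is the usual HDP client config dir; the codec class
# should now show up in the io.compression.codecs property after the Ambari change.
grep -A1 io.compression.codecs /etc/hadoop/conf/core-site.xml

# Quick-and-dirty sample file: repeat a handy text file a few thousand times
# (anything big and text-like works; repetitive text compresses nicely with xz).
for i in $(seq 1 5000); do cat /etc/services; done > /tmp/sample.txt
ls -lh /tmp/sample.txt
```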
Operations in Hive:

```sql
create table orig_sample(val string);
!sh hdfs dfs -put /tmp/sample.txt /tmp;
LOAD DATA INPATH '/tmp/sample.txt' OVERWRITE INTO TABLE orig_sample;

-- test lzma
set hive.exec.compress.output=true;
set io.seqfile.compression.type=BLOCK;
set mapreduce.output.fileoutputformat.compress.codec=io.sensesecure.hadoop.xz.XZCodec;

drop table test_table_lzma;
CREATE TABLE test_table_lzma
ROW FORMAT DELIMITED FIELDS TERMINATED BY "," LINES TERMINATED BY "\n"
STORED AS TEXTFILE LOCATION "/tmp/test_table_lzma"
as select * from orig_sample;
```

Checking results

```bash
hdfs dfs -du -s -h /tmp/sample.txt
hdfs dfs -du -s -h /tmp/test_table_lzma
```
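If you prefer a single number over eyeballing the two -du outputs, something like the sketch below works. It assumes the paths used above and that the first column of hdfs dfs -du -s is the size in bytes; the final hive -e call is only a sanity check that the xz-compressed table can still be read back.

```bash
# Compute the compression ratio from the raw and compressed sizes on HDFS.
orig=$(hdfs dfs -du -s /tmp/sample.txt | awk '{print $1}')
comp=$(hdfs dfs -du -s /tmp/test_table_lzma | awk '{print $1}')
awk -v o="$orig" -v c="$comp" 'BEGIN { printf "compression ratio: %.2fx\n", o / c }'

# Sanity check: the table should still be readable, i.e. the codec decompresses on read.
hive -e 'select count(*) from test_table_lzma;'
```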