Apache > Hadoop > Core
 

Hadoop Archives

浠涔堟槸Hadoop archives?

Hadoop archives鏄壒娈婄殑妗f鏍煎紡銆備竴涓狧adoop archive瀵瑰簲涓涓枃浠剁郴缁熺洰褰曘 Hadoop archive鐨勬墿灞曞悕鏄*.har銆侶adoop archive鍖呭惈鍏冩暟鎹紙褰㈠紡鏄痏index鍜宊masterindx锛夊拰鏁版嵁锛坧art-*锛夋枃浠躲俖index鏂囦欢鍖呭惈浜嗘。妗堜腑鐨勬枃浠剁殑鏂囦欢鍚嶅拰浣嶇疆淇℃伅銆

濡備綍鍒涘缓archive?

鐢ㄦ硶: hadoop archive -archiveName name <src>* <dest>

鐢-archiveName閫夐」鎸囧畾浣犺鍒涘缓鐨刟rchive鐨勫悕瀛椼傛瘮濡俧oo.har銆俛rchive鐨勫悕瀛楃殑鎵╁睍鍚嶅簲璇ユ槸*.har銆傝緭鍏ユ槸鏂囦欢绯荤粺鐨勮矾寰勫悕锛岃矾寰勫悕鐨勬牸寮忓拰骞虫椂鐨勮〃杈炬柟寮忎竴鏍枫傚垱寤虹殑archive浼氫繚瀛樺埌鐩爣鐩綍涓嬨傛敞鎰忓垱寤篴rchives鏄竴涓狹ap/Reduce job銆備綘搴旇鍦╩ap reduce闆嗙兢涓婅繍琛岃繖涓懡浠ゃ備笅闈㈡槸涓涓緥瀛愶細

hadoop archive -archiveName foo.har /user/hadoop/dir1 /user/hadoop/dir2 /user/zoo/

鍦ㄤ笂闈㈢殑渚嬪瓙涓紝 /user/hadoop/dir1 鍜 /user/hadoop/dir2 浼氳褰掓。鍒拌繖涓枃浠剁郴缁熺洰褰曚笅 -- /user/zoo/foo.har銆傚綋鍒涘缓archive鏃讹紝婧愭枃浠朵笉浼氳鏇存敼鎴栧垹闄ゃ

濡備綍鏌ョ湅archives涓殑鏂囦欢?

archive浣滀负鏂囦欢绯荤粺灞傛毚闇茬粰澶栫晫銆傛墍浠ユ墍鏈夌殑fs shell鍛戒护閮借兘鍦╝rchive涓婅繍琛岋紝浣嗘槸瑕佷娇鐢ㄤ笉鍚岀殑URI銆 鍙﹀锛宎rchive鏄笉鍙敼鍙樼殑銆傛墍浠ラ噸鍛藉悕锛屽垹闄ゅ拰鍒涘缓閮戒細杩斿洖閿欒銆侶adoop Archives 鐨刄RI鏄

har://scheme-hostname:port/archivepath/fileinarchive

濡傛灉娌℃彁渚泂cheme-hostname锛屽畠浼氫娇鐢ㄩ粯璁ょ殑鏂囦欢绯荤粺銆傝繖绉嶆儏鍐典笅URI鏄繖绉嶅舰寮

har:///archivepath/fileinarchive

杩欐槸涓涓猘rchive鐨勪緥瀛愩俛rchive鐨勮緭鍏ユ槸/dir銆傝繖涓猟ir鐩綍鍖呭惈鏂囦欢filea锛宖ileb銆 鎶/dir褰掓。鍒/user/hadoop/foo.bar鐨勫懡浠ゆ槸

hadoop archive -archiveName foo.har /dir /user/hadoop

鑾峰緱鍒涘缓鐨刟rchive涓殑鏂囦欢鍒楄〃锛屼娇鐢ㄥ懡浠

hadoop dfs -lsr har:///user/hadoop/foo.har

鏌ョ湅archive涓殑filea鏂囦欢鐨勫懡浠-

hadoop dfs -cat har:///user/hadoop/foo.har/dir/filea