Hive Data Cleaning

-- Keep only rows whose uid and pid consist entirely of digits.
-- regexp_extract with '^[0-9]*$' and group 0 can only yield NULL, '' or a digit string,
-- so the original comparisons against 'NULL', 'null' and ' ' were redundant and are dropped.
INSERT OVERWRITE TABLE mktmall.tmall_201412_uid_pid
SELECT uid_clean, pid_clean
FROM (
  SELECT regexp_extract(uid, '^[0-9]*$', 0) AS uid_clean,
         regexp_extract(pid, '^[0-9]*$', 0) AS pid_clean
  FROM mktmall.tmall_201412
) t
WHERE uid_clean IS NOT NULL AND uid_clean != ''
  AND pid_clean IS NOT NULL AND pid_clean != '';
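
Before overwriting the target table, it can help to check how many rows the filter would keep. The query below is a minimal sketch, not part of the original post. It reuses the same `regexp_extract` expressions as the statement above and assumes that a value which is not purely digits comes back as an empty string or NULL. The column aliases `total_rows` and `kept_rows` are made up for illustration.

```sql
-- Sanity check (illustrative): compare the total row count with the number of
-- rows whose uid and pid both survive the digits-only filter.
SELECT
  COUNT(*) AS total_rows,
  SUM(CASE
        WHEN regexp_extract(uid, '^[0-9]*$', 0) != ''
         AND regexp_extract(pid, '^[0-9]*$', 0) != ''
        THEN 1 ELSE 0
      END) AS kept_rows
FROM mktmall.tmall_201412;
```

A NULL `uid` or `pid` makes the comparison evaluate to NULL, so the `CASE` falls through to `ELSE 0` and the row is counted as dropped, matching the `IS NOT NULL` checks in the cleaning statement.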
