猪:我的过滤器没有给出结果

Pig: my filter doesn't give result

您好,我是 hadoop 环境的新手。 我请求在 csv 上获取数据。

>LoadHomicide = LOAD '/user/admin/Crimes_samples.csv' USING PigStorage('\t') >AS >(Date:chararray,Block:chararray,PrimaryType:chararray,
>Description:chararray,
>LocationDescription:chararray,Arrest:chararray,Domestic:chararray,District:c>hararray,Year:chararray);

>uniq_arrest = FILTER LoadHomicide BY ( matches'%FALSE%');
>dump uniq_arrest;

我没有任何错误,但脚本的日志在这里给出了成功的 csv 答案。

ID","Case Number","Date","Block","IUCR","Primary Type","Description","Location Description", "Arrest","Domestic","Beat","District","Ward","Community Area","FBI Code","X Coordinate","Y Coordinate","Year","Updated On","Latitude","Longitude","Location"

0442761,"HZ181379",3/9/16 11:55 PM,"023XX N HAMLIN AVE","0560","ASSAULT","SIMPLE","APARTMENT","false","false","2525","025",35,"22","08A",1150660,1915214,2016,03/16/2016,41.92,-87.72,"(41.923245915, -87.721845939)" 10442848,"HZ181470",3/9/16 11:55 PM,"0000X W JACKSON BLVD","1310","CRIMINAL DAMAGE","TO PROPERTY","CTA GARAGE / OTHER PROPERTY","false","false","0113","001",2,"32","14",1176304,1898987,2016,03/16/2016,41.88,-87.63,"(41.878177799, -87.628111493)" 10442789,"HZ181391",3/9/16 11:55 PM,"052XX W HURON ST","1150","DECEPTIVE PRACTICE","CREDIT CARD FRAUD","ALLEY","false","false","1524","015",28,"25","11",1141433,1904126,2016,03/16/2016,41.89,-87.76,"(41.892994741, -87.756023813)" 10447046,"HZ185157",3/9/16 11:50 PM,"055XX N LINCOLN AVE","0460","BATTERY","SIMPLE","HOTEL

matches 语法是 incorrect.Also 在 [=17= 中没有第 6 个($5 指模式中的第 6 个字段,位置符号从 $0 开始)字段具有 "false" ] 正确的字段和右边的 syntax.Assuming 第 6 个字段中有 "false",那么这就是您在其上应用过滤器的方式。

uniq_arrest = FILTER LoadHomicide BY ( matches '.*false.*');