2016-10-13

Filtering tweets by matching criteria in R

I am completely new to R programming and am trying to learn it on my own. My scenario: I have a bunch of tweets, and I would like to filter them by word. If a word from my list occurs in a tweet's text, I want to get that whole tweet along with its other attributes.

sample.json

{"created_at":"Wed Oct 12 06:33:06 +0000 2016","id":786092312011112448,"id_str":"786092312011112448","text":"I posted a new photo to Google https:\/\/t.co\/5SBpo628DF","source":"\u003ca href=\"http:\/\/www.facebook.com\/twitter\" rel=\"nofollow\"\u003eFacebook\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":null,"in_reply_to_user_id_str":null,"in_reply_to_screen_name":null,"user":{"id":359430735,"id_str":"359430735","name":"Life Changing Quotes","screen_name":"dadhwalashish","location":"Jalandhar India","url":"http:\/\/www.CompassionKindness.com","description":"@armyofkind , Human Resources professional, Blogger, Happiness ambassador, Kindness warrior, Love agent, Humanity over all religion.","protected":false,"verified":false,"followers_count":1919,"friends_count":1973,"listed_count":41,"favourites_count":24456,"statuses_count":20738,"created_at":"Sun Aug 21 15:39:19 +0000 2011","utc_offset":19800,"time_zone":"New Delhi","geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"C0DEED","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"0084B4","profile_sidebar_border_color":"C0DEED","profile_sidebar_fill_color":"DDEEF6","profile_text_color":"333333","profile_use_background_image":true,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/359430735\/1472921460","default_profile":true,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_qu
ote_status":false,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/5SBpo628DF","expanded_url":"http:\/\/fb.me\/1nqsSEAnu","display_url":"fb.me\/1nqsSEAnu","indices":[33,56]}],"user_mentions":[],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"en","timestamp_ms":"1476253986729"} 
{"created_at":"Wed Oct 12 06:33:06 +0000 2016","id":786092312011112448,"id_str":"786092312011112448","text":"I posted a new photo to Google https:\/\/t.co\/5SBpo628DF","source":"\u003ca href=\"http:\/\/www.facebook.com\/twitter\" rel=\"nofollow\"\u003eFacebook\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":null,"in_reply_to_user_id_str":null,"in_reply_to_screen_name":null,"user":{"id":359430735,"id_str":"359430735","name":"Life Changing Quotes","screen_name":"dadhwalashish","location":"Jalandhar India","url":"http:\/\/www.CompassionKindness.com","description":"@armyofkind , Human Resources professional, Blogger, Happiness ambassador, Kindness warrior, Love agent, Humanity over all religion.","protected":false,"verified":false,"followers_count":1919,"friends_count":1973,"listed_count":41,"favourites_count":24456,"statuses_count":20738,"created_at":"Sun Aug 21 15:39:19 +0000 2011","utc_offset":19800,"time_zone":"New Delhi","geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"C0DEED","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"0084B4","profile_sidebar_border_color":"C0DEED","profile_sidebar_fill_color":"DDEEF6","profile_text_color":"333333","profile_use_background_image":true,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/359430735\/1472921460","default_profile":true,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_qu
ote_status":false,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/5SBpo628DF","expanded_url":"http:\/\/fb.me\/1nqsSEAnu","display_url":"fb.me\/1nqsSEAnu","indices":[33,56]}],"user_mentions":[],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"en","timestamp_ms":"1476253986729"}{"created_at":"Wed Oct 12 06:33:06 +0000 2016","id":786092312011112448,"id_str":"786092312011112448","text":"I posted a new photo to Facebook https:\/\/t.co\/5SBpo628DF","source":"\u003ca href=\"http:\/\/www.facebook.com\/twitter\" rel=\"nofollow\"\u003eFacebook\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":null,"in_reply_to_user_id_str":null,"in_reply_to_screen_name":null,"user":{"id":359430735,"id_str":"359430735","name":"Life Changing Quotes","screen_name":"dadhwalashish","location":"Jalandhar India","url":"http:\/\/www.CompassionKindness.com","description":"@armyofkind , Human Resources professional, Blogger, Happiness ambassador, Kindness warrior, Love agent, Humanity over all religion.","protected":false,"verified":false,"followers_count":1919,"friends_count":1973,"listed_count":41,"favourites_count":24456,"statuses_count":20738,"created_at":"Sun Aug 21 15:39:19 +0000 2011","utc_offset":19800,"time_zone":"New 
Delhi","geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"C0DEED","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"0084B4","profile_sidebar_border_color":"C0DEED","profile_sidebar_fill_color":"DDEEF6","profile_text_color":"333333","profile_use_background_image":true,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/359430735\/1472921460","default_profile":true,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_quote_status":false,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/5SBpo628DF","expanded_url":"http:\/\/fb.me\/1nqsSEAnu","display_url":"fb.me\/1nqsSEAnu","indices":[33,56]}],"user_mentions":[],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"en","timestamp_ms":"1476253986729"} 
{"created_at":"Wed Oct 12 06:33:06 +0000 2016","id":786092312011112448,"id_str":"786092312011112448","text":"I posted a new photo to Google https:\/\/t.co\/5SBpo628DF","source":"\u003ca href=\"http:\/\/www.facebook.com\/twitter\" rel=\"nofollow\"\u003eFacebook\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":null,"in_reply_to_user_id_str":null,"in_reply_to_screen_name":null,"user":{"id":359430735,"id_str":"359430735","name":"Life Changing Quotes","screen_name":"dadhwalashish","location":"Jalandhar India","url":"http:\/\/www.CompassionKindness.com","description":"@armyofkind , Human Resources professional, Blogger, Happiness ambassador, Kindness warrior, Love agent, Humanity over all religion.","protected":false,"verified":false,"followers_count":1919,"friends_count":1973,"listed_count":41,"favourites_count":24456,"statuses_count":20738,"created_at":"Sun Aug 21 15:39:19 +0000 2011","utc_offset":19800,"time_zone":"New Delhi","geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"C0DEED","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"0084B4","profile_sidebar_border_color":"C0DEED","profile_sidebar_fill_color":"DDEEF6","profile_text_color":"333333","profile_use_background_image":true,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/359430735\/1472921460","default_profile":true,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_qu
ote_status":false,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/5SBpo628DF","expanded_url":"http:\/\/fb.me\/1nqsSEAnu","display_url":"fb.me\/1nqsSEAnu","indices":[33,56]}],"user_mentions":[],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"en","timestamp_ms":"1476253986729"}{"created_at":"Wed Oct 12 06:33:06 +0000 2016","id":786092312011112448,"id_str":"786092312011112448","text":"I posted a new photo to Google https:\/\/t.co\/5SBpo628DF","source":"\u003ca href=\"http:\/\/www.facebook.com\/twitter\" rel=\"nofollow\"\u003eFacebook\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":null,"in_reply_to_user_id_str":null,"in_reply_to_screen_name":null,"user":{"id":359430735,"id_str":"359430735","name":"Life Changing Quotes","screen_name":"dadhwalashish","location":"Jalandhar India","url":"http:\/\/www.CompassionKindness.com","description":"@armyofkind , Human Resources professional, Blogger, Happiness ambassador, Kindness warrior, Love agent, Humanity over all religion.","protected":false,"verified":false,"followers_count":1919,"friends_count":1973,"listed_count":41,"favourites_count":24456,"statuses_count":20738,"created_at":"Sun Aug 21 15:39:19 +0000 2011","utc_offset":19800,"time_zone":"New 
Delhi","geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"C0DEED","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"0084B4","profile_sidebar_border_color":"C0DEED","profile_sidebar_fill_color":"DDEEF6","profile_text_color":"333333","profile_use_background_image":true,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/359430735\/1472921460","default_profile":true,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_quote_status":false,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/5SBpo628DF","expanded_url":"http:\/\/fb.me\/1nqsSEAnu","display_url":"fb.me\/1nqsSEAnu","indices":[33,56]}],"user_mentions":[],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"en","timestamp_ms":"1476253986729"} 
{"created_at":"Wed Oct 12 06:33:06 +0000 2016","id":786092312011112448,"id_str":"786092312011112448","text":"I posted a new photo to Twiiter https:\/\/t.co\/5SBpo628DF","source":"\u003ca href=\"http:\/\/www.facebook.com\/twitter\" rel=\"nofollow\"\u003eFacebook\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":null,"in_reply_to_user_id_str":null,"in_reply_to_screen_name":null,"user":{"id":359430735,"id_str":"359430735","name":"Life Changing Quotes","screen_name":"dadhwalashish","location":"Jalandhar India","url":"http:\/\/www.CompassionKindness.com","description":"@armyofkind , Human Resources professional, Blogger, Happiness ambassador, Kindness warrior, Love agent, Humanity over all religion.","protected":false,"verified":false,"followers_count":1919,"friends_count":1973,"listed_count":41,"favourites_count":24456,"statuses_count":20738,"created_at":"Sun Aug 21 15:39:19 +0000 2011","utc_offset":19800,"time_zone":"New Delhi","geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"C0DEED","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"0084B4","profile_sidebar_border_color":"C0DEED","profile_sidebar_fill_color":"DDEEF6","profile_text_color":"333333","profile_use_background_image":true,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/759986435453837312\/26b_K_9u_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/359430735\/1472921460","default_profile":true,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_q
uote_status":false,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/5SBpo628DF","expanded_url":"http:\/\/fb.me\/1nqsSEAnu","display_url":"fb.me\/1nqsSEAnu","indices":[33,56]}],"user_mentions":[],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"en","timestamp_ms":"1476253986729"} 

words.txt

google 
facebook 
twitter 

If the word Google occurs in a tweet's text, I should get all tweets that contain that word, and likewise for the other words. I have tried this on my own, but I need some expert help. What is the best way to do this? Please point me to links, books, or documentation.

Thanks in advance. Mohan.V


Hello and welcome to StackOverflow. Please take the time to read the help pages, especially the sections named ["What topics can I ask about here?"](http://stackoverflow.com/help/on-topic) and ["What types of questions should I avoid asking?"](http://stackoverflow.com/help/dont-ask). More importantly, please read the [Stack Overflow question checklist](http://meta.stackexchange.com/q/156810/204922). You might also want to learn about [Minimal, Complete, and Verifiable Examples](http://stackoverflow.com/help/mcve). – symbolrush

Answer


You could start with something like this:

# words to match
words <- c("google", "facebook", "twitter")
# sample words. put the real tweet data (e.g. each tweet's text) here
sample.jsons <- c("google", "twitch", "twitter", "twitch", "facebook", "twitch", "google", "facebook", "twitter")
# run through all jsons and target words
for (i in seq_along(sample.jsons)) {
  for (j in seq_along(words)) {
    # grepl() tests for a substring, so this also works on a full tweet text,
    # whereas %in% only matches when the whole string equals the word;
    # tolower() makes the match case-insensitive ("Google" matches "google")
    if (grepl(words[[j]], tolower(sample.jsons[[i]]), fixed = TRUE)) {
      # do whatever you want with the jsons that match your criterion;
      # right now this just prints every index at which the criterion is fulfilled
      print(i)
    }
  }
}
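The loop above only prints indices. As a rough sketch of how the matching tweets could be kept together with their other attributes, the version below uses base R only. It assumes the tweets have already been parsed into a data frame with a `text` column; the `tweets` data frame, its `id` values, and the `filter_tweets` helper are made up for illustration and are not part of the original post.

```r
# Hypothetical stand-in for the parsed tweets: one row per tweet, holding the
# text plus a couple of other attributes (the ids are invented).
tweets <- data.frame(
  id = 1:4,
  screen_name = rep("dadhwalashish", 4),
  text = c(
    "I posted a new photo to Google https://t.co/5SBpo628DF",
    "I posted a new photo to Facebook https://t.co/5SBpo628DF",
    "I posted a new photo to Twiiter https://t.co/5SBpo628DF",
    "Nothing to see here"
  ),
  stringsAsFactors = FALSE
)

# Words to match; trimws() guards against trailing spaces like those in words.txt.
words <- trimws(c("google ", "facebook ", "twitter "))

# For each word, keep the rows whose text contains it (case-insensitive,
# literal substring match). Returns a named list of data frames, one per word,
# so every matching tweet keeps all of its columns.
filter_tweets <- function(tweets, words) {
  lowered <- tolower(tweets$text)
  matches <- lapply(words, function(w) {
    tweets[grepl(tolower(w), lowered, fixed = TRUE), , drop = FALSE]
  })
  names(matches) <- words
  matches
}

matched <- filter_tweets(tweets, words)
```

Here `matched$google` and `matched$facebook` each hold one row, while `matched$twitter` is empty because the sample text spells the word "Twiiter". Matching against the `text` column rather than the raw JSON line matters for data like sample.json: the `source` field of every tweet contains "facebook.com/twitter", so a substring search over the whole line would match "facebook" and "twitter" everywhere.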

However, I strongly recommend that you (a) look at the SO guidelines and rules, and (b) learn at least the basics of R programming before continuing. If you don't, you will almost certainly run into the next problem right away.

Good places to start are:


@Bunny, if this answer helped, please accept it by clicking the check mark. That lets the community know the answer worked and the problem is resolved. – jd1338
