{{category R}} !!! tagger {{outline}} ---- https://github.com/trinker/tagger * openNLPの品詞タグ付与 {{pre tagger wraps the NLP and openNLP packages for easier part of speech tagging. tagger uses the openNLP annotator to compute "Penn Treebank parse annotations using the Apache OpenNLP chunking parser for English." }} !!必要なパッケージをインストール {{pre install.packages("pacman") pacman::p_load_gh(c( "trinker/termco", "trinker/coreNLPsetup", "trinker/tagger" )) library(dplyr) library(tagger) install.packages('rJava') library(rJava) }} !!タグ一覧 !penn_tags() {{pre Tag Description 1 $ dollar 2 `` opening quotation mark 3 '' closing quotation mark 4 ( opening parenthesis 5 ) closing parenthesis 6 , comma 7 - dash 8 . sentence terminator 9 : colon or ellipsis 10 CC conjunction, coordinating 11 CD numeral, cardinal 12 DT determiner 13 EX existential there 14 FW foreign word 15 IN preposition or conjunction, subordinating 16 JJ adjective or numeral, ordinal 17 JJR adjective, comparative 18 JJS adjective, superlative 19 LS list item marker 20 MD modal auxiliary 21 NN noun, common, singular or mass 22 NNP noun, proper, singular 23 NNPS noun, proper, plural 24 NNS noun, common, plural 25 PDT pre-determiner 26 POS genitive marker 27 PRP pronoun, personal 28 PRP$ pronoun, possessive 29 RB adverb 30 RBR adverb, comparative 31 RBS adverb, superlative 32 RP particle 33 SYM symbol 34 TO "to" as preposition or infinitive marker 35 UH interjection 36 VB verb, base form 37 VBD verb, past tense 38 VBG verb, present participle or gerund 39 VBN verb, past participle 40 VBP verb, present tense, not 3rd person singular 41 VBZ verb, present tense, 3rd person singular 42 WDT WH-determiner 43 WP WH-pronoun 44 WP$ WH-pronoun, possessive 45 WRB Wh-adverb }} !Penn Tree Bank式のタグではなく、一般的な品詞記号にまとめることもできる as_universial() !品詞を明記することもできる as_basic() !!コマンド !タグ付与 tag_pos() !タグ頻度 count_tags() !タグ付きのテキストの出力 as_word_tag() !文の構成素ごとにまとめる as_tuple()