トップ 履歴 一覧 Farm ソース 検索 ヘルプ PDF RSS ログイン

SCA

*disclaimer
640117

L2 Syntactic Complexity Analyzer


Lu, X. (2010). Automatic analysis of syntactic complexity in second language writing. International Journal of Corpus Linguistics, 15(4), 474-496.
http://www.personal.psu.edu/xxl13/downloads/l2sca.html


 指標一覧

9 structures in the text:

  1. words (W)
  2. sentences (S)
  3. verb phrases (VP)
  4. clauses (C)
  5. T-units (T)
  6. dependent clauses (DC)
  7. complex T-units (CT)
    • 従属節を含むT-unit
  1. coordinate phrases (CP)
    • 形容詞、副詞、名詞、動詞
  1. complex nominals (CN)
    • 修飾


14 syntactic complexity indices of the text:

  1. mean length of sentence (MLS)
  2. mean length of T-unit (MLT
  3. mean length of clause (MLC)
  4. clauses per sentence (C/S)
  5. verb phrases per T-unit (VP/T)
  6. clauses per T-unit (C/T)
  7. dependent clauses per clause (DC/C)
  8. dependent clauses per T-unit (DC/T)
  9. T-units per sentence (T/S)
  10. complex T-unit ratio (CT/T)
  11. coordinate phrases per T-unit (CP/T)
  12. coordinate phrases per clause (CP/C)
  13. complex nominals per T-unit (CN/T)
  14. complex nominals per clause (CN/C)

名詞句の複雑性 (CN/C)

  • Lu (2010: 483)
Complex nominals. 
Complex nominals comprise 
(i) nouns plus 
    adjective, 
    possessive, 
    prepositional phrase, 
    relative clause, 
    participle, or 
    appositive, 
(ii) nominal clauses, and 
(iii) gerunds and infinitives in subject position (Cooper 1976). 

These are operationalized using the 
Tregex patterns in (12), (13), and (14) respectively. 

The pattern in (12) matches an NP node 
    that is not immediately dominated by another NP and 
    that dominates 
        an adjective (JJ), 
        possessive (POS), 
        prepositional phrase (PP), 
        relative clause (S), 
        participle (VBG), or 
        appositive (an NP that is a left sister of another NP and that is not the immediate left sister of a CC). 
    For example, this pattern matches 
        the two NP nodes in (2) that represent the noun phrases 
            a girl in our dorm and a spoiled child 
        respectively. 

The pattern in (13) retrieves nominal clauses by matching 
    an SBAR node in subject or object position 
        (i.e. it is either an immediate left sister of a VP or is immediately dominated by a VP) 
    that satisfies one of the following two conditions: 
        (i) it immediately dominates 
            a wh-noun phrase (WHNP) (e.g. what I like) or 
            a complementizer (i.e. that or for tagged as a preposition, as in that you like to read), or 
        (ii) it has an S node as its first child (i.e. a clausal object without a complementizer, as in I know you like to read). 

The pattern in (14) retrieves 
    gerunds and infinitives 
        in subject position by matching 
    an S node that immediately dominates a VP headed by 
        a gerund or 
        the infinitive “to” and 
    that is an immediate left sister of a VP (e.g. Saving energy is really important). 

(12) “NP !> NP [<< JJ|POS|PP|S|VBG |<< (NP $++ NP !$+ CC)]” 

(13) “SBAR [$+ VP | > VP] & [<# WHNP |<# (IN <That|that|For|for) |<, S]” 

(14) “S < (VP <# VBG|TO) $+ VP”

 どうやって使うか

手元にプログラムをインストール


  1. L2SCA-2022-07-12.tgzをダウンロード
    1. Stanford Parserも含まれている
      1. Stanford ParserがJavaで書かれている
  2. READMEファイルを読む

Web-based L2 Syntactic Complexity Analyzer

https://aihaiyang.com/software/

  • Single Mode
    • 2ファイルまで
    • 1000語以内のテキストファイル


  • Batch Mode
    • 30ファイルまで

TAASSC内で

Kyle, K. (2016). Measuring syntactic development in L2 writing: Fine grained indices of syntactic complexity and usage-based indices of syntactic sophistication (Doctoral Dissertation). Retrieved from http://scholarworks.gsu.edu/alesl_diss/35.
https://www.linguisticanalysistools.org/taassc.html