3. Alternation patterns¶
Alternation patterns or analogies serve as a basis for all other scripts. Patterns describe an alternation between two forms from two different cells.
Input¶
For instance, when provided with a pair of forms ba ~ bab, qumin will abstract away the following pattern: Ø ~ b / ba_. Moreover, Qumin is able to generalize patterns that apply to different lexemes in a phonological meaningful way and then to select the best matching pattern.
Let us compute patterns for three cells on our Karelian dataset:
qumin action=patterns data=parakar/parakar.package.json cells="[abl.sg, gen.sg, part.sg]" hydra.run.dir=results/patterns
Tip
We used action=patterns to tell Qumin to compute patterns. In practice, this is not necessary as Qumin’s default action is to compute patterns.
Output¶
Qumin produces the following output folder:
cli.log: a log from the computationmetadata.json: the metadata descriptormicroclasses.txt: classes of lexemes that behave identically (i.e. that have the same patterns)/patterns/human_readable: readable outputs, for human eyes./machine_readable: machine readable outputs, for Qumin’s eyes.
The last two folders contain one file per pair of cells. Since we asked for 3 cells, it contains 3 files. Human readable outputs can be of a great help to understand morphological alternations. Let’s have a look at the beginning of one:
## Pattern u͜ɑ ⇌ ɑn / X+[-syll]_
Pairs of forms instantiating this pattern: 1315
Full pattern: u͜ɑ ⇌ ɑn / {b,bʲ,d,dʲ,dʲː,dː,d͡ʒ,e,eː,e͜i,e͜u,e͜y,f,fː,g,h,i,iː,i͜e,i͜u,i͜y,i͜æ,i︧eu︨,i︧ey︨,i︧æy︨,j,jː,k,kː,l,lʲ,lʲː,lː,m,mː,n,nʲ,nʲː,nː,o,oː,o͜i,o͜u,p,pʲ,pː,r,rʲ,rʲː,rː,s,sʲ,sʲː,sː,t,tʲ,tʲː,tː,t͡s,t͡ʃ,t͡ʃː,u,uː,u͜i,u͜o,u͜ɑ,u︧ou︨,u︧ɑu︨,y,yː,y͜i,y͜ø,y︧øy︨,z,zʲ,æ,æː,æ͜i,æ͜y,ø,øː,ø͜i,ø͜y,ŋ,ŋː,ɑ,ɑː,ɑ͜i,ɑ͜u,ʃ,ʃː,ʋ,ʋʲ,ʋː,ʒ}+{b,bʲ,d,dʲ,dʲː,dː,d͡ʒ,f,fː,g,h,j,jː,k,kː,l,lʲ,lʲː,lː,m,mː,n,nʲ,nʲː,nː,p,pʲ,pː,r,rʲ,rʲː,rː,s,sʲ,sʲː,sː,t,tʲ,tʲː,tː,t͡s,t͡ʃ,t͡ʃː,z,zʲ,ŋ,ŋː,ʃ,ʃː,ʋ,ʋʲ,ʋː,ʒ}_ <1315>
Examples:
| lexeme | part.sg | gen.sg |
|:-------------------------|:-------------------------------|:------------------------------|
| abai | ɑbɑju͜ɑ | ɑbɑjɑn |
| abeiččii | ɑbe͜it͡ʃːiju͜ɑ | ɑbe͜it͡ʃːijɑn |
| abiturientu | ɑbiturʲi͜entu͜ɑ | ɑbiturʲi͜entɑn |
| abudengu | ɑbudʲeŋːu͜ɑ | ɑbudʲeŋːɑn |
| adru | ɑdru͜ɑ | ɑdrɑn |
| ad’jektiivu | ɑdʲjektʲiːʋu͜ɑ | ɑdʲjektʲiːʋɑn |
| agan | ɑgɑnu͜ɑ | ɑgɑnɑn |
The file lists all patterns. For each patterns it contains following information:
Short, human readable representation of the pattern.
Expanded, internally used representation of the pattern.
Number of pairs of forms that instanciate the pattern
A table containing all these pairs of forms.
The patterns are sorted by frequency.
Tip
Have a look at the patterns reference to see all available options for pattern computations.