Johann-Mattis List
Department of Linguistic and Cultural Evolution
Max Planck Institute for Evolutionary Anthropology
Deutscher Platz 6
04103 Leipzig
phone: +49 (0) 341 3550 283
mattis_list@[>>> Please remove the text! <<<]
About me

I am a full professor at the University of Passau, leading the Chair of Multilingual Computational Linguistics. The research carried out in our CALC/MCL Lab at Passau and in collaboration with researchers from Leipzig takes inspiration from bioinformatics and computer science to tackle problems in comparative linguistics and to provide solutions for multilingual problems in computational linguistics. In my research, I generally take a data-driven, empirical, and quantitative perspective on language change and language history. In contrast to purely computational approaches, however, I try to keep my research closely connected to traditional historical linguistics and theory, following a computer-assisted rather than a computer-based framework of quantitative research in historical linguistics.
Curriculum Vitae
since 01/2023 | Full professor at the University of Passau, leading the Chair of Multilingual Computational Linguistics |
03/2021-03/2024 | Senior scientist in the Department of Linguistic and Cultural Evolution at the Max Planck Institute for Evolutionary Anthropology |
01/2017-02/2021 | Senior scientist in the Department of Linguistic and Cultural Evolution at the Max Planck Institute for the Science of Human History |
01/2015-12/2016 | DFG research fellow at the Centre de Recherches Linguistiques sur l'Asie Orientale (EHESS) and Université Pierre et Marie Curie (Team "Adaptation, Integration, Reticulation, Evolution") |
10/2012-12/2014 | Post-doctoral researcher at Philipps-University Marburg |
02/2009-09/2012 | Doctoral student at Heinrich Heine University Düsseldorf (Historical Linguistics) |
25.06.2008 | Magister Artium: Major Subject: Comparative-Historical Linguistics (Freie University Berlin), Minor Subjects: Sinology (Freie University Berlin), Russian Philology (Humboldt-University Berlin) |
09/2007-01/2008 | Visiting student at Fúdàn University Shànghǎi (Linguistics and Applied Linguistics, taught in Chinese) |
09/2005-07/2006 | Language student at Fudan University Shanghai |
2003-2008 | Studies at Freie University Berlin and Humboldt-University Berlin (Major Subject: Comparative-Historical Linguistics, Minor Subjects: Sinology, Russian Philology) |
10/2002-09/2003 | Studies at Eberhard-Karls-University Tübingen (Major Subject: Rhetorics, Minor Subjects: Modern History, Comparative Literature Studies, Russian Philology) |
My full and always up-to-date CV is available for download here.
Blum, F., Barrientos, C., Ingunza, A., & List, J.-M. (2024). Cognate reflex prediction as hypothesis test for a genealogical relation between the Panoan and Takanan language families. Scientific Reports, 14: 30636. |
List, J.-M., Hill, N. W., Blum, F., & Juárez, C. (2024). Grouping sounds into evolving units for the purpose of historical language comparison. Open Research Europe, 4: 31. |
List, J.-M. (2024). Open problems in computational historical linguistics. Open Research Europe, 3: 201. |
Tjuka, A., Forkel, R., & List, J.-M. (2024). Universal and cultural factors shape body part vocabularies. Scientific Reports, 14: 10486. |
Blum, F., Barrientos, C., Zariquiey, R., & List, J.-M. (2024). A comparative wordlist for investigating distant relations among languages in Lowland South America. Scientific Data, 11(1): 92. |
Blum, F., Englisch, J., Rodriguez, A. H., van Gijn, R., & List, J.-M. (2024). Resource acquisition for understudied languages: Extracting wordlists from dictionaries for computer-assisted language comparison. In N. Calzolari, M.-Y. Kan, V. Hoste, A. Lenci, & S. Sakti ( |
Dhakal, D. N., List, J.-M., & Roberts, S. G. (2024). A phylogenetic study of South-Western Tibetic. Journal of Language Evolution, 9(1-2), 14-28. |
Forkel, R., List, J.-M., Rzymski, C., & Segerer, G. (2024). Linguistic survey of India and Polyglotta Africana: Two retrostandardized digital editions of large historical collections of multilingual wordlists. In N. Calzolari, M.-Y. Kan, V. Hoste, A. Lenci, & S. Sakti ( |
Pulini, M., & List, J.-M. (2024). First steps towards the integration of resources on historical glossing traditions in the history of Chinese: A collection of standardized Fǎnqiè spellings from the Guǎngyùn. In N. Calzolari, M.-Y. Kan, V. Hoste, A. Lenci, & S. Sakti ( |
Tjuka, A., & List, J.-M. (2024). Partial colexifications reveal directional tendencies in object naming. Yearbook of the German Cognitive Linguistics Association, 12(1), 95-112. |
List, J.-M., Hill, N., Forkel, R., & Blum, F. (2023). Representing and computing uncertainty in phonological reconstruction. In N. Tahmasebi, S. Montariol, H. Dubossarsky, A. Kutuzov, S. Hengchen, D. Alfter, F. Periti, & P. Cassotti ( |
Wu, M.-S., & List, J.-M. (2023). Annotating cognates in phylogenetic studies of Southeast Asian languages. Language Dynamics and Change, 13(2), 161-197. |
List, J.-M. (2023). Evolutionary aspects of language change. In A. du Crest ( |
List, J.-M. (2023). Inference of partial colexifications from multilingual wordlists. Frontiers in Psychology, 14: 1156540. |
Tjuka, A., Forkel, R., & List, J.-M. (2023). Curating and extending data for language comparison in Concepticon and NoRaRe. Open Research Europe, 2: 141. |
Steuer, J., List, J.-M., Abdullah, B. M., & Klakow, D. (2023). Information-theoretic characterization of vowel harmony: A cross-linguistic study on word lists. In Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (pp. 96-109). Dubrovnik, Croatia: Association for Computational Linguistics. |
Zariquiey, R., Vera, J., Greenhill, S. J., Valenzuela, P., Gray, R. D., & List, J.-M. (2023). Untangling the evolution of body-part terminology in Pano: conservative versus innovative traits in body-part lexicalization. Interface Focus, 13(1): 20220053. |
Blum, F., & List, J.-M. (2023). Trimming phonetic alignments improves the inference of sound correspondence patterns from multilingual wordlists. In L. Beinborn, K. Goswami, S. Muradoğlu, A. Sorokin, R. Kumar, A. Scherbakov, E. M. Ponti, R. Cotterell, & E. Vylomova ( |
Greenhill, S. J., Haynie, H. J., Ross, R. M., Chira, A.-M., List, J.-M., Campbell, L., Botero, C. A., & Gray, R. D. (2023). A recent northern origin for the Uto-Aztecan family. Language, 99(1), 81-107. |
Lai, Y., & List, J.-M. (2023). Lexical data for the historical comparison of Rgyalrongic languages. Open Research Europe, 3: 99. |
Miller, J. E., & List, J.-M. (2023). Detecting lexical borrowings from dominant languages in multilingual wordlists. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. Short Papers (pp. 2591-2597). Association for Computational Linguistics. |
Brid, N., Messineo, C., & List, J.-M. (2022). A comparative wordlist for the languages of The Gran Chaco, South America [version 2; peer review: 2 approved]. Open Research Europe, 2(90). |
Hantgan, A., & List, J.-M. (2022). Bangime: secret language, language isolate, or language island? A computer-assisted case study. Papers in Historical Phonology, 7, 1-43. |
Brid, N., List, J.-M., & Messineo, C. (2022). Patrones léxicos compartidos en el dominio etnobiológico de las lenguas del Chaco: Análisis preliminar de patrones léxicos compartidos en el dominio etnobiológico. [The languages of the Gran Chaco from the perspective of lexical semantics: Preliminary analysis of shared lexical structures in the ethnobotanical domain]. LIAMES: Línguas Indígenas Americanas, 22: e022005. |
List, J.-M., Forkel, R., Greenhill, S. J., Rzymski, C., Englisch, J., & Gray, R. D. (2022). Lexibank, a public repository of standardized wordlists with computed phonological and lexical features. Scientific Data, 9: 316. |
Tjuka, A., Forkel, R., & List, J.-M. (2022). Linking norms, ratings, and relations of words and concepts across multiple language varieties. Behavior Research Methods, 54, 864-884. |
Bodt, T. A., & List, J.-M. (2022). Reflex prediction: A case study of Western Kho-Bwa. Diachronica. |
Lai, Y., & List, J.-M. (2022). [Book review] Geoffrey Sampson: Voices from early China: The odes demystified. 445 pp. Newcastle upon Tyne: Cambridge Scholars’ Press, 2020. £67.99. ISBN 978-1-5275-5212-8. Bulletin of the School of Oriental and African Studies, 85(1), 136-138. |
Hantgan, A., Babiker, H., & List, J.-M. (2022). First steps towards the detection of contact layers in Bangime: A multi-disciplinary, computer-assisted approach. Open Research Europe, 2: 10. |
Geisler, H., & List, J.-M. (2022). Of word families and language trees: New and old metaphors in studies on language history. Moderna, 24(1-2), 134-148. |
Jackson, J., Watts, J., List, J.-M., Drabble, R., & Lindquist, K. (2022). From text to thought: How analyzing language can advance psychological science. Perspectives on Psychological Science. |
List, J.-M. (2022). Chances and challenges for quantitative approaches in Chinese historical phonology. Bulletin of Chinese Linguistics, 15, 131-143. |
List, J.-M. (2022). Correcting a bias in TIGER rates resulting from high amounts of invariant and singleton cognate sets. Journal of Language Evolution, 7(1), 53-58. |
List, J.-M., Forkel, R., & Hill, N. (2022). A new framework for fast automated phonological reconstruction using trimmed alignments and sound correspondence patterns. In N. Tahmasebi, S. Montariol, A. Kutuzov, S. Hengchen, H. Dubossarsky, & L. Borin ( |
List, J.-M., Vylomova, E., Forkel, R., Hill, N. W., & Cotterell, R. D. (2022). The SIGTYP 2022 shared task on the prediction of cognate reflexes. In E. Vylomova, E. Ponti, & R. Cotterell ( |
Tresoldi, T., Rzymski, C., Forkel, R., Greenhill, S. J., List, J.-M., & Gray, R. D. (2022). Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison. In A. L. Berez-Kroeker, B. McDonnell, & E. Koller ( |
Evans, C. L., Greenhill, S. J., Watts, J., List, J.-M., Botero, C. A., Gray, R. D., & Kirby, K. (2021). The uses and abuses of tree thinking in cultural evolution. Philosophical Transactions of the Royal Society of London, Series B: Biological Sciences, 376(1828): 20200056. |
Geisler, H., Forkel, R., & List, J.-M. (2021). A digital, retro-standardized edition of the Tableaux Phonétiques des Patois Suisses Romands (TPPSR). In A. Thibault, M. Avanzi, & N. Lo Vecchio ( |
List, J.-M. (2021). Computer-assisted approaches to historical language comparison. Habilitation Thesis, Friedrich-Schiller-Universität, Jena. |
List, J.-M., & Forkel, R. (2021). Automated identification of borrowings in multilingual wordlists. Open Research Europe, 1: 79. |
List, J.-M., Sims, N. A., & Forkel, R. (2021). Toward a sustainable handling of interlinear-glossed text in language documentation. ACM Transactions on Asian and Low-Resource Language Information Processing, 20(2), 1-15. |
Miller, J. E., Tresoldi, T., Zariquiey, R., Castañón, C. A. B., Morozova, N., & List, J.-M. (2020). Using lexical language models to detect borrowings in monolingual wordlists. PLoS One, 15(12): e0242709. |
List, J.-M. (2020). Improving data handling and analysis in the study of rhyme patterns. Cahiers de linguistique Asie orientale, 49(1), 43-57. |
Schweikhard, N. E., & List, J.-M. (2020). Developing an annotation framework for word formation processes in comparative linguistics. SKASE Journal of Theoretical Linguistics, 17(1), 2-26. |
Wu, M.-S., Schweikhard, N. E., Bodt, T. A., Hill, N. W., & List, J.-M. (2020). Computer-Assisted Language Comparison: State of the Art. Journal of Open Humanities Data, 6(2). |
Power, J. M., Grimm, G. W., & List, J.-M. (2020). Evolutionary dynamics in the dispersal of sign languages. Royal Society Open Science, 7(1). |
Rzymski, C., Tresoldi, T., Greenhill, S. J., Wu, M.-S., Schweikhard, N. E., Koptjevskaja-Tamm, M., Gast, V., Bodt, T. A., Hantgan, A., Kaiping, G. A., Chang, S., Lai, Y., Morozova, N., Arjava, H., Hübler, N., Koile, E., Pepper, S., Proos, M., Epps, B. V., Blanco, I., Hundt, C., Monakhov, S., Pianykh, K., Ramesh, S., Gray, R. D., Forkel, R., & List, J.-M. (2020). The database of cross-linguistic colexifications, reproducible analysis of cross-linguistic polysemies. Scientific Data, 7: 13. |
Forkel, R., & List, J.-M. (2020). CLDFBench: Give your cross-linguistic data a lift. In N. Calzolari, F. Béchet, P. Blache, K. Choukri, C. Cieri, T. Declerck, S. Goggi, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, & S. Piperidis ( |
Jackson, J. C., Watts, J., Henry, T. R., List, J.-M., Forkel, R., Mucha, P. J., Greenhill, S. J., Gray, R. D., & Lindquist, K. A. (2019). Emotion semantics show both cultural variation and universal structure. Science, 366, 1517-1522. |
List, J.-M. (2019). Beyond edit distances: Comparing linguistic reconstruction systems. Theoretical Linguistics, 45(3-4), 247-258. |
List, J.-M. (2019). Automated methods for the investigation of language contact, with a focus on lexical borrowing. Language and Linguistics Compass, 13(10): e12355. |
Jacques, G., & List, J.-M. (2019). Save the trees: Why we need tree models in linguistic reconstruction (and when we should apply them). Journal of Historical Linguistics, 9(1), 128-166. |
Bodt, T. A., & List, J.-M. (2019). Testing the predictive strength of the comparative method: An ongoing experiment on unattested words in Western Kho-Bwa languages. Papers in Historical Phonology, 4, 22-44. |
Sagart, L., Jacques, G., Lai, Y., Ryder, R. J., Thouzeau, V., Greenhill, S. J., & List, J.-M. (2019). Dated language phylogenies shed light on the ancestry of Sino-Tibetan. Proceedings of the National Academy of Sciences of the United States of America, 116(21), 10317-10322. |
List, J.-M. (2019). Automatic inference of sound correspondence patterns across multiple languages. Computational Linguistics, 45(1), 137-161. |
Hill, N. W., & List, J.-M. (2019). Using Chinese character formation graphs to test proposals in Chinese historical phonology. Bulletin of Chinese Linguistics, 12(2), 186-200. |
List, J.-M., Hill, N. W., & Foster, C. J. (2019). Towards a standardized annotation of rhyme judgments in Chinese historical phonology (and beyond). Journal of Language Relationship, 17(1), 26-43. |
List, J.-M., Lai, Y., & Starostin, G. S. (2019). "Old Chinese and friends": New approaches to historical linguistics of the Sino-Tibetan area. Journal of Language Relationship, 17(1-2). |
Forkel, R., List, J.-M., Greenhill, S. J., Rzymski, C., Bank, S., Cysouw, M., Hammarström, H., Haspelmath, M., Kaiping, G. A., & Gray, R. D. (2018). Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics. Scientific Data, 5: 180205. |
List, J.-M., Walworth, M., Greenhill, S. J., Tresoldi, T., & Forkel, R. (2018). Sequence comparison in computational historical linguistics. Journal of Language Evolution, 3(2), 130-144. |
Rama, T., List, J.-M., Wahle, J., & Jäger, G. (2018). Are automatic methods for cognate detection good enough for phylogenetic reconstruction in historical linguistics? In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 393-400). Association for Computational Linguistics. |
Anderson, C., Tresoldi, T., Chacon, T., Fehn, A.-M., Walworth, M., Forkel, R., & List, J.-M. (2018). A cross-linguistic database of phonetic transcription systems. Yearbook of the Poznan Linguistic Meeting, 4(1), 21-53. |
Jäger, G., & List, J.-M. (2018). Using ancestral state reconstruction methods for onomasiological reconstruction in multilingual word lists. Language Dynamics and Change, 8(1), 22-54. |
List, J.-M., Greenhill, S. J., Anderson, C., Mayer, T., Tresoldi, T., & Forkel, R. (2018). CLICS2: An improved database of cross-linguistic colexifications assembling lexical data with the help of cross-linguistic data formats. Linguistic Typology, 22(2), 277-306. |
Hill, N. W., & List, J.-M. (2017). Challenges of annotation and analysis in computer-assisted language comparison: A case study on Burmish languages. Yearbook of the Poznan Linguistic Meeting, 3(1), 47-76. |
List, J.-M. (2017). A web-based interactive tool for creating, inspecting, editing, and publishing etymological datasets. In Association for Computational Linguistics (EACL) ( |
List, J.-M., Greenhill, S. J., & Gray, R. D. (2017). The potential of automatic word comparison for historical linguistics. PLoS One, 12(1): e0170046. |
List, J.-M. (2017). Contraction. In R. Sybesma ( |
List, J.-M. (2017). Fāngyán 方言. In R. Sybesma ( |
List, J.-M. (2017). Using network models to analyze Old Chinese rhyme data. Bulletin of Chinese Linguistics, 9(2), 218-241. |
List, J.-M., Pathmanathan, J. S., Hill, N. W., Bapteste, E., & Lopez, P. (2017). Vowel purity and rhyme evidence in Old Chinese reconstruction. Lingua sinica, 3: 5. |
List, J.-M., Pathmanathan, J. S., Lopez, P., & Bapteste, E. (2016). Unity and disunity in evolutionary sciences: Process-based analogies open common research avenues for biology and linguistics. Biology Direct, 11(1): 39. |
List, J.-M., Nelson-Sathi, S., Geisler, H., & Martin, W. (2014). Networks of lexical borrowing and lateral gene transfer in language and genome evolution. Bioessays, 36(2), 141-150. |
List, J.-M., Nelson-Sathi, S., Martin, W., & Geisler, H. (2014). Using phylogenetic networks to model Chinese dialect history. Language Dynamics and Change, 4(2), 222-252. |
List, J.-M. (2012). SCA: Phonetic alignment based on sound classes. In D. Lassiter, & M. Slavkovik ( |
Holman, E. W., Brown, C. H., Wichmann, S., Müller, A., Velupillai, V., Hammarström, H., Sauppe, S., Jung, H., Bakker, D., Brown, P., Belyaev, O., Urban, M., Mailhammer, R., List, J.-M., & Egorov, D. (2011). Automated dating of the world's language families based on lexical similarity. Current Anthropology, 52(6), 841-875. |
Nelson-Sathi, S., List, J.-M., Geisler, H., Fangerau, H., Gray, R. D., Martin, W., & Dagan, T. (2011). Networks uncover hidden lexical borrowing in Indo-European language evolution. Proceedings of the Royal Society B: Biological Sciences, 278(1713), 1794-1803. |
Wichmann, S., Holman, E. W., Müller, A., Velupillai, V., List, J.-M., Belyaev, O., Urban, M., & Bakker, D. (2010). Glottochronology as a heuristic for genealogical language relationships. Journal of Quantitative Linguistics, 17(4), 303-316. |