home edit page issue tracker

This page pertains to UD version 2.

Universal Dependencies

Universal Dependencies (UD) is a framework for cross-linguistically consistent grammatical annotation and an open community effort with over 200 contributors producing almost 100 treebanks in over 50 languages.

If you want to receive news about Universal Dependencies, you can subscribe to the UD mailing list.

Current UD Languages

Information about language families (and genera for families with multiple branches) is normally taken from WALS Online (IE = Indo-European).

Afrikaans 1 49K IE, Germanic

Afrikaans treebanks

Original 49K
Please add a summary section to the treebank readme file
  • Contributors: Peter Dirix, Liesbeth Augustinus, Daniel van Niekerk
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Ancient Greek 2 414K IE, Greek

Ancient Greek treebanks

PROIEL 211K
Please add a summary section to the treebank readme file

 

Original 202K
Please add a summary section to the treebank readme file
  • Contributors: Giuseppe G. A. Celano, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Arabic 3 1,042K Afro-Asiatic, Semitic

Arabic treebanks

NYUAD 738K
Please add a summary section to the treebank readme file

 

Original 282K
Please add a summary section to the treebank readme file
  • Contributors: Daniel Zeman, Zdeněk Žabokrtský, Shadi Saleh
  • Repository master dev
  • README

 

PUD 20K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Luma Ateyah, Martin Popel, Daniel Zeman, Nizar Habash, Dima Taji
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Basque 1 121K Basque

Basque treebanks

Original 121K
Please add a summary section to the treebank readme file
  • Contributors: Maria Jesus Aranzabe, Aitziber Atutxa, Kepa Bengoetxea, Arantza Diaz de Ilarraza, Iakes Goenaga, Koldo Gojenola, Larraitz Uria
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Belarusian 1 8K IE, Slavic

Belarusian treebanks

Original 8K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Bulgarian 1 156K IE, Slavic

Bulgarian treebanks

Original 156K
Please add a summary section to the treebank readme file
  • Contributors: Kiril Simov, Petya Osenova, Martin Popel
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Buryat 1 10K Altaic

Buryat treebanks

Original 10K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Catalan 1 531K IE, Romance

Catalan treebanks

Original 531K ?
Please add a summary section to the treebank readme file
  • Contributors: Héctor Martínez Alonso, Elena Pascual, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Chinese 4 146K Sino-Tibetan

Chinese treebanks

Original 123K
Please add a summary section to the treebank readme file
  • Contributors: Mo Shen, Ryan McDonald, Daniel Zeman
  • Repository master dev
  • README

 

PUD 21K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Josie Li, Cheuk Ying Li, Martin Popel, Daniel Zeman, Herman Leung
  • Repository master dev
  • README

 

HK 1K
Please add a summary section to the treebank readme file
  • Contributors: Kim Gerdes, John Lee, Herman Leung, Tak-sum Wong
  • Repository master dev
  • README

 

CFL 0K
Please add a summary section to the treebank readme file
  • Contributors: John Lee, Herman Leung, Keying Li
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Coptic 1 11K Afro-Asiatic, Egyptian

Coptic treebanks

Original 11K
Please add a summary section to the treebank readme file
  • Contributors: Elizabeth Davidson, Amir Zeldes
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Croatian 1 197K IE, Slavic

Croatian treebanks

Original 197K
Please add a summary section to the treebank readme file
  • Contributors: Željko Agić, Nikola Ljubešić, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Czech 5 2,055K IE, Slavic

Czech treebanks

Original 1,506K ?
The Czech UD treebank is based on the Prague Dependency Treebank 3.0 (PDT), created at the Charles University in Prague.

 

CAC 494K ?
Please add a summary section to the treebank readme file

 

CLTT 35K ?
Please add a summary section to the treebank readme file
  • Contributors: Barbora Hladká, Daniel Zeman, Martin Popel
  • Repository master dev
  • README

 

PUD 18K ?
Please add a summary section to the treebank readme file
  • Contributors: Václava Kettnerová, Jan Hajič jr., Silvie Cinková, Zdeňka Urešová, Milan Straka, Jan Hajič, Jaroslava Hlaváčová, Daniel Zeman
  • Repository master dev
  • README

 

FicTree 0K ?
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Danish 1 100K IE, Germanic

Danish treebanks

Original 100K
Please add a summary section to the treebank readme file
  • Contributors: Anders Johannsen, Héctor Martínez Alonso, Barbara Plank
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Dutch 2 310K IE, Germanic

Dutch treebanks

Original 208K ?
Please add a summary section to the treebank readme file
  • Contributors: Daniel Zeman, Zdeněk Žabokrtský, Gosse Bouma, Gertjan van Noord
  • Repository master dev
  • README

 

LassySmall 101K
Please add a summary section to the treebank readme file
  • Contributors: Gosse Bouma, Gertjan van Noord
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
English 5 496K IE, Germanic

English treebanks

Original 254K
Please add a summary section to the treebank readme file
  • Contributors: Natalia Silveira, Timothy Dozat, Christopher Manning, Sebastian Schuster, John Bauer, Miriam Connor, Marie-Catherine de Marneffe, Sam Bowman, Hanzhi Zhu, Daniel Galbraith
  • Repository master dev
  • README

 

ESL 88K
Please add a summary section to the treebank readme file
  • Contributors: Yevgeni Berzak, Jessica Kenney, Carolyn Spadine, Jing Xian Wang, Lucia Lam, Keiko Sophie Mori, Sebastian Garza, Boris Katz
  • Repository master dev
  • README

 

LinES 82K
Please add a summary section to the treebank readme file

 

ParTUT 49K
Please add a summary section to the treebank readme file
  • Contributors: Cristina Bosco, Manuela Sanguinetti
  • Repository master dev
  • README

 

PUD 21K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Jesse Kirchner, Lorenzo Lambertino, Martin Popel, Daniel Zeman, Christopher Manning, Sebastian Schuster, Siva Reddy
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Estonian 1 106K Uralic, Finnic

Estonian treebanks

Original 106K
Please add a summary section to the treebank readme file
  • Contributors: Kadri Muischnek, Kaili Müürisep, Tiina Puolakainen
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Finnish 3 377K Uralic, Finnic

Finnish treebanks

Original 202K ?
UD_Finnish is based on the Turku Dependency Treebank (TDT), a broad-coverage dependency treebank of general Finnish covering numerous genres. The conversion to UD was followed by extensive manual checks and corrections, and the treebank closely adheres to the UD guidelines.
  • Contributors: Filip Ginter, Jenna Kanerva, Veronika Laippala, Anna Missilä, Stina Ojala, Sampo Pyysalo
  • Repository master dev
  • README

 

FTB 159K
Please add a summary section to the treebank readme file
  • Contributors: Jussi Piitulainen, Hanna Nurmi
  • Repository master dev
  • README

 

PUD 15K
Please add a summary section to the treebank readme file
  • Contributors: Jenna Kanerva, Filip Ginter, Stina Ojala, Anna Missilä
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
French 6 1,099K IE, Romance

French treebanks

FTB 573K
Please add a summary section to the treebank readme file
  • Contributors: Marie Candito, Bruno Guillaume, Teresa Lynn, Héctor Martínez Alonso, Benoit Sagot, Djamé Seddah, Eric de la Clergerie
  • Repository master dev
  • README

 

Original 402K
Please add a summary section to the treebank readme file
  • Contributors: Marie-Catherine de Marneffe, Bruno Guillaume, Ryan McDonald, Alane Suhr, Joakim Nivre, Matias Grioni
  • Repository master dev
  • README

 

Sequoia 70K
Please add a summary section to the treebank readme file
  • Contributors: Marie Candito, Djamé Seddah, Guy Perrier, Bruno Guillaume
  • Repository master dev
  • README

 

ParTUT 28K
Please add a summary section to the treebank readme file
  • Contributors: Cristina Bosco, Manuela Sanguinetti
  • Repository master dev
  • README

 

PUD 24K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Jana Strnadová, Gauthier Caron, Martin Popel, Daniel Zeman, Marie-Catherine de Marneffe
  • Repository master dev
  • README

 

Spoken 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Galician 2 164K IE, Romance

Galician treebanks

Original 138K
Please add a summary section to the treebank readme file

 

TreeGal 25K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
German 2 313K IE, Germanic

German treebanks

Original 292K ?
Please add a summary section to the treebank readme file
  • Contributors: Slav Petrov, Wolfgang Seeker, Ryan McDonald, Joakim Nivre, Daniel Zeman
  • Repository master dev
  • README

 

PUD 21K ?
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Georg Rehm, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Sebastian Bank, Martin Popel, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Gothic 1 55K IE, Germanic

Gothic treebanks

Original 55K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Greek 1 63K IE, Greek

Greek treebanks

Original 63K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Hebrew 1 161K Afro-Asiatic, Semitic

Hebrew treebanks

Original 161K
Please add a summary section to the treebank readme file
  • Contributors: Yoav Goldberg, Reut Tsarfaty, Amir More
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Hindi 2 375K IE, Indic

Hindi treebanks

Original 351K
Please add a summary section to the treebank readme file
  • Contributors: Riyaz Ahmad Bhat, Daniel Zeman
  • Repository master dev
  • README

 

PUD 23K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Esha Banerjee, Pinkey Nainwani, Martin Popel, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Hungarian 1 42K Uralic, Ugric

Hungarian treebanks

Original 42K
Please add a summary section to the treebank readme file
  • Contributors: Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Indonesian 2 147K Austronesian

Indonesian treebanks

Original 121K
Please add a summary section to the treebank readme file
  • Contributors: Ryan McDonald, Joakim Nivre, Daniel Zeman
  • Repository master dev
  • README

 

PUD 25K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Ruli Manurung, Muh Shohibussirri, Martin Popel, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Irish 1 23K IE, Celtic

Irish treebanks

Original 23K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Italian 4 426K IE, Romance

Italian treebanks

Original 282K
Please add a summary section to the treebank readme file
  • Contributors: Cristina Bosco, Alessandro Lenci, Simonetta Montemagni, Maria Simi
  • Repository master dev
  • README

 

PoSTWITA 64K
Please add a summary section to the treebank readme file
  • Contributors: Manuela Sanguinetti, Cristina Bosco
  • Repository master dev
  • README

 

ParTUT 55K
Please add a summary section to the treebank readme file
  • Contributors: Cristina Bosco, Manuela Sanguinetti
  • Repository master dev
  • README

 

PUD 23K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Antonio Stella, Davide Rovati, Martin Popel, Daniel Zeman, Maria Simi, Manuela Sanguinetti
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Japanese 3 402K Japanese

Japanese treebanks

KTC 189K
Please add a summary section to the treebank readme file
  • Contributors: Masayuki Asahara, Hiroshi Kanayama, Yuji Matsumoto, Yusuke Miyao, Shunsuke Mori, Takaaki Tanaka, Sumire Uematsu
  • Repository master dev
  • README

 

Original 186K
Please add a summary section to the treebank readme file
  • Contributors: Ryan McDonald, Joakim Nivre, Daniel Zeman, Masayuki Asahara, Hiroshi Kanayama, Yuji Matsumoto, Yusuke Miyao, Shunsuke Mori, Takaaki Tanaka, Sumire Uematsu
  • Repository master dev
  • README

 

PUD 26K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Atsuko Shimada, Anna Trukhina, Martin Popel, Daniel Zeman, Hiroshi Kanayama
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Kazakh 1 10K Turkic

Kazakh treebanks

Original 10K
Please add a summary section to the treebank readme file
  • Contributors: Aibek Makazhanov, Jonathan North Washington, Francis Tyers
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Korean 2 86K Korean

Korean treebanks

Original 63K
Please add a summary section to the treebank readme file
  • Contributors: Ryan McDonald, Joakim Nivre, Daniel Zeman, Jinho Choi
  • Repository master dev
  • README

 

PUD 22K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Sookyoung Kwak, Yongseok Cho, Martin Popel, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Kurmanji 1 10K IE, Iranian

Kurmanji treebanks

Original 10K
Please add a summary section to the treebank readme file
  • Contributors: Memduh Gökırmak, Francis Tyers
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Latin 3 491K IE, Latin

Latin treebanks

ITTB 291K
Please add a summary section to the treebank readme file
  • Contributors: Marco Passarotti, Daniel Zeman, Berta Gonzáles Saavedra
  • Repository master dev
  • README

 

PROIEL 171K
Please add a summary section to the treebank readme file

 

Original 29K
Please add a summary section to the treebank readme file
  • Contributors: Giuseppe G. A. Celano, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Latvian 1 90K IE, Baltic

Latvian treebanks

Original 90K
Please add a summary section to the treebank readme file
  • Contributors: Lauma Pretkalniņa, Laura Rituma, Baiba Saulīte, Gunta Nešpore-Bērzkalne, Normunds Grūzītis
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Lithuanian 2 5K IE, Baltic

Lithuanian treebanks

Original 5K
Please add a summary section to the treebank readme file
  • Contributors: Olga Lyashevskaya, Dmitry Sichinava
  • Repository master dev
  • README

 

Alksnis 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Marathi 1 3K IE, Indic

Marathi treebanks

Original 3K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
North Sami 1 10K Uralic, Sami

North Sami treebanks

Original 10K
Please add a summary section to the treebank readme file
  • Contributors: Trond Trosterud, Lene Antonsen, Francis Tyers
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Norwegian 3 625K IE, Germanic

Norwegian treebanks

Bokmaal 310K
Please add a summary section to the treebank readme file
  • Contributors: Lilja Øvrelid, Fredrik Jørgensen
  • Repository master dev
  • README

 

Nynorsk 301K
Please add a summary section to the treebank readme file
  • Contributors: Lilja Øvrelid, Fredrik Jørgensen, Petter Hohle
  • Repository master dev
  • README

 

NynorskLIA 13K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Old Church Slavonic 1 57K IE, Slavic

Old Church Slavonic treebanks

Original 57K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Persian 1 152K IE, Iranian

Persian treebanks

Original 152K
Please add a summary section to the treebank readme file
  • Contributors: Mojgan Seraji, Filip Ginter, Joakim Nivre
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Polish 1 83K IE, Slavic

Polish treebanks

Original 83K ?
Please add a summary section to the treebank readme file
  • Contributors: Daniel Zeman, Jan Mašek, Rudolf Rosa
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Portuguese 3 570K IE, Romance

Portuguese treebanks

BR 319K
Please add a summary section to the treebank readme file
  • Contributors: Ryan McDonald, Joakim Nivre, Daniel Zeman
  • Repository master dev
  • README

 

Original 227K
Please add a summary section to the treebank readme file
  • Contributors: Cláudia Freitas, Eckhard Bick, Fabricio Chalub, Alexandre Rademaker, Livy Real, Valeria de Paiva, Daniel Zeman, Martin Popel, David Mareček, Natalia Silveira, André Martins
  • Repository master dev
  • README

 

PUD 23K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Gustavo Mendonça, Larissa Rinaldi, Martin Popel, Daniel Zeman, Valeria de Paiva
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Romanian 1 218K IE, Romance

Romanian treebanks

Original 218K
Please add a summary section to the treebank readme file
  • Contributors: Verginica Mititelu, Elena Irimia, Cenel-Augusto Perez, Radu Ion, Radu Simionescu, Cătălina Mărănduc, Martin Popel
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Russian 3 1,226K IE, Slavic

Russian treebanks

SynTagRus 1,107K
Please add a summary section to the treebank readme file
  • Contributors: Kira Droganova, Olga Lyashevskaya, Daniel Zeman, Lena Shakurova, Nina Mustafina
  • Repository master dev
  • README

 

Original 99K
Please add a summary section to the treebank readme file
  • Contributors: Ryan McDonald, Vitaly Nikolaev, Olga Lyashevskaya
  • Repository master dev
  • README

 

PUD 19K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Tatiana Lando, Olga Loginova, Martin Popel, Daniel Zeman, Kira Droganova
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Sanskrit 1 1K IE, Indic

Sanskrit treebanks

Original 1K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Serbian 1 86K IE, Slavic

Serbian treebanks

Original 86K
Please add a summary section to the treebank readme file
  • Contributors: Tanja Samardžić, Nikola Ljubešić
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Slovak 1 106K IE, Slavic

Slovak treebanks

Original 106K ?
Please add a summary section to the treebank readme file
  • Contributors: Katarína Gajdošová, Mária Šimková, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Slovenian 2 170K IE, Slavic

Slovenian treebanks

Original 140K
Please add a summary section to the treebank readme file
  • Contributors: Kaja Dobrovoljc, Tomaž Erjavec, Simon Krek
  • Repository master dev
  • README

 

SST 29K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Spanish 3 1,004K IE, Romance

Spanish treebanks

AnCora 549K ?
Please add a summary section to the treebank readme file
  • Contributors: Héctor Martínez Alonso, Daniel Zeman
  • Repository master dev
  • README

 

Original 431K ?
Please add a summary section to the treebank readme file
  • Contributors: Miguel Ballesteros, Héctor Martínez Alonso, Ryan McDonald, Elena Pascual, Natalia Silveira, Daniel Zeman, Joakim Nivre
  • Repository master dev
  • README

 

PUD 23K ?
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Hector Fernandez Alcalde, Laura Moreno Romero, Martin Popel, Daniel Zeman, Héctor Martínez Alonso
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Swedish 3 195K IE, Germanic

Swedish treebanks

Original 96K ?
UD Swedish-TP is a conversion of the Prose section of Talbanken, originally annotated in the MAMBA annotation scheme, and consisting of a variety of informative text genres, including textbooks, information brochures and newspaper articles.

 

LinES 79K
Please add a summary section to the treebank readme file

 

PUD 19K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Swedish Sign Language 1 1K Sign Language

Swedish Sign Language treebanks

Original 1K
Please add a summary section to the treebank readme file
  • Contributors: Moa Gärdenfors, Carl Börstell, Robert Östling, Lars Wallin, Mats Wirén
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Tamil 1 9K Dravidian

Tamil treebanks

Original 9K
Please add a summary section to the treebank readme file
  • Contributors: Loganathan Ramasamy, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Thai 1 23K Tai-Kadai

Thai treebanks

PUD 23K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Rattima Nitisaroj, Yanin Sawanakunanon, Martin Popel, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Turkish 2 74K Turkic

Turkish treebanks

Original 58K
Please add a summary section to the treebank readme file
  • Contributors: Çağrı Çöltekin, Gülşen Cebiroğlu Eryiğit, Memduh Gökırmak, Hüner Kaşıkara, Umut Sulubacak, Francis Tyers
  • Repository master dev
  • README

 

PUD 16K
Please add a summary section to the treebank readme file
  • Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Savas Cetin, Martin Popel, Daniel Zeman, Francis Tyers, Çağrı Çöltekin
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Ukrainian 1 100K IE, Slavic

Ukrainian treebanks

Original 100K
Please add a summary section to the treebank readme file
  • Contributors: Natalia Kotsyba, Bohdan Moskalevskyi
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Upper Sorbian 1 10K IE, Slavic

Upper Sorbian treebanks

Original 10K ?
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Urdu 1 138K IE, Indic

Urdu treebanks

Original 138K
Please add a summary section to the treebank readme file
  • Contributors: Riyaz Ahmad Bhat, Daniel Zeman
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Uyghur 1 15K Turkic

Uyghur treebanks

Original 15K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Vietnamese 1 43K Austro-Asiatic

Vietnamese treebanks

Original 43K
Please add a summary section to the treebank readme file
  • Contributors: Lương Nguyễn Thị, Linh Hà Mỹ, Phương Lê Hồng, Huyền Nguyễn Thị Minh
  • Repository master dev
  • README

 

Language documentation

Some language documentation.

Upcoming UD Languages

Amharic 1 0K Afro-Asiatic, Semitic

Amharic treebanks

Original 0K ? ?
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Armenian 1 0K IE, Armenian

Armenian treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Bangla 1 0K IE, Indic

Bangla treebanks

Original 0K
Please add a summary section to the treebank readme file
  • Contributors: Siratun Jannat, Mizanur Rahoman, Shafi Sourov, Jannatul Ferdaousi, Syeda Shahzadi
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Bengali 1 0K IE, Indic

Bengali treebanks

DDS 0K ?
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Cantonese 1 0K Sino-Tibetan

Cantonese treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Dargwa 1 0K Nakho-Dagestanian

Dargwa treebanks

Original 0K
Please add a summary section to the treebank readme file
  • Contributors: Sasha Kozhukhar, Olga Lyashevskaya
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Erzya 1 0K Uralic, Mordvin

Erzya treebanks

Original 0K ?
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Faroese 1 0K IE, Germanic

Faroese treebanks

Original 0K
Please add a summary section to the treebank readme file
  • Contributors: Daniel Zeman, Bjartur Mortensen, Francis Tyers
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Maltese 1 0K Afro-Asiatic, Semitic

Maltese treebanks

Original 0K
Please add a summary section to the treebank readme file
  • Contributors: Daniel Zeman, Martin Popel, Mike Rosner, Vinit Ravishankar
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Naija 1 0K Creole

Naija treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Old French 1 0K IE, Romance

Old French treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Romansh 2 0K IE, Romance

Romansh treebanks

Original 0K
Please add a summary section to the treebank readme file
  • Contributors: Sascha Brawer, Martin Cantieni
  • Repository master dev
  • README

 

Sursilv 0K
Please add a summary section to the treebank readme file
  • Contributors: Sascha Brawer, Martin Cantieni
  • Repository master dev
  • README

 

Language documentation

Some language documentation.
Somali 1 0K Afro-Asiatic, Cushitic

Somali treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Sorani 1 0K IE, Iranian

Sorani treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.
Telugu 1 0K Dravidian

Telugu treebanks

Original 0K
Please add a summary section to the treebank readme file

 

Language documentation

Some language documentation.

Disclaimer: Our use of flags to symbolise languages is only intended as a visual enhancement of the website and should not be interpreted as a political statement in any way.

Download

The data is released through LINDAT/CLARIN.