NLPExplorer
Tanja Samardzic
Number of Papers:- 30
Number of Citations:- 42
First ACL Paper:- 2010
Latest ACL Paper:- 2024
Venues:-
NAACL
CoNLL
COLING
BSNLP
EMNLP
VarDial
LREC
AmericasNLP
CL
ArabicNLP
WS
ACL
CAtoCL
EACL
LAW
CL4LC
LaTeCH
Findings-NAACL
Findings-EMNLP
Co-Authors:-
Ahmed Ali
Alexander Koplenig
Alexandros Lazaridis
Anastassia Shaitarova
Andrea Gesmundo
Similar Authors:-
Julio Gonzales
Synny Diwakar
Jochen Schopp
Mariana Kaiseler
Eddi Gbery
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2012
2010
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets
Findings-NAACL
Tanja Samardzic |
Ximena Gutierrez |
Christian Bentz |
Steven Moran |
Olga Pelloni |
System Description of the NordicsAlps Submission to the AmericasNLP 2024 Machine Translation Shared Task
AmericasNLP
WS
Joseph Attieh |
Zachary Hopton |
Yves Scherrer |
Tanja Samardžić |
NLP_DI at NADI 2024 shared task: Multi-label Arabic Dialect Classifications with an Unsupervised Cross-Encoder
ArabicNLP
WS
Vani Kanjirangat |
Tanja Samardzic |
Ljiljana Dolamic |
Fabio Rinaldi |
Languages Through the Looking Glass of BPE Compression
CL
Ximena Gutierrez-Vasques |
Christian Bentz |
Tanja Samardžić |
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions
ACL
Michel Plüss |
Jan Deriu |
Yanick Schraner |
Claudio Paonessa |
Julia Hartmann |
Larissa Schmidt |
Christian Scheller |
Manuela Hürlimann |
Tanja Samardžić |
Manfred Vogel |
Mark Cieliebak |
TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLP
LREC
Steven Moran |
Christian Bentz |
Ximena Gutierrez-Vasques |
Olga Pelloni |
Tanja Samardzic |
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
CoNLL
Tanja Samardžić |
Ximena Gutierrez-Vasques |
Rob van der Goot |
Max Müller-Eberstein |
Olga Pelloni |
Barbara Plank |
Subword Evenness (SuE) as a Predictor of Cross-lingual Transfer to Low-resource Languages
EMNLP
Olga Pelloni |
Anastassia Shaitarova |
Tanja Samardzic |
Early Guessing for Dialect Identification
Findings-EMNLP
Vani Kanjirangat |
Tanja Samardzic |
Fabio Rinaldi |
Ljiljana Dolamic |
From characters to words: the turning point of BPE merges
EACL
Ximena Gutierrez-Vasques |
Christian Bentz |
Olga Sozinova |
Tanja Samardzic |
Interpretability for Morphological Inflection: from Character-level Predictions to Subword-level Rules
EACL
Tatyana Ruzsics |
Olga Sozinova |
Ximena Gutierrez-Vasques |
Tanja Samardzic |
ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German
COLING
VarDial
Iuliia Nigmatulina |
Tannon Kew |
Tanja Samardzic |
A Swiss German Dictionary: Variation in Speech and Writing
LREC
Larissa Schmidt |
Lucy Linder |
Sandra Djambazovska |
Alexandros Lazaridis |
Tanja Samardžić |
Claudiu Musat |
A Report on the Third VarDial Evaluation Campaign
NAACL
WS
Marcos Zampieri |
Shervin Malmasi |
Yves Scherrer |
Tanja Samardžić |
Francis Tyers |
Miikka Silfverberg |
Natalia Klyueva |
Tung-Le Pan |
Chu-Ren Huang |
Radu Tudor Ionescu |
Andrei M. Butnaru |
Tommi Jauhiainen |
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign
COLING
VarDial
WS
Marcos Zampieri |
Shervin Malmasi |
Preslav Nakov |
Ahmed Ali |
Suwon Shon |
James Glass |
Yves Scherrer |
Tanja Samardžić |
Nikola Ljubešić |
Jörg Tiedemann |
Chris van der Lee |
Stefan Grondelaers |
Nelleke Oostdijk |
Dirk Speelman |
Antal van den Bosch |
Ritesh Kumar |
Bornini Lahiri |
Mayank Jain |
Encoder-Decoder Methods for Text Normalization
COLING
VarDial
WS
Massimo Lusetti |
Tatyana Ruzsics |
Anne Göhring |
Tanja Samardžić |
Elisabeth Stark |
Neural Sequence-to-sequence Learning of Internal Word Structure
CoNLL
Tatyana Ruzsics |
Tanja Samardžić |
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages
BSNLP
WS
Tanja Samardžić |
Mirjana Starović |
Željko Agić |
Nikola Ljubešić |
TweetGeo - A Tool for Collecting, Processing and Analysing Geo-encoded Linguistic Data
COLING
Nikola Ljubešić |
Tanja Samardžić |
Curdin Derungs |
ArchiMob - A Corpus of Spoken Swiss German
LREC
Tanja Samardžić |
Yves Scherrer |
Elvira Glaser |
A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora
LREC
Tanja Samardžić |
Maja Miličević |
A Comparison Between Morphological Complexity Measures: Typological Data vs. Language Corpora
CL4LC
WS
Christian Bentz |
Tatyana Ruzsics |
Alexander Koplenig |
Tanja Samardžić |
Automatic interlinear glossing as two-level sequence classification
LaTeCH
WS
Tanja Samardžić |
Robert Schikowski |
Sabine Stoll |
Regional Linguistic Data Initiative (ReLDI)
BSNLP
WS
Tanja Samardžić |
Nikola Ljubešić |
Maja Miličević |
Likelihood of External Causation in the Structure of Events
CAtoCL
WS
Tanja Samardžić |
Paola Merlo |
Part-of-Speech Tag Disambiguation by Cross-Linguistic Majority Vote
VarDial
WS
Noëmi Aepli |
Ruprecht von Waldenfels |
Tanja Samardžić |
Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification
LREC
Andrea Gesmundo |
Tanja Samardžić |
Lemmatisation as a Tagging Task
ACL
Andrea Gesmundo |
Tanja Samardžić |
Cross-Lingual Validity of PropBank in the Manual Annotation of French
LAW
WS
Lonneke van der Plas |
Tanja Samardžić |
Paola Merlo |
Cross-Lingual Variation of Light Verb Constructions: Using Parallel Corpora and Automatic Alignment for Linguistic Research
WS
Tanja Samardžić |
Paola Merlo |